[1] "species" "island" "bill_length_mm"
[4] "bill_depth_mm" "flipper_length_mm" "body_mass_g"
[7] "sex" "year"
Data Scientist @ Jumping Rivers:
Python & R support for clients.
Teach courses in programming, SQL, ML.
Enjoy the outdoors & travel.
Organise North East & Leeds data science meetups.
Beginners guide to MLOps
Using free / open source software (as much as possible)
Walk through the steps of building and deploying a model
MLOps tips & tricks
↗ jumpingrivers.com 𝕏 @jumping_uk
Formerly RStudio
JR is an official partner and assists clients with:
Posit maintains free and open source frameworks including:
Compatible with R, Python and more!
Palmer Penguin dataset
Using {tidyr} and {rsample}:
species:Convert our {tidymodels} model to a {vetiver} model:
Contains all the info needed to version, store and deploy our model!
Retrieve a model
Inspect the stored versions
We deploy models as APIs which take input data and send back model predictions.
APIs can be hosted at public endpoints on the web.
We can run them on the localhost (during testing / development).
{vetiver} uses {plumber} to create a model API.
Our Dockerfile contains a series of commands to:
Install the system libraries (Windows|Mac|Linux).
Set the R version and install the required R packages.
Run the API in the deployment environment.
Use automated CI/CD to build the API in the cloud environment.
Vetiver is available for both Python and R!
In Python you would use Python ML libraries rather than {tidymodels}
Vetiver documentation: vetiver.posit.co
Retraining and redeployment can happen at the click of a button.
Encourages good practices like model versioning.
Reduces human error.
Well defined and reproducible.
Consider whether it is worth the cost/effort before starting.