This is a tutorial on how to use Docker Desktop
to deploy a trained machine learning model. There are many other use cases for Docker, e.g. deploying Shiny apps to containers, website deployment, web service deployment, database deployment, or full-scale application deployment.
The session was part of an NHS-R Community workshop.
My tutorial will focus on the following:
- Introduction to Docker
- Docker Desktop
- Training an ML model, serialising it, and deploying it to a Docker container
- Creating a Dockerfile and the Plumber API endpoints
- Using CMD to interact with the service (Windows focus)
- Exposing the Swagger API and using R to connect to it and send predictions to the trained ML model
- Interfacing with our API using httr and jsonlite
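To give a feel for the endpoint side of this, a minimal Plumber script follows the shape below. This is an illustrative sketch, not the exact contents of StrandedPlumberAPIHC.R: the route paths, field names, and health-check function here are assumptions.

```r
# Sketch of a Plumber endpoint file (illustrative; see the repository
# for the real StrandedPlumberAPIHC.R)
library(plumber)

# Load the serialised caret model once, when the API starts
model <- readRDS("model/tb_model.rds")

#* Simple health check (hypothetical route)
#* @get /health
function() {
  list(status = "API is running")
}

#* Score unseen patients with the trained model (hypothetical route)
#* @post /predict
function(req) {
  # The request body arrives as JSON; convert it to a data frame
  input <- jsonlite::fromJSON(req$postBody)
  list(
    prob  = predict(model, newdata = input, type = "prob"),
    class = predict(model, newdata = input)
  )
}
```

Plumber's `#*` comments define the routes, and the `@get`/`@post` annotations are what the Swagger page is generated from.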
The following links send you straight to the content:
- Presentation from the day
- ML Model training script - this trains a model in caret and serialises it to the model folder in the Rocker_Deployment_R folder
- Rocker Deployment Folder - this contains all the code needed to create the Docker image: our Plumber endpoint scripts, our GET and POST functions in StrandedPlumberAPIHC.R, the OpenAPI YAML file needed for setting up the structure of the Swagger endpoint, the serialised model (tb_model.rds) and, most importantly, the Dockerfile.
- Using PowerShell to deploy the model - this shows how to use the command line to deploy the model in a couple of easy steps
- Accessing our API once deployed - this folder contains a script that takes unseen data (patients the model was not trained on) and uses our API to pass it through the trained model; the predictions come back and are bound onto the production data as probability estimates and class labels. The script converts a data frame into a JSON object to send to the API; the API returns JSON, which is then converted back to a data.frame. A bit of R and JSON magic!
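As a rough sketch, a Dockerfile for this kind of Plumber deployment usually follows the pattern below. The base image, package list, and port are assumptions here; check the Dockerfile in the Rocker_Deployment_R folder for the actual file.

```dockerfile
# Base image with R and plumber preinstalled (assumed here)
FROM rstudio/plumber

# Extra packages the endpoint script depends on
RUN R -e "install.packages(c('caret', 'jsonlite'))"

# Copy the serialised model and the endpoint script into the image
COPY model/tb_model.rds /app/model/tb_model.rds
COPY StrandedPlumberAPIHC.R /app/StrandedPlumberAPIHC.R

WORKDIR /app
EXPOSE 8000

# The rstudio/plumber image serves whichever plumber file CMD points at
CMD ["/app/StrandedPlumberAPIHC.R"]
```

From PowerShell or CMD you would then build and run it with something like `docker build -t stranded-api .` followed by `docker run -p 8000:8000 stranded-api` (the image tag is a placeholder), after which the Swagger page should be reachable on localhost.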
This tutorial originally appeared on my YouTube channel. The links to the relevant videos and blogs are below:
- Deploying a caret machine learning model as an API with Plumber - this shows how to create the ML model, the Swagger endpoint, the endpoint files needed, and the OpenAPI.yaml file
- Creating a classification model from scratch with TidyModels - this shows an alternative approach, replacing caret with tidymodels.
- Assessing a classification model with ConfusionTableR and outputting the matrix to a database - this shows how to take R's confusion matrix object and then store the results in a database with ConfusionTableR.
- Deploying our model to Docker - this steps you through how to create the Dockerfile, gather everything into a docker folder for deployment, deploy to Docker with PowerShell / CMD, and then consume the endpoint with Swagger and JSON - making the model platform agnostic.
- Accessing the API and making predictions - this shows how to use the Swagger API to make predictions on production / unseen data and return the results to R as JSON, which is then converted back to a data frame.
- Full article taking you through model training and deploying our model to Docker - this is a link to the full article on my website.
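The httr and jsonlite round trip described above can be sketched like this. The endpoint URL and column names are placeholders, not the real schema of the deployed service:

```r
library(httr)
library(jsonlite)

# Unseen production data as a data frame
# (hypothetical columns; they must match what the model was trained on)
unseen <- data.frame(age = c(54, 81), prior_admissions = c(0, 3))

# data.frame -> JSON
body_json <- toJSON(unseen)

# POST the JSON to the container's predict endpoint (placeholder URL)
resp <- POST("http://localhost:8000/predict",
             body = body_json,
             content_type("application/json"))

# JSON response -> data.frame, then bind the probability estimates
# and class labels back onto the original production rows
preds  <- fromJSON(content(resp, as = "text", encoding = "UTF-8"))
scored <- cbind(unseen, preds)
```

The same request can be tried interactively from the Swagger page first, which is a quick way to confirm the JSON shape the API expects before wiring it into a script.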
I have been doing a lot with MLOps recently and have some practical tips for scaling the model up beyond this fully open-source solution, so please drop me a line if you need any help.