Characteristics of ML CI/CD Pipelines

mediumThis post was originally published by at Medium [AI]

In this blog post, we will be going through the components of building and maintaining CI/CD pipelines for Machine Learning.

 
Photo by Arnold Francisca on Unsplash

[a] Build Phase / Pipeline which results in creation of ML artifact

1. Build the artifact
2. Persist the artifact
3. Sanity check / Smoke testing
4. Generate explainability report

[b] Deploy to test environment

1. Manual Validation of artifact
2. Execution of performance tests (computational, validation, etc)

[c] Deploy to Production environment

1.  Canary or blue-green deployment
2. Full deployment
3. Release deployment

ML artifact

An ML artifact is comprised on the following:

  • Hyperparameters and configurations
  • Trained runnable model
  • Environment variables (libraries, versions, environment variables etc)
  • Documentation
  • Code and data for validation

Considerations for Test Deployment

When deploying a model in test environments, ensure completeness of test cases. Additionally, the test cases should not only provide coverage of validation but also should be able to identify the source of failures.

Considerations for Production Deployments and Release

When deploying in production, we need to consider a few basic permutations

  1. Single model, Single version deployed on Multiple servers
  2. Single model, Multiple versions deployed on Single server
  3. Single model, Multiple versions deployed on Multiple Servers
  4. Multiple models, Multiple versions deployed on Multiple Servers

Spread the word

This post was originally published by at Medium [AI]

Related posts