Glossary

Machine Learning Operations (MLOps)

Discover the power of MLOps: streamline ML model deployment, automate workflows, ensure reliability, and scale AI success efficiently.

Machine Learning Operations (MLOps) is a set of practices that aims to deploy and maintain Machine Learning (ML) models in production reliably and efficiently. Drawing inspiration from DevOps principles, MLOps applies similar concepts to the entire AI model lifecycle, from data gathering and model training to deployment and monitoring. The primary goal is to automate and streamline the processes involved in taking an ML model from a research prototype to a robust, scalable production application. This ensures that models not only perform well initially but also remain effective over time as new data becomes available.

Real-World Applications

MLOps practices are essential for managing complex ML systems in production environments.

Recommendation Systems: Companies like Netflix or Spotify use MLOps to continuously retrain their recommendation system models based on new user interaction data. MLOps pipelines enable them to A/B test different model versions, monitor engagement metrics, and quickly roll back underperforming models, ensuring recommendations stay fresh and personalized.
Fraud Detection: Financial institutions deploy MLOps to manage fraud detection models. This involves monitoring transaction data for new patterns of fraudulent activity, automatically retraining models with new data, ensuring low inference latency for real-time detection, and maintaining audit trails for regulatory compliance. Ultralytics YOLO models used in visual inspection systems, which can feed into fraud detection, also benefit from MLOps for deployment and monitoring on edge devices.

Tools and Platforms

A variety of tools support different stages of the MLOps lifecycle, enabling teams to build efficient and scalable workflows.

Experiment Tracking & Versioning: MLflow, Weights & Biases, and Data Version Control (DVC) are popular for tracking experiments and versioning data and models. Ultralytics offers native integration with many of these tools, including Weights & Biases.
Workflow Orchestration: Tools like Kubeflow Pipelines and Apache Airflow help automate and manage complex ML workflows.
Model Serving: For deploying models at scale, platforms like KServe, BentoML, and NVIDIA Triton Inference Server are widely used. Ultralytics models can be exported to formats like ONNX for compatibility with these servers.
Monitoring: Grafana and Prometheus are often used to create dashboards for monitoring model performance and system health.
End-to-End Platforms: Comprehensive platforms like Amazon SageMaker, Google Cloud AI Platform, Microsoft Azure Machine Learning, and Ultralytics HUB provide integrated environments that cover most, if not all, MLOps stages, from data management to model deployment and monitoring. These platforms leverage cloud computing to provide scalable resources for training custom models.

Machine Learning Operations (MLOps)

Flexible enterprise licensing solution to power your innovation

Train AI models in seconds with Ultralytics YOLO

Train YOLO models simply with Ultralytics HUB

Real-World Applications

Tools and Platforms

Read more in this category

Manufacturing ERP Guide

Manufacturing execution system (MES): AI-driven production

Understanding additive manufacturing: Technology & use cases

Join the Ultralytics community

Machine Learning Operations (MLOps)

Flexible enterprise licensing solution to power your innovation

Train AI models in seconds with Ultralytics YOLO

Train YOLO models simply with Ultralytics HUB

MLOps Vs. Related Concepts

Real-World Applications

Tools and Platforms

Read more in this category

Manufacturing ERP Guide

Manufacturing execution system (MES): AI-driven production

Understanding additive manufacturing: Technology & use cases

Join the Ultralytics community