Tune in to YOLO Vision 2024!
Optimize and deploy YOLOv5 models with Deci's platform, enhancing performance by up to 10x. Get started for free and leverage automatic model optimization.
At Ultralytics we commercially partner with other startups to help us fund the research and development of our awesome open-source tools, like YOLOv5, to keep them free for everybody. This article may contain affiliate links to those partners.
The Deci platform includes free tools for easily managing, optimizing, and deploying your YOLOv5 models in any production environment. Deci supports all popular DL frameworks, such as TensorFlow, PyTorch, Keras, and ONNX. All you need is our web-based platform or our Python client to run it from your code.
You can use Deci for not only exporting but also for pruning and quantization of the model!
Deci provides a nice interface for exporting in any format and performance comparison between the original and converted models. Users choose to further optimize their models by quantization.
Automatically compile and quantize your models and evaluate different production settings to achieve better latency, throughout, and reduction of the model size and memory footprint on your hardware.
Benchmark your model's performance on various hardware (including edge) devices with a button. Eliminate the need to manually set up and test multiple hardware and production settings.
Leverage Deci's python-based inference engine. Compatible with multiple frameworks and hardware types.
For more information about the Deci Platform please visit Deci's website.
Open your free account.
To start optimizing your pre-trained YOLOv5 model, you will need to convert it to ONNX format. See YOLOv5 Export Tutorial for instructions on how to convert your model to ONNX format.
Go to the "Lab" tab and click the "New Model" button in the top right part of the screen to upload your YOLOv5 ONNX model.
Follow the steps of the model upload wizard to select your target hardware as well as desired batch size and quantization level for the model compilation.
After filling in the relevant information, click "Start". The Deci platform will automatically perform a runtime optimization of your YOLOv5 model for the hardware you selected as well as benchmark your model on various hardware types. This process takes approximately 10 minutes.
Once done, a new row will appear on your screen underneath the baseline model you previously uploaded. Here you can see the optimized version of your pre-trained YOLOv5 model.
You can then download your optimized model by clicking on the "Deploy" button.
You will then be prompted to download your model and receive instructions on how to install and use Infery - Deci's runtime inference engine.
The use of Infery is optional. You can get the python raw files and use them with any other inference engine of your choice.
Explore the optimization and benchmark results on the "Insights" tab.
Before wrapping up, let’s discuss some of the advantages Deci offers:
As you have just seen, you can double the performance of a YOLOv5 model in 15 minutes overall time. The Deci platform is super easy and intuitive to use.
Any questions? Join our community and leave your question today!
Begin your journey with the future of machine learning