Meet YOLO26: next-gen vision AI.
Ultralytics
Back to Ultralytics Glossary

Multi-Object Tracking (MOT)

Explore Multi-Object Tracking (MOT) in computer vision. Learn how to detect and track entities using Ultralytics YOLO26 for autonomous driving, retail, and more.

Multi-Object Tracking (MOT) is a dynamic task in computer vision (CV) that involves detecting multiple distinct entities within a video stream and maintaining their identities over time. Unlike standard object detection, which treats every frame as an isolated snapshot, MOT introduces a temporal dimension to artificial intelligence (AI). By assigning a unique identification number (ID) to each detected instance—such as a specific pedestrian in a crowd or a vehicle on a highway—MOT algorithms allow systems to trace trajectories, analyze behavior, and understand interactions. This capability is fundamental to modern video understanding and enables machines to perceive continuity in a changing environment.

Link to this sectionHow MOT Works#

Most contemporary tracking systems operate on a "tracking-by-detection" paradigm. This approach separates the process into two main stages: identifying what is in the frame and then associating those findings with known objects from the past.

  1. Detection: In each frame, a high-performance model like YOLO26 scans the image to locate objects, generating bounding boxes and class probabilities.

  2. Motion Prediction: To anticipate where an object will move next, algorithms often use a Kalman Filter. This mathematical tool estimates the state of a dynamic system—such as velocity and position—helping to narrow the search area in the subsequent frame.

  3. Data Association: The system matches new detections to existing tracks. Optimization methods like the Hungarian algorithm solve this assignment problem by minimizing the cost of matching, often relying on Intersection over Union (IoU) to measure spatial overlap.

  4. Re-Identification (ReID): When visual obstructions occur—known as occlusion—advanced trackers use visual embeddings to recognize the object when it reappears. This helps prevent "ID switching," ensuring the system knows that the car emerging from a tunnel is the same one that entered it.

Link to this sectionDistinguishing MOT from Single Object Tracking#

While the terminology is similar, Multi-Object Tracking (MOT) differs significantly from Single Object Tracking (SOT). SOT focuses on following one specific target initialized in the first frame, often ignoring all other entities. In contrast, MOT must handle an unknown and varying number of targets that may enter or leave the scene at any time. This makes MOT computationally more demanding, as it requires robust logic to handle track initiation, termination, and the complex interactions between multiple moving bodies.

Link to this sectionReal-World Applications#

The ability to track multiple entities simultaneously drives innovation across several major industries.

  • Autonomous Driving: Self-driving cars rely heavily on MOT to navigate safely. By tracking pedestrians, cyclists, and other vehicles, autonomous systems can predict future positions to avoid collisions. This often involves fusing data from cameras and LiDAR sensors for maximum reliability.
  • Retail Analytics: In physical stores, retailers use AI in retail to map customer journeys. MOT algorithms generate heatmaps of foot traffic, helping managers optimize store layouts and improve queue management during peak hours.
  • Sports Analytics: Professional teams use MOT to analyze player movements and team formations. By tracking every player on the field, coaches can extract detailed metrics on speed, distance covered, and tactical positioning using pose estimation techniques.

Link to this sectionImplementing MOT with Python#

Ultralytics makes it straightforward to implement tracking with state-of-the-art models. The track() method integrates detection and tracking logic seamlessly, supporting algorithms like ByteTrack and BoT-SORT. The example below demonstrates tracking vehicles in a video using the recommended YOLO26 model.

from ultralytics import YOLO

# Load the official YOLO26 small model
model = YOLO("yolo26s.pt")

# Track objects in a video file (or use '0' for webcam)
# The 'persist=True' argument keeps track IDs consistent between frames
results = model.track(source="traffic_analysis.mp4", show=True, persist=True)

# Print the IDs of objects tracked in the first frame
if results[0].boxes.id is not None:
    print(f"Tracked IDs: {results[0].boxes.id.int().tolist()}")

Link to this sectionChallenges in Multi-Object Tracking#

Despite advancements, MOT remains a challenging field. Occlusion is a primary difficulty; when objects cross paths or hide behind obstacles, maintaining identity is complex. Crowded scenes, such as a busy marathon or a flock of birds, test the limits of data association algorithms. Furthermore, maintaining real-time inference speeds while processing high-resolution video streams requires efficient model architectures and often specialized hardware like NVIDIA Jetson devices.

To address these challenges, researchers are exploring end-to-end deep learning approaches that unify detection and tracking into a single network, as well as leveraging the Ultralytics Platform to annotate challenging datasets and train robust custom models.

Explore solutions

Real-time AI that works with your team

AI in Robotics

Power smarter machines with Ultralytics YOLO models. Vision AI in robotics drives autonomous navigation, perception, object tracking, and real-time control.
Learn more
Real-time AI that works with your team

AI in Logistics

Streamline logistics with Ultralytics YOLO models. Vision AI enables package inspection, sorting, vehicle tracking, and real-time warehouse safety monitoring.
Learn more
Real-time AI that works with your team

AI in Retail

Reimagine retail with Ultralytics YOLO models. Vision AI powers inventory tracking, shelf monitoring, queue management, and smarter customer insights.
Learn more
Real-time AI that works with your team

AI in Healthcare

Build healthcare solutions with Ultralytics YOLO models. Vision AI in healthcare powers faster medical imaging, smarter diagnostics, and patient monitoring.
Learn more
Real-time AI that works with your team

AI in Manufacturing

Optimize manufacturing with Ultralytics YOLO models. Vision AI drives quality control, defect detection, PPE compliance, and assembly line automation.
Learn more
Real-time AI that works with your operation

AI in Automotive

Apply computer vision in automotive with Ultralytics YOLO models. Vision AI elevates road safety, driver assistance, and vehicle automation for smarter roads.
Learn more
Real-time AI tailored to your operation

AI in Agriculture

Bring vision AI to smart agriculture with Ultralytics YOLO models. Power crop monitoring, livestock tracking, and precision farming for higher, smarter yields.
Learn more
Real-time AI that works with your team

AI in Robotics

Power smarter machines with Ultralytics YOLO models. Vision AI in robotics drives autonomous navigation, perception, object tracking, and real-time control.
Learn more
Real-time AI that works with your team

AI in Logistics

Streamline logistics with Ultralytics YOLO models. Vision AI enables package inspection, sorting, vehicle tracking, and real-time warehouse safety monitoring.
Learn more
Real-time AI that works with your team

AI in Retail

Reimagine retail with Ultralytics YOLO models. Vision AI powers inventory tracking, shelf monitoring, queue management, and smarter customer insights.
Learn more
Real-time AI that works with your team

AI in Healthcare

Build healthcare solutions with Ultralytics YOLO models. Vision AI in healthcare powers faster medical imaging, smarter diagnostics, and patient monitoring.
Learn more
Real-time AI that works with your team

AI in Manufacturing

Optimize manufacturing with Ultralytics YOLO models. Vision AI drives quality control, defect detection, PPE compliance, and assembly line automation.
Learn more
Real-time AI that works with your operation

AI in Automotive

Apply computer vision in automotive with Ultralytics YOLO models. Vision AI elevates road safety, driver assistance, and vehicle automation for smarter roads.
Learn more
Real-time AI tailored to your operation

AI in Agriculture

Bring vision AI to smart agriculture with Ultralytics YOLO models. Power crop monitoring, livestock tracking, and precision farming for higher, smarter yields.
Learn more
Real-time AI that works with your team

AI in Robotics

Power smarter machines with Ultralytics YOLO models. Vision AI in robotics drives autonomous navigation, perception, object tracking, and real-time control.
Learn more
Real-time AI that works with your team

AI in Logistics

Streamline logistics with Ultralytics YOLO models. Vision AI enables package inspection, sorting, vehicle tracking, and real-time warehouse safety monitoring.
Learn more
Real-time AI that works with your team

AI in Retail

Reimagine retail with Ultralytics YOLO models. Vision AI powers inventory tracking, shelf monitoring, queue management, and smarter customer insights.
Learn more
Real-time AI that works with your team

AI in Healthcare

Build healthcare solutions with Ultralytics YOLO models. Vision AI in healthcare powers faster medical imaging, smarter diagnostics, and patient monitoring.
Learn more
Real-time AI that works with your team

AI in Manufacturing

Optimize manufacturing with Ultralytics YOLO models. Vision AI drives quality control, defect detection, PPE compliance, and assembly line automation.
Learn more
Real-time AI that works with your operation

AI in Automotive

Apply computer vision in automotive with Ultralytics YOLO models. Vision AI elevates road safety, driver assistance, and vehicle automation for smarter roads.
Learn more
Real-time AI tailored to your operation

AI in Agriculture

Bring vision AI to smart agriculture with Ultralytics YOLO models. Power crop monitoring, livestock tracking, and precision farming for higher, smarter yields.
Learn more

Let's build the future of AI together!

Begin your journey with the future of machine learning