Meet YOLO26: next-gen vision AI.
Ultralytics
Back to Ultralytics Glossary

Pose Estimation

Learn how pose estimation uses keypoints to track movement. Explore real-world applications and get started with Ultralytics YOLO26 for fast, accurate results.

Pose estimation is a specialized computer vision technique that goes beyond simply detecting the presence of objects to understanding their geometric structure and physical orientation. While standard object detection draws a simple rectangular box around a subject, pose estimation identifies specific semantic points, known as keypoints, such as joints on a human body (elbows, knees, shoulders) or structural corners on a vehicle. By mapping these landmarks, machine learning models can reconstruct a skeletal representation of the subject, enabling systems to interpret body language, movement dynamics, and precise positioning in 2D or 3D space.

Link to this sectionCore Mechanisms: Top-Down vs. Bottom-Up#

Modern pose estimation relies heavily on sophisticated deep learning architectures, often utilizing Convolutional Neural Networks (CNNs) to process visual data. The algorithms generally follow one of two primary strategies to identify keypoints:

  • Top-Down Approaches: This method first employs an object detection model to locate individual instances within bounding boxes. Once a person or object is cropped from the larger image, the pose estimator predicts the keypoints within that specific region. This approach is often highly accurate but can suffer from higher inference latency as the number of subjects in the frame increases.
  • Bottom-Up Approaches: Conversely, this strategy detects all potential keypoints in the entire image simultaneously (e.g., finding every "left knee" in a crowd) and then uses association algorithms to group them into individual skeletons. This method is generally preferred for real-time inference in crowded scenes because the computational cost remains relatively constant regardless of how many people are present.

State-of-the-art models like YOLO26 utilize advanced end-to-end architectures that balance these needs, providing high-speed pose estimation suitable for deployment on edge AI devices and mobile platforms.

It is helpful to differentiate pose estimation from other visual recognition tasks to understand its unique value in computer vision workflows:

  • Object Detection: Focuses on identifying what and where an object is, outputting a rectangular box. It treats the subject as a rigid object without understanding its internal articulation.
  • Instance Segmentation: Generates a pixel-perfect mask outlining the object's precise shape. While segmentation provides boundaries, it does not explicitly identify joints or skeletal linkages required for kinematic analysis.
  • Pose Estimation: Specifically targets the internal structure, mapping connections between predetermined landmarks (e.g., hip to knee) to analyze posture and action.

Link to this sectionReal-World Applications#

The ability to digitize human and object movement has led to transformative applications across various industries, often trained using tools like the Ultralytics Platform to manage large datasets of annotated keypoints.

Link to this sectionHealthcare and Rehabilitation#

In the medical field, AI in healthcare utilizes pose estimation to monitor patient rehabilitation remotely. By tracking joint angles and range of motion, automated systems can ensure patients perform physical therapy exercises correctly at home. This reduces the risk of re-injury and allows clinicians to quantify recovery progress without needing expensive laboratory equipment.

Link to this sectionSports Analytics#

Coaches and athletes leverage sports analytics to optimize performance. Pose estimation models can analyze a golfer’s swing plane, a runner’s stride length, or a pitcher’s biomechanics without the need for intrusive marker suits used in traditional motion capture. This provides immediate, data-driven feedback to improve technique and prevent overuse injuries.

Link to this sectionRetail and Behavior Analysis#

In commercial environments, AI in retail systems use pose detection to understand customer behavior, such as reaching for products on high shelves or dwelling in specific aisles. This data helps optimize store layouts and improve inventory management by correlating physical actions with purchasing decisions.

Link to this sectionCode Example: Pose Estimation with YOLO26#

Implementing pose estimation is straightforward with modern Python frameworks. The following example demonstrates how to use the ultralytics package to load a pre-trained YOLO26 model (the successor to YOLO11) and detect human keypoints in an image.

from ultralytics import YOLO

# Load the YOLO26 pose model (nano version for speed)
model = YOLO("yolo26n-pose.pt")

# Perform inference on an image source
# The model identifies bounding boxes and specific keypoints (joints)
results = model("https://ultralytics.com/images/bus.jpg")

# Print the xy coordinates of detected keypoints
print(results[0].keypoints.xy)

# Visualize the skeletal results directly
results[0].show()

Explore solutions

Real-time AI that works with your team

AI in Robotics

Power smarter machines with Ultralytics YOLO models. Vision AI in robotics drives autonomous navigation, perception, object tracking, and real-time control.
Learn more
Real-time AI that works with your team

AI in Logistics

Streamline logistics with Ultralytics YOLO models. Vision AI enables package inspection, sorting, vehicle tracking, and real-time warehouse safety monitoring.
Learn more
Real-time AI that works with your team

AI in Retail

Reimagine retail with Ultralytics YOLO models. Vision AI powers inventory tracking, shelf monitoring, queue management, and smarter customer insights.
Learn more
Real-time AI that works with your team

AI in Healthcare

Build healthcare solutions with Ultralytics YOLO models. Vision AI in healthcare powers faster medical imaging, smarter diagnostics, and patient monitoring.
Learn more
Real-time AI that works with your team

AI in Manufacturing

Optimize manufacturing with Ultralytics YOLO models. Vision AI drives quality control, defect detection, PPE compliance, and assembly line automation.
Learn more
Real-time AI that works with your operation

AI in Automotive

Apply computer vision in automotive with Ultralytics YOLO models. Vision AI elevates road safety, driver assistance, and vehicle automation for smarter roads.
Learn more
Real-time AI tailored to your operation

AI in Agriculture

Bring vision AI to smart agriculture with Ultralytics YOLO models. Power crop monitoring, livestock tracking, and precision farming for higher, smarter yields.
Learn more
Real-time AI that works with your team

AI in Robotics

Power smarter machines with Ultralytics YOLO models. Vision AI in robotics drives autonomous navigation, perception, object tracking, and real-time control.
Learn more
Real-time AI that works with your team

AI in Logistics

Streamline logistics with Ultralytics YOLO models. Vision AI enables package inspection, sorting, vehicle tracking, and real-time warehouse safety monitoring.
Learn more
Real-time AI that works with your team

AI in Retail

Reimagine retail with Ultralytics YOLO models. Vision AI powers inventory tracking, shelf monitoring, queue management, and smarter customer insights.
Learn more
Real-time AI that works with your team

AI in Healthcare

Build healthcare solutions with Ultralytics YOLO models. Vision AI in healthcare powers faster medical imaging, smarter diagnostics, and patient monitoring.
Learn more
Real-time AI that works with your team

AI in Manufacturing

Optimize manufacturing with Ultralytics YOLO models. Vision AI drives quality control, defect detection, PPE compliance, and assembly line automation.
Learn more
Real-time AI that works with your operation

AI in Automotive

Apply computer vision in automotive with Ultralytics YOLO models. Vision AI elevates road safety, driver assistance, and vehicle automation for smarter roads.
Learn more
Real-time AI tailored to your operation

AI in Agriculture

Bring vision AI to smart agriculture with Ultralytics YOLO models. Power crop monitoring, livestock tracking, and precision farming for higher, smarter yields.
Learn more
Real-time AI that works with your team

AI in Robotics

Power smarter machines with Ultralytics YOLO models. Vision AI in robotics drives autonomous navigation, perception, object tracking, and real-time control.
Learn more
Real-time AI that works with your team

AI in Logistics

Streamline logistics with Ultralytics YOLO models. Vision AI enables package inspection, sorting, vehicle tracking, and real-time warehouse safety monitoring.
Learn more
Real-time AI that works with your team

AI in Retail

Reimagine retail with Ultralytics YOLO models. Vision AI powers inventory tracking, shelf monitoring, queue management, and smarter customer insights.
Learn more
Real-time AI that works with your team

AI in Healthcare

Build healthcare solutions with Ultralytics YOLO models. Vision AI in healthcare powers faster medical imaging, smarter diagnostics, and patient monitoring.
Learn more
Real-time AI that works with your team

AI in Manufacturing

Optimize manufacturing with Ultralytics YOLO models. Vision AI drives quality control, defect detection, PPE compliance, and assembly line automation.
Learn more
Real-time AI that works with your operation

AI in Automotive

Apply computer vision in automotive with Ultralytics YOLO models. Vision AI elevates road safety, driver assistance, and vehicle automation for smarter roads.
Learn more
Real-time AI tailored to your operation

AI in Agriculture

Bring vision AI to smart agriculture with Ultralytics YOLO models. Power crop monitoring, livestock tracking, and precision farming for higher, smarter yields.
Learn more

Let's build the future of AI together!

Begin your journey with the future of machine learning