Meet YOLO26: next-gen vision AI.
Ultralytics
Back to Ultralytics Glossary

AI Agent

Explore the world of AI agents. Learn how these autonomous systems use Ultralytics YOLO26 to perceive, reason, and act in real-time to solve complex tasks.

An AI Agent is an autonomous system capable of perceiving its environment, reasoning through complex logic to make decisions, and taking specific actions to achieve defined goals. Unlike a static machine learning model, which passively processes input to produce an output, an agent operates dynamically within a continuous workflow. These systems form the "active" layer of artificial intelligence, bridging the gap between digital predictions and real-world execution. By utilizing memory and adaptive learning, agents can handle tasks ranging from software automation to physical navigation without constant human intervention.

Link to this sectionThe Perception-Reasoning-Action Loop#

The functionality of an AI agent relies on a cyclical process often described as the Perception-Action Loop. This architecture allows the agent to interact meaningfully with its surroundings.

  1. Perception (Sensing): The agent gathers information from the world. In computer vision applications, the agent uses cameras as "eyes." It employs high-speed models like YOLO26 to perform object detection or segmentation, converting raw pixels into structured data.

  2. Reasoning (Thinking): The agent processes the perceived data against its objectives. This stage often integrates Large Language Models (LLMs) for semantic understanding or reinforcement learning algorithms to optimize decision-making strategies. Advanced agents can plan multiple steps ahead, much like a chess player anticipating future moves.

  3. Action (Executing): Based on its reasoning, the agent executes a task. This could be a digital action, such as querying a database or sending an alert, or a physical action in robotics, such as a robotic arm picking a specific item from a conveyor belt.

Link to this sectionAI Agent vs. AI Model#

It is important to distinguish between an agent and a model, as they serve different roles in the technology stack.

  • AI Model: A model is a mathematical engine, such as a neural network, trained to recognize patterns. It is a tool that provides predictions (e.g., "This is a car") but does not inherently act on them.
  • AI Agent: An agent is the encompassing system that uses models as tools. It possesses agency—the capacity to initiate change. For instance, while a model identifies a red light, the agent decides to apply the brakes.

Link to this sectionReal-World Applications#

AI agents are transforming industries by automating workflows that require cognitive flexibility.

  • Smart Manufacturing: In industrial automation, visual agents monitor production lines. If a defect is identified by a quality control system, the agent can autonomously halt machinery and log the incident, preventing waste.
  • Autonomous Logistics: Warehouses utilize agentic robots for inventory management. These agents navigate dynamic environments using SLAM (Simultaneous Localization and Mapping) and vision models to locate, pick, and transport packages efficiently.

Link to this sectionBuilding a Simple Vision Agent#

Developers can build basic agents by combining perception models with conditional logic. The following Python example demonstrates a simple "Security Agent" using the ultralytics package. The agent detects a person and decides whether to trigger an alert based on the model's confidence.

from ultralytics import YOLO

# Load the YOLO26 model (The Agent's Perception)
model = YOLO("yolo26n.pt")

# 1. Perceive: The agent analyzes an image
results = model("bus.jpg")

# 2. Reason & 3. Act: Decision logic based on perception
for result in results:
    # Check if a 'person' (class 0) is detected with high confidence
    if 0 in result.boxes.cls and result.boxes.conf.max() > 0.5:
        print("ACTION: Person detected! Initiating security protocol.")
    else:
        print("ACTION: Area clear. Continuing surveillance.")
  • Edge AI: To react in real-time, agents often run locally on hardware like the NVIDIA Jetson, minimizing latency by processing data at the source rather than the cloud.
  • Artificial General Intelligence (AGI): While current agents are specialized (Narrow AI), AGI refers to hypothetical agents capable of performing any intellectual task that a human can do.
  • Generative AI: Modern agents frequently use GenAI to create dynamic responses or code, acting as assistants that can generate content as part of their workflow.

For those looking to train the underlying models for their agents, the Ultralytics Platform offers a streamlined environment for annotating datasets and managing training runs. Further reading on agent architectures can be found in research from organizations like Stanford HAI and DeepMind.

Explore solutions

Real-time AI that works with your team

AI in Robotics

Power smarter machines with Ultralytics YOLO models. Vision AI in robotics drives autonomous navigation, perception, object tracking, and real-time control.
Learn more
Real-time AI that works with your team

AI in Logistics

Streamline logistics with Ultralytics YOLO models. Vision AI enables package inspection, sorting, vehicle tracking, and real-time warehouse safety monitoring.
Learn more
Real-time AI that works with your team

AI in Retail

Reimagine retail with Ultralytics YOLO models. Vision AI powers inventory tracking, shelf monitoring, queue management, and smarter customer insights.
Learn more
Real-time AI that works with your team

AI in Healthcare

Build healthcare solutions with Ultralytics YOLO models. Vision AI in healthcare powers faster medical imaging, smarter diagnostics, and patient monitoring.
Learn more
Real-time AI that works with your team

AI in Manufacturing

Optimize manufacturing with Ultralytics YOLO models. Vision AI drives quality control, defect detection, PPE compliance, and assembly line automation.
Learn more
Real-time AI that works with your operation

AI in Automotive

Apply computer vision in automotive with Ultralytics YOLO models. Vision AI elevates road safety, driver assistance, and vehicle automation for smarter roads.
Learn more
Real-time AI tailored to your operation

AI in Agriculture

Bring vision AI to smart agriculture with Ultralytics YOLO models. Power crop monitoring, livestock tracking, and precision farming for higher, smarter yields.
Learn more
Real-time AI that works with your team

AI in Robotics

Power smarter machines with Ultralytics YOLO models. Vision AI in robotics drives autonomous navigation, perception, object tracking, and real-time control.
Learn more
Real-time AI that works with your team

AI in Logistics

Streamline logistics with Ultralytics YOLO models. Vision AI enables package inspection, sorting, vehicle tracking, and real-time warehouse safety monitoring.
Learn more
Real-time AI that works with your team

AI in Retail

Reimagine retail with Ultralytics YOLO models. Vision AI powers inventory tracking, shelf monitoring, queue management, and smarter customer insights.
Learn more
Real-time AI that works with your team

AI in Healthcare

Build healthcare solutions with Ultralytics YOLO models. Vision AI in healthcare powers faster medical imaging, smarter diagnostics, and patient monitoring.
Learn more
Real-time AI that works with your team

AI in Manufacturing

Optimize manufacturing with Ultralytics YOLO models. Vision AI drives quality control, defect detection, PPE compliance, and assembly line automation.
Learn more
Real-time AI that works with your operation

AI in Automotive

Apply computer vision in automotive with Ultralytics YOLO models. Vision AI elevates road safety, driver assistance, and vehicle automation for smarter roads.
Learn more
Real-time AI tailored to your operation

AI in Agriculture

Bring vision AI to smart agriculture with Ultralytics YOLO models. Power crop monitoring, livestock tracking, and precision farming for higher, smarter yields.
Learn more
Real-time AI that works with your team

AI in Robotics

Power smarter machines with Ultralytics YOLO models. Vision AI in robotics drives autonomous navigation, perception, object tracking, and real-time control.
Learn more
Real-time AI that works with your team

AI in Logistics

Streamline logistics with Ultralytics YOLO models. Vision AI enables package inspection, sorting, vehicle tracking, and real-time warehouse safety monitoring.
Learn more
Real-time AI that works with your team

AI in Retail

Reimagine retail with Ultralytics YOLO models. Vision AI powers inventory tracking, shelf monitoring, queue management, and smarter customer insights.
Learn more
Real-time AI that works with your team

AI in Healthcare

Build healthcare solutions with Ultralytics YOLO models. Vision AI in healthcare powers faster medical imaging, smarter diagnostics, and patient monitoring.
Learn more
Real-time AI that works with your team

AI in Manufacturing

Optimize manufacturing with Ultralytics YOLO models. Vision AI drives quality control, defect detection, PPE compliance, and assembly line automation.
Learn more
Real-time AI that works with your operation

AI in Automotive

Apply computer vision in automotive with Ultralytics YOLO models. Vision AI elevates road safety, driver assistance, and vehicle automation for smarter roads.
Learn more
Real-time AI tailored to your operation

AI in Agriculture

Bring vision AI to smart agriculture with Ultralytics YOLO models. Power crop monitoring, livestock tracking, and precision farming for higher, smarter yields.
Learn more

Let's build the future of AI together!

Begin your journey with the future of machine learning