Yolo Vision Shenzhen
Shenzhen
Junte-se agora
Glossário

Cabeçalho de Detecção

Descubra o papel crítico dos detection heads na detecção de objetos, refinando os mapas de características para identificar com precisão as localizações e classes dos objetos.

A detection head acts as the final decision-making layer in an object detection neural network architecture. While the earlier layers of the model are responsible for understanding the shapes, textures, and features within an image, the detection head is the specific component that interprets this information to predict exactly what objects are present and where they are located. It transforms the abstract, high-level data produced by the feature extractor into actionable results, typically outputting a set of bounding boxes enclosing identified objects along with their corresponding class labels and confidence scores.

Distinguindo a cabeça da coluna vertebral e do pescoço

To fully grasp the function of a detection head, it is helpful to visualize modern detectors as being composed of three primary stages, each serving a distinct purpose in the computer vision (CV) pipeline:

  • Backbone: This is the initial part of the network, often a Convolutional Neural Network (CNN) like ResNet or CSPNet. It processes the raw input image to create feature maps that represent visual patterns.
  • Neck: Sitting between the backbone and the head, the neck refines and combines features from different scales. Architectures like the Feature Pyramid Network (FPN) ensure the model can detect objects of varying sizes by aggregating context.
  • Head: The final component that consumes the refined features from the neck. It performs the actual task of classification (what is it?) and regression (where is it?).

Evolução: Baseada em âncora vs. Sem âncora

The design of detection heads has evolved significantly to improve speed and accuracy, particularly with the transition from traditional methods to modern real-time inference models.

Aplicações no Mundo Real

The precision of the detection head is critical for deploying artificial intelligence (AI) in safety-critical and industrial environments. Users can easily annotate data and train these specialized heads using the Ultralytics Platform.

  • Condução autónoma: Na IA para automóveis, o cabeçote de detecção é responsável por distinguir entre pedestres, semáforos e outros veículos em tempo real. Um cabeçote altamente otimizado garante que a latência de inferência permaneça baixa o suficiente para que o veículo reaja instantaneamente.
  • Diagnósticos médicos: Na análise de imagens médicas, os sensores de detecção são ajustados com precisão para localizar anomalias, como tumores em exames de ressonância magnética. O ramo de regressão deve ser extremamente preciso para delinear os limites exatos de uma lesão, auxiliando os médicos em soluções de saúde.

Exemplo de código

The following example demonstrates how to load a YOLO26 model and inspect the output of its detection head. When inference runs, the head processes the image and returns the final boxes containing coordinates and class IDs.

from ultralytics import YOLO

# Load the YOLO26n model (nano version)
model = YOLO("yolo26n.pt")

# Run inference on an image to utilize the detection head
results = model("https://ultralytics.com/images/bus.jpg")

# The detection head outputs are stored in results[0].boxes
for box in results[0].boxes:
    # Print the bounding box coordinates and the predicted class
    print(f"Class: {int(box.cls)}, Coordinates: {box.xywh.numpy()}")

This interaction highlights how the detection head translates complex neural network activations into readable data that developers can use for downstream tasks like object tracking or counting.

Junte-se à comunidade Ultralytics

Junte-se ao futuro da IA. Conecte-se, colabore e cresça com inovadores globais

Junte-se agora