Descubra como a pesquisa vetorial revoluciona a IA, permitindo a similaridade semântica na recuperação de dados para PNL, pesquisa visual, sistemas de recomendação e muito mais!
Vector search is a sophisticated method of information retrieval that identifies similar items within a dataset based on their mathematical characteristics rather than exact keyword matches. Unlike traditional keyword search, which relies on finding specific character strings, vector search analyzes the underlying semantic meaning of the data. This technique is fundamental to modern artificial intelligence (AI) applications because it allows computers to understand relationships between abstract concepts, processing unstructured data such as images, audio files, and natural language text with remarkable accuracy.
The core of vector search involves transforming raw data into high-dimensional numerical vectors known as embeddings. This process maps items to points in a multi-dimensional space where conceptually similar items are located close to one another.
To implement vector search, you must first convert your data into vectors. The following code snippet demonstrates how
to generate feature maps and embeddings from an image
using the ultralytics package and a pre-trained YOLO26 model.
from ultralytics import YOLO
# Load a pre-trained YOLO26 classification model
model = YOLO("yolo26n-cls.pt")
# Generate feature embeddings for an image URL
# The 'embed' method returns the high-dimensional vector representation
results = model.embed("https://ultralytics.com/images/bus.jpg")
# Print the shape of the resulting embedding vector
print(f"Embedding vector shape: {results[0].shape}")
Vector search is the engine behind many intuitive features in today's software ecosystem, bridging the gap between computer vision (CV) and user intent.
It is helpful to differentiate vector search from similar terms to understand the full machine learning (ML) pipeline.
For teams building similarity search systems, managing datasets and training embedding models is a crucial first step. The Ultralytics Platform simplifies this workflow by providing tools for data management, cloud training, and model deployment. By ensuring your base models—whether for object detection or classification—are high-performing, you ensure the resulting vectors provide accurate and meaningful search results.