Explore how prompt enrichment automates input augmentation to improve AI accuracy. Learn to optimize LLMs and vision models like [YOLO26](https://docs.ultralytics.com/models/yolo26/) for better results.
Prompt enrichment is the automated process of augmenting a user's initial input with relevant context, specific instructions, or supplementary data before submitting it to an Artificial Intelligence (AI) model. This technique acts as an intelligent middleware layer that optimizes the interaction between humans and machines, ensuring that Large Language Models (LLMs) and computer vision systems receive comprehensive queries. By injecting details that a user might omit—such as historical preferences, location data, or technical constraints—prompt enrichment significantly improves the accuracy and personalization of the model's output without requiring the user to be an expert in crafting detailed instructions.
The core function of prompt enrichment is to bridge the gap between a vague human intent and the precise, data-rich input that models require for optimal performance. When a query is received, the system analyzes it and retrieves necessary background information from a knowledge graph or a structured database. This retrieved data is programmatically formatted and appended to the original prompt.
For example, in Natural Language Processing (NLP) workflows, a simple question like "What is the status?" is contextually insufficient. An enrichment system identifies the active session, retrieves the latest order number from a transactional database, and rewrites the prompt to: "The user is asking about Order #998, which is currently in transit. Provide a shipping update based on this status." This process often utilizes vector databases to quickly find semantically relevant context to inject.
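The snippet below sketches this text-enrichment step in plain Python. It is a minimal illustration, not a production pipeline: the `fetch`-style lookup against the in-memory `ORDERS` table is a hypothetical stand-in for a real transactional database or vector store query.

```python
# Hypothetical in-memory stand-in for a transactional database lookup
ORDERS = {"session_42": {"order_id": "#998", "status": "in transit"}}


def enrich_prompt(session_id: str, user_query: str) -> str:
    """Rewrite a vague user query into a context-rich prompt for an LLM."""
    order = ORDERS.get(session_id)
    if order is None:
        # No context available; pass the query through unchanged
        return user_query
    return (
        f"The user is asking about Order {order['order_id']}, "
        f"which is currently {order['status']}. "
        f"Answer their question based on this status: {user_query}"
    )


print(enrich_prompt("session_42", "What is the status?"))
```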
Prompt enrichment is essential for deploying robust generative AI applications across various industries, enhancing both text and vision-based systems.
The following Python example demonstrates the concept of prompt enrichment using the `ultralytics` package. Here, a user's high-level intent is programmatically enriched into a list of specific descriptive classes for the model to detect.
```python
from ultralytics import YOLO


def run_enriched_inference(user_mode):
    """Enrich a simple user mode into specific detection prompts."""
    # Load a YOLO-World model capable of open-vocabulary detection
    model = YOLO("yolov8s-world.pt")

    # Enrichment logic: map simple user intent to detailed class prompts
    context_map = {
        "site_safety": ["hard hat", "safety vest", "gloves"],
        "traffic": ["car", "bus", "traffic light", "pedestrian"],
    }

    # Inject the enriched context into the model, falling back to a generic class
    enriched_classes = context_map.get(user_mode, ["object"])
    model.set_classes(enriched_classes)

    # The model now looks only for the specific enriched terms
    print(f"Mode: {user_mode} -> Enriched Prompt: {enriched_classes}")


run_enriched_inference("site_safety")
```
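From here, calling `model.predict()` on an image or video stream returns detections restricted to the enriched vocabulary, so switching the mode from `"site_safety"` to `"traffic"` repurposes the same model without any retraining.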
To implement effective Machine Learning Operations (MLOps), it is helpful to distinguish prompt enrichment from related techniques such as prompt engineering, the manual craft of writing effective instructions, and Retrieval-Augmented Generation (RAG), which grounds a model in retrieved documents; enrichment is the automated middleware step that injects context, preferences, or instructions into a query at runtime.
As models like Ultralytics YOLO26 and GPT-4 become more capable, the bottleneck often shifts to the quality of the input. Prompt enrichment mitigates hallucinations in LLMs by grounding the model in factual, provided data. In computer vision (CV), it enables flexible zero-shot detection systems that can adapt to new environments instantly without retraining, simply by modifying the text prompts fed into the system. This flexibility is crucial for building scalable, multi-modal AI solutions that can reason over both text and images. Users looking to manage the datasets used to ground these systems often rely on tools like the Ultralytics Platform to organize and annotate their information effectively.