A memory bank is a data structure used in machine learning algorithms to store and reference information from past iterations or processed samples, effectively decoupling the model's memory capacity from its immediate computational constraints. In the context of deep learning (DL), a memory bank typically serves as a repository for embeddings or feature vectors. This allows a model to compare the current input against a vast history of previous inputs without needing to re-process or hold all that data in the active random-access memory (RAM) simultaneously. By maintaining a buffer of representations, models can learn from a broader context, improving performance in tasks that require long-term consistency or comparison against large datasets.
The primary function of a memory bank is to extend the information available to the model beyond the current batch. During training, as data flows through the neural network, the resulting feature representations are pushed into the bank. Once the bank reaches its maximum capacity, the oldest features are removed to make room for new ones, so the structure behaves as a First-In, First-Out (FIFO) queue.
This mechanism is particularly vital because GPU memory is finite. Without a memory bank, comparing a single image against a million other images would require an impossibly large batch for standard hardware. With a memory bank, the model stores only lightweight vectors for those million images and references them efficiently with similarity measures such as the dot product or cosine similarity.
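To make the retrieval step concrete, the short sketch below scores a single query embedding against every vector in a randomly initialized bank using cosine similarity; the tensor sizes and variable names here are illustrative placeholders rather than values from any particular model.

import torch
import torch.nn.functional as F

# Hypothetical bank of 100,000 stored feature vectors (Vector Dim: 128)
# L2-normalizing the rows lets a plain dot product act as cosine similarity
memory_bank = F.normalize(torch.randn(100_000, 128), dim=1)

# A single normalized query embedding from the current input
query = F.normalize(torch.randn(128), dim=0)

# One matrix-vector product scores the query against the entire bank
scores = memory_bank @ query  # shape: (100_000,)

# Indices and similarities of the 5 most similar stored features
top_scores, top_indices = scores.topk(5)
print(top_indices.tolist())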
Memory banks have become a cornerstone of several advanced computer vision (CV) and natural language processing workflows, including contrastive self-supervised learning, where methods such as MoCo maintain a queue of negative samples; industrial anomaly detection, where approaches like PatchCore compare test patches against a bank of nominal features; and video object segmentation, where features from past frames supply long-term temporal context.
It is also helpful to differentiate the memory bank from related storage and processing concepts found in this glossary. Unlike a vector database, which persists embeddings to disk for long-term retrieval across applications, a memory bank is a transient, in-memory structure tied to a single training or inference run. It likewise differs from the experience replay buffer in reinforcement learning, which stores raw state-action transitions for re-use during training rather than compact feature embeddings used for comparison.
The following Python snippet demonstrates the concept of a First-In, First-Out (FIFO) memory bank using PyTorch. This structure is often used to maintain a rolling history of feature vectors during custom training loops or complex inference tasks.
import torch
# Initialize a memory bank (Capacity: 100 features, Vector Dim: 128)
# In a real scenario, these would be embeddings from a model like YOLO26
memory_bank = torch.randn(100, 128)
# Simulate receiving a new batch of features (e.g., from the current image batch)
new_features = torch.randn(10, 128)
# Update the bank: Enqueue new features, Dequeue the oldest ones
# This maintains a fixed size while keeping the memory 'fresh'
memory_bank = torch.cat([memory_bank[10:], new_features], dim=0)
print(f"Updated Memory Bank Shape: {memory_bank.shape}")
# Output: Updated Memory Bank Shape: torch.Size([100, 128])
While powerful, memory banks introduce the challenge of "representation drift." Since the encoder network changes slightly with every training step, the features stored in the bank from 100 steps ago might be "stale" or inconsistent with the current model state. Techniques like using a momentum encoder (a slowly updating average of the model) help mitigate this issue.
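As a rough illustration of that mitigation, the sketch below keeps a second, momentum-updated copy of a toy encoder via an exponential moving average, in the spirit of MoCo; the layer sizes and the names encoder and momentum_encoder are assumptions made for this example, not part of any fixed API.

import torch
import torch.nn as nn

# Toy encoders; in practice these would be full feature extractors
encoder = nn.Linear(512, 128)
momentum_encoder = nn.Linear(512, 128)
momentum_encoder.load_state_dict(encoder.state_dict())

momentum = 0.999  # high momentum means the bank's encoder drifts slowly

@torch.no_grad()
def update_momentum_encoder():
    # EMA update: theta_m <- m * theta_m + (1 - m) * theta
    for p_m, p in zip(momentum_encoder.parameters(), encoder.parameters()):
        p_m.mul_(momentum).add_(p, alpha=1 - momentum)

# Call once after each optimizer step on `encoder`; features pushed into
# the memory bank come from `momentum_encoder`, so entries stored many
# steps apart stay mutually consistent
update_momentum_encoder()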
For teams managing the dataset versions and model artifacts behind these techniques, the Ultralytics Platform provides a centralized hub to organize data, track experiments, and deploy models efficiently. Handling the complexity of feature storage and retrieval well is essential for moving from experimental artificial intelligence (AI) to robust production systems.