Meet YOLO26: next-gen vision AI.
Ultralytics
Back to Ultralytics Glossary

Convolutional Neural Network (CNN)

Explore how Convolutional Neural Networks (CNNs) power modern computer vision. Learn about layers, applications, and how to run Ultralytics YOLO26 for real-time AI.

A Convolutional Neural Network (CNN) is a specialized deep learning architecture designed to process data with a grid-like topology, most notably digital images. Inspired by the biological structure of the visual cortex, CNNs are uniquely capable of preserving spatial relationships within input data. Unlike traditional neural networks that flatten an image into a long list of numbers, CNNs analyze small, overlapping regions of an image to automatically learn hierarchies of features—from simple edges and textures to complex shapes and objects. This ability makes them the foundational technology behind modern computer vision (CV) systems.

Link to this sectionHow Convolutional Neural Networks Work#

The power of a CNN lies in its ability to reduce a complex image into a form that is easier to process without losing features critical for getting a good prediction. This is achieved through a pipeline of distinct layers that transform the input volume into an output class or value:

  • Convolution Layer: This is the core building block. It uses a set of learnable filters (or kernels) that slide over the input image like a flashlight. At each position, the filter performs a mathematical operation called convolution, creating a feature map that highlights specific patterns such as horizontal lines or color gradients.
  • Activation Function: After convolution, a non-linear function is applied to the output. The most common choice is the ReLU (Rectified Linear Unit), which turns negative pixel values to zero. This introduces non-linearity, allowing the network to learn complex patterns beyond simple linear relationships.
  • Pooling Layer: Also known as downsampling, this layer reduces the dimensionality of the feature maps. Techniques like max pooling keep only the most important features (the highest values) in a region, which reduces computational load and helps prevent overfitting.
  • Fully Connected Layer: In the final stage, the processed features are flattened and fed into a standard neural network (NN). This layer uses the high-level features identified by the previous layers to make a final classification or prediction, such as "cat" or "dog."

Link to this sectionReal-World Applications#

CNNs have transformed industries by automating visual tasks with superhuman accuracy.

  • Medical Diagnostics: In healthcare, CNNs assist radiologists by identifying anomalies in medical scans faster than the human eye. For example, deep learning models analyze MRI and CT scans to detect early signs of tumors or fractures. Research involving AI in radiology highlights how these tools improve diagnostic consistency and speed.
  • Autonomous Systems: Self-driving cars rely heavily on CNNs to perceive their surroundings. Models like YOLO26 utilize efficient CNN backbones to perform real-time object detection, identifying pedestrians, traffic signs, and other vehicles to make split-second driving decisions.

Link to this sectionCNNs vs. Vision Transformers (ViT)#

While CNNs have long been the standard for vision tasks, a newer architecture called the Vision Transformer (ViT) has emerged.

  • CNNs process images using local features and are highly efficient on smaller datasets due to their "inductive bias" (they assume nearby pixels are related). They excel in scenarios requiring real-time inference on edge devices.
  • ViTs split images into patches and process them using global self-attention mechanisms. This allows them to capture long-range dependencies across an image but typically requires massive datasets and more compute power to train effectively.

Link to this sectionImplementation Example#

Modern libraries make it straightforward to use CNN-based models. The ultralytics package provides access to state-of-the-art models like YOLO26, which feature highly optimized CNN architectures for rapid inference.

The following example demonstrates how to load a pre-trained CNN model and run a prediction:

from ultralytics import YOLO

# Load a YOLO26 model, which uses an advanced CNN architecture
model = YOLO("yolo26n.pt")

# Run inference on an image to identify objects
results = model("https://ultralytics.com/images/bus.jpg")

# Display the prediction results
results[0].show()

Link to this sectionTools for Development#

Developing CNNs is supported by a robust ecosystem of open-source tools. Engineers typically use frameworks such as PyTorch or TensorFlow to build custom architectures. These libraries provide the low-level tensor operations required for convolution and backpropagation.

For teams looking to streamline the lifecycle of computer vision projects—from data collection to deployment—the Ultralytics Platform offers a comprehensive solution. It simplifies complex workflows, allowing developers to focus on applying CNNs to solve business problems rather than managing infrastructure. Additionally, models can be exported to formats like ONNX or TensorRT for high-performance deployment on edge devices.

Explore solutions

Real-time AI that works with your team

AI in Robotics

Power smarter machines with Ultralytics YOLO models. Vision AI in robotics drives autonomous navigation, perception, object tracking, and real-time control.
Learn more
Real-time AI that works with your team

AI in Logistics

Streamline logistics with Ultralytics YOLO models. Vision AI enables package inspection, sorting, vehicle tracking, and real-time warehouse safety monitoring.
Learn more
Real-time AI that works with your team

AI in Retail

Reimagine retail with Ultralytics YOLO models. Vision AI powers inventory tracking, shelf monitoring, queue management, and smarter customer insights.
Learn more
Real-time AI that works with your team

AI in Healthcare

Build healthcare solutions with Ultralytics YOLO models. Vision AI in healthcare powers faster medical imaging, smarter diagnostics, and patient monitoring.
Learn more
Real-time AI that works with your team

AI in Manufacturing

Optimize manufacturing with Ultralytics YOLO models. Vision AI drives quality control, defect detection, PPE compliance, and assembly line automation.
Learn more
Real-time AI that works with your operation

AI in Automotive

Apply computer vision in automotive with Ultralytics YOLO models. Vision AI elevates road safety, driver assistance, and vehicle automation for smarter roads.
Learn more
Real-time AI tailored to your operation

AI in Agriculture

Bring vision AI to smart agriculture with Ultralytics YOLO models. Power crop monitoring, livestock tracking, and precision farming for higher, smarter yields.
Learn more
Real-time AI that works with your team

AI in Robotics

Power smarter machines with Ultralytics YOLO models. Vision AI in robotics drives autonomous navigation, perception, object tracking, and real-time control.
Learn more
Real-time AI that works with your team

AI in Logistics

Streamline logistics with Ultralytics YOLO models. Vision AI enables package inspection, sorting, vehicle tracking, and real-time warehouse safety monitoring.
Learn more
Real-time AI that works with your team

AI in Retail

Reimagine retail with Ultralytics YOLO models. Vision AI powers inventory tracking, shelf monitoring, queue management, and smarter customer insights.
Learn more
Real-time AI that works with your team

AI in Healthcare

Build healthcare solutions with Ultralytics YOLO models. Vision AI in healthcare powers faster medical imaging, smarter diagnostics, and patient monitoring.
Learn more
Real-time AI that works with your team

AI in Manufacturing

Optimize manufacturing with Ultralytics YOLO models. Vision AI drives quality control, defect detection, PPE compliance, and assembly line automation.
Learn more
Real-time AI that works with your operation

AI in Automotive

Apply computer vision in automotive with Ultralytics YOLO models. Vision AI elevates road safety, driver assistance, and vehicle automation for smarter roads.
Learn more
Real-time AI tailored to your operation

AI in Agriculture

Bring vision AI to smart agriculture with Ultralytics YOLO models. Power crop monitoring, livestock tracking, and precision farming for higher, smarter yields.
Learn more
Real-time AI that works with your team

AI in Robotics

Power smarter machines with Ultralytics YOLO models. Vision AI in robotics drives autonomous navigation, perception, object tracking, and real-time control.
Learn more
Real-time AI that works with your team

AI in Logistics

Streamline logistics with Ultralytics YOLO models. Vision AI enables package inspection, sorting, vehicle tracking, and real-time warehouse safety monitoring.
Learn more
Real-time AI that works with your team

AI in Retail

Reimagine retail with Ultralytics YOLO models. Vision AI powers inventory tracking, shelf monitoring, queue management, and smarter customer insights.
Learn more
Real-time AI that works with your team

AI in Healthcare

Build healthcare solutions with Ultralytics YOLO models. Vision AI in healthcare powers faster medical imaging, smarter diagnostics, and patient monitoring.
Learn more
Real-time AI that works with your team

AI in Manufacturing

Optimize manufacturing with Ultralytics YOLO models. Vision AI drives quality control, defect detection, PPE compliance, and assembly line automation.
Learn more
Real-time AI that works with your operation

AI in Automotive

Apply computer vision in automotive with Ultralytics YOLO models. Vision AI elevates road safety, driver assistance, and vehicle automation for smarter roads.
Learn more
Real-time AI tailored to your operation

AI in Agriculture

Bring vision AI to smart agriculture with Ultralytics YOLO models. Power crop monitoring, livestock tracking, and precision farming for higher, smarter yields.
Learn more

Let's build the future of AI together!

Begin your journey with the future of machine learning