Yolo Vision Shenzhen
Shenzhen
Join now
Glossary

Neural Network (NN)

Discover the power of Neural Networks—key to AI and ML innovations like computer vision, NLP, and deep learning breakthroughs.

A Neural Network (NN) is a computational model designed to recognize patterns, solve complex problems, and process information in a way that mimics the biological structure of the human brain. Serving as the fundamental building block of modern Artificial Intelligence (AI), neural networks allow computers to learn from observation rather than relying solely on manually programmed rules. While they are a subset of Machine Learning (ML), their ability to model intricate, non-linear relationships makes them indispensable for advanced tasks ranging from speech recognition to powering autonomous vehicles. You can explore the biological inspiration behind these systems in this overview of neural networks by IBM.

Architecture and Functionality

The structure of a neural network is composed of interconnected layers of nodes, often referred to as artificial neurons. These layers facilitate the flow of data from input to output through a series of mathematical transformations.

  • Input Layer: This is the entry point where the network receives raw training data. This could be pixels from a photograph, audio waves, or numerical features from a spreadsheet.
  • Hidden Layers: Located between the input and output, these layers perform the heavy lifting. Each neuron in a hidden layer processes the signal it receives by applying model weights and biases. To introduce non-linearity—which allows the network to learn complex boundaries—an activation function like ReLU or Sigmoid determines whether the neuron should "fire" and pass the signal forward.
  • Output Layer: The final layer produces the network's prediction, such as a probability score for a classification task (e.g., identifying a tumor in medical image analysis).

To "learn," the network undergoes a process called model training. During this phase, the system compares its predictions to the correct answers using a loss function. Through a technique known as backpropagation, the network calculates the error and uses an optimization algorithm, such as Stochastic Gradient Descent (SGD) or Adam, to adjust the weights. This process repeats over many epochs until the error is minimized.

Neural Networks vs. Deep Learning

While the terms are often used interchangeably, it is important to distinguish between a basic Neural Network and Deep Learning (DL). The primary difference lies in depth and complexity. A standard or "shallow" neural network may have only one or two hidden layers. In contrast, Deep Learning involves "deep" neural networks with dozens or even hundreds of layers. This depth enables feature extraction to happen automatically, allowing the model to understand hierarchical patterns—simple edges become shapes, and shapes become recognizable objects. For a deeper technical dive, MIT News explains deep learning and its evolution from basic networks.

Real-World Applications

Neural networks are the engines behind many technologies that define the modern era.

  1. Computer Vision: In the field of Computer Vision (CV), specialized networks called Convolutional Neural Networks (CNNs) are used to analyze visual data. Advanced models like Ultralytics YOLO26 utilize deep neural network architectures for real-time object detection. These systems are critical for AI in agriculture, where they monitor crop health, and in security systems for anomaly detection.
  2. Natural Language Processing: For tasks involving text, architectures like Recurrent Neural Networks (RNNs) and Transformers have revolutionized how machines understand human language. These networks power machine translation tools and virtual assistants. You can see the impact of these technologies in AI in healthcare, where they assist in transcribing medical notes and analyzing patient records.

Practical Implementation

Modern software libraries make it accessible to deploy neural networks without needing to write the mathematical operations from scratch. The following Python code demonstrates how to load a pre-trained neural network (specifically the state-of-the-art YOLO26 model) and run inference on an image using the ultralytics package.

from ultralytics import YOLO

# Load a pretrained YOLO26 neural network model
model = YOLO("yolo26n.pt")

# Run inference on an image to detect objects
# The model processes the image through its layers to predict bounding boxes
results = model("https://ultralytics.com/images/bus.jpg")

# Display the results
results[0].show()

Tools and Ecosystem

Building and managing neural networks requires a robust ecosystem of tools.

  • Frameworks: PyTorch and TensorFlow are the leading open-source libraries, providing the essential building blocks for designing custom architectures.
  • Hardware: Training these networks involves massive matrix calculations, which are significantly accelerated by GPU (Graphics Processing Unit) technology. You can read about the importance of hardware in NVIDIA's guide to AI computing.
  • Development Platforms: For a streamlined workflow, the Ultralytics Platform provides a comprehensive environment for managing datasets, orchestrating training sessions, and handling model deployment to various edge devices.

Join the Ultralytics community

Join the future of AI. Connect, collaborate, and grow with global innovators

Join now