Glossary

Neural Network (NN)

Discover the power of Neural Networks—key to AI and ML innovations like computer vision, NLP, and deep learning breakthroughs.

Train YOLO models simply
with Ultralytics HUB

Learn more

Neural Networks (NNs), often called Artificial Neural Networks (ANNs), are computational models inspired by the complex structure and function of the human brain. They form a cornerstone of modern Artificial Intelligence (AI) and Machine Learning (ML), designed primarily to recognize intricate patterns within data. These networks power many intelligent systems we interact with daily, from sophisticated image recognition tools to advanced language translation services, and are fundamental to fields like computer vision (CV) and Natural Language Processing (NLP).

Understanding Neural Networks

A neural network consists of layers of interconnected nodes, or neurons. These layers typically include an input layer that receives raw data (like pixel values in an image or words in a sentence), one or more hidden layers that process the data through computations, and an output layer that produces the final result, such as a classification label or a predicted value. Each connection between neurons carries a 'weight', a numerical value signifying the connection's importance in the network's calculations.

Neurons process their combined inputs using an Activation Function, like ReLU or Sigmoid, which determines the output signal they pass forward to the next layer. The network 'learns' by adjusting these weights during a training process, typically using labeled examples in a supervised learning setting. Algorithms like gradient descent and the backpropagation algorithm are commonly used to iteratively tweak the weights to minimize a loss function, which measures the difference between the network's predictions and the actual target values. Effective model training often requires careful hyperparameter tuning and strategies outlined in guides like our Model Training Tips.

Models employing Deep learning (DL) are essentially NNs with multiple hidden layers (hence "deep"). This depth allows them to learn highly complex patterns and hierarchical features from large datasets, making them exceptionally effective for tasks like object detection and instance segmentation. Unlike simpler ML models such as Linear Regression or Support Vector Machines (SVM) that might require manual feature engineering, NNs excel at automatically performing feature extraction from raw, unstructured data like images and text.

Types of Neural Networks and Distinctions

While the basic structure described above applies generally, several specialized NN architectures exist:

NNs differ significantly from traditional algorithms like Decision Trees or k-Nearest Neighbors (KNN), which operate on different principles and are often less suited for high-dimensional, complex data like images without extensive preprocessing.

Applications of Neural Networks

Neural Networks are incredibly versatile and have been applied across numerous domains. Here are two key examples:

  1. Computer Vision (CV): NNs, especially CNNs, are the driving force behind major advancements in CV. They are used for:

  2. Natural Language Processing (NLP): NNs, including RNNs and Transformers, have revolutionized how machines understand and generate human language. Applications include:

Tools and Frameworks

Developing and deploying NNs is facilitated by various powerful deep learning tools and frameworks:

Understanding NNs is crucial for anyone working in AI or ML, as they form the foundation for many state-of-the-art solutions across diverse industries, from AI in healthcare to AI in agriculture.

Read all