정규화(Normalization)
Explore how normalization improves [machine learning](https://www.ultralytics.com/glossary/machine-learning-ml) performance. Learn scaling techniques like Min-Max and Z-score to optimize [YOLO26](https://docs.ultralytics.com/models/yolo26/) on the [Ultralytics Platform](https://platform.ultralytics.com).
Normalization is a fundamental technique in
data preprocessing that involves rescaling
numeric attributes to a standard range. In the context of
machine learning (ML), datasets often contain
features with varying scales—such as age ranges (0–100) versus income levels (0–100,000). If left untreated, these
disparities can cause the
optimization algorithm to become biased
toward larger values, leading to slower convergence and suboptimal performance. By normalizing data, engineers ensure
that every feature contributes proportionately to the final result, allowing
neural networks to learn more efficiently.
일반적인 정규화 기법
There are several standard methods for transforming data, each suited for different distributions and algorithm
requirements.
-
Min-Max Scaling:
This is the most intuitive form of normalization. It rescales the data to a fixed range, usually [0, 1]. This
transformation is performed by subtracting the minimum value and dividing by the range (maximum minus minimum). It
is widely used in
image processing
where pixel intensities are known to be bounded between 0 and 255.
-
Z-Score Standardization: While often
used interchangeably with normalization, standardization specifically transforms data to have a mean of 0 and a
standard deviation of 1. This is particularly useful when the data follows a
Gaussian distribution and is essential for
algorithms like
Support Vector Machines (SVM) that
assume normally distributed data.
-
Log Scaling:
For data containing extreme outliers or following a power law, applying a logarithmic transformation can compress
the range of values. This makes the distribution more manageable for the
inference engine to interpret effectively
without being skewed by massive value spikes.
실제 애플리케이션
정규화는 다양한 산업 분야의 고성능 AI 시스템 파이프라인에서 표준적인 단계입니다.
-
Computer Vision (CV): In tasks such as
object detection and
image classification, digital images are
composed of pixel values ranging from 0 to 255. Feeding these large integers directly into a network can slow down
gradient descent. A standard preprocessing step
involves dividing pixel values by 255.0 to normalize them to the [0, 1] range. This practice ensures consistent
inputs for advanced models like YOLO26, improving training
stability on the Ultralytics Platform.
-
의료 영상 분석: 의료용 스캔(예: 의료 AI에 사용되는 스캔)은 종종 서로 다른 기기에서 생성되며, 각 기기마다 강도 척도가 다릅니다. 정규화는 MRI 또는 CT 스캔의 픽셀 강도가 서로 다른 환자와 장비 간에 비교 가능하도록 보장합니다. 이러한 일관성은 정확한 종양 탐지에 매우 중요하며, 모델이 밝기 변화보다는 구조적 이상에 집중할 수 있게 합니다.
관련 개념 구분하기
It is important to differentiate normalization from similar preprocessing and architectural terms found in deep
learning.
-
vs. Batch Normalization:
Data normalization is a preprocessing step applied to the raw input dataset before it enters the network.
Conversely, Batch Normalization operates internally between layers throughout the network during
model training. It normalizes the output of a previous
activation layer to stabilize the learning process.
-
vs. Image Augmentation:
While normalization changes the scale of the pixel values, augmentation changes the content or
geometry of the image (e.g., flipping, rotating, or changing colors) to increase dataset diversity. Tools like
Albumentations are used for augmentation,
whereas normalization is a mathematical scaling operation.
구현 예시
In computer vision, normalization is often the first step in the pipeline. The following
Python example demonstrates how to manually normalize image data using the
NumPy library, a process that happens automatically within the
Ultralytics YOLO26 data loader during training.
import numpy as np
# Simulate a 2x2 pixel image with values ranging from 0 to 255
raw_image = np.array([[0, 255], [127, 64]], dtype=np.float32)
# Apply Min-Max normalization to scale values to [0, 1]
# This standardizes the input for the neural network
normalized_image = raw_image / 255.0
print(f"Original Range: {raw_image.min()} - {raw_image.max()}")
print(f"Normalized Range: {normalized_image.min()} - {normalized_image.max()}")