Learn what recall means in machine learning, why it matters, and how AI models effectively capture critical positive instances.
Recall, also known as sensitivity or the true positive rate, is a fundamental performance metric in machine learning that measures the ability of a model to identify all relevant instances within a dataset. In the context of object detection or classification, it specifically answers the question: "Out of all the actual positive cases, how many did the model correctly find?" Achieving high recall is critical in scenarios where missing a positive instance—often referred to as a false negative—carries significant consequences. Unlike accuracy, which can be misleading when dealing with imbalanced data, recall provides a focused view on the model's effectiveness at "capturing" the target class.
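Formally, recall is computed from the counts of true positives (TP) and false negatives (FN):

$$\text{Recall} = \frac{TP}{TP + FN}$$

A recall of 1.0 means the model missed no actual positive instances.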
In many artificial intelligence applications, the cost of failing to detect an object is far higher than the cost of a false alarm. A model optimized for recall minimizes false negatives, ensuring that the system casts a wide enough net to catch potential threats, anomalies, or critical conditions. This often involves a trade-off, as increasing recall can sometimes lead to a lower precision score, meaning the model might flag more non-relevant items as positive. Understanding this balance is key to developing robust machine learning solutions.
Recall is a core metric for many safety-critical AI solutions. Two representative cases where sensitivity is prioritized are medical diagnosis, where a missed disease (a false negative) can delay life-saving treatment, and security screening, where failing to flag a genuine threat costs far more than raising a false alarm.
It is essential to distinguish recall from its counterpart, precision. While recall measures the quantity of relevant cases found (completeness), precision measures the quality of the positive predictions (exactness).
These two metrics often share an inverse relationship, visualized through a Precision-Recall curve. To evaluate the overall balance between them, developers often look at the F1-score, which is the harmonic mean of both. In imbalanced datasets, looking at recall alongside the confusion matrix gives a much clearer picture of performance than accuracy alone.
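As a quick illustration, the following minimal Python sketch (with made-up confusion-matrix counts) computes all three metrics directly from their definitions:

# Hypothetical confusion-matrix counts, chosen purely for illustration
tp, fp, fn = 80, 10, 20

precision = tp / (tp + fp)  # exactness: 80 / 90 ≈ 0.889
recall = tp / (tp + fn)     # completeness: 80 / 100 = 0.800
f1 = 2 * precision * recall / (precision + recall)  # harmonic mean ≈ 0.842

print(f"Precision: {precision:.3f}  Recall: {recall:.3f}  F1: {f1:.3f}")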
When you train models like the cutting-edge YOLO26, recall is automatically computed during the validation phase. The framework calculates per-class recall along with the mean Average Precision (mAP), helping developers gauge how well the model finds objects.
You can easily validate a trained model and inspect its recall metric. The following Python snippet shows how to load a model and check its performance on a standard dataset:
from ultralytics import YOLO
# Load a pretrained YOLO26 model
model = YOLO("yolo26n.pt")
# Validate the model on the COCO8 dataset
# The results object contains metrics like Precision, Recall, and mAP
metrics = model.val(data="coco8.yaml")
# Access and print the mean recall score for box detection
print(f"Mean Recall: {metrics.results_dict['metrics/recall(B)']:.4f}")
This code utilizes the Ultralytics API to run validation. If the recall is lower than required for your project, you might consider techniques like data augmentation to create more varied training examples or hyperparameter tuning to adjust the model's sensitivity. Using the Ultralytics Platform can also streamline the process of managing datasets and tracking these metrics over multiple training runs.
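If you decide to retrain with heavier augmentation, the sketch below shows one way this could look. The augmentation settings (hsv_h, degrees, fliplr, mosaic) are standard Ultralytics training arguments, but the values here are purely illustrative, not tuned recommendations:

from ultralytics import YOLO

# Load the pretrained model to fine-tune
model = YOLO("yolo26n.pt")

# Retrain with stronger augmentation so the model sees more varied examples
model.train(
    data="coco8.yaml",  # small demo dataset; swap in your own data config
    epochs=50,
    hsv_h=0.015,   # random hue shift
    degrees=10.0,  # random rotation in degrees
    fliplr=0.5,    # probability of horizontal flip
    mosaic=1.0,    # mosaic augmentation strength
)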
To boost a model's recall, data scientists often adjust the confidence threshold used during inference. Lowering the threshold makes the model more "optimistic," accepting more predictions as positive, which increases recall but may decrease precision. Additionally, collecting more diverse training data helps the model learn to recognize hard negatives and obscure instances. For complex tasks, employing advanced architectures like Transformer blocks or exploring ensemble methods can also improve the system's ability to detect subtle features that simpler models might miss.
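As a brief sketch of threshold adjustment at inference time: Ultralytics prediction accepts a conf argument (default 0.25), and lowering it trades precision for recall. The image path below is a placeholder:

from ultralytics import YOLO

# Load the pretrained model used earlier
model = YOLO("yolo26n.pt")

# Lower the confidence threshold from the 0.25 default to catch more objects;
# this raises recall at the likely cost of extra false positives
results = model.predict("path/to/image.jpg", conf=0.10)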