Consistency Models

일관성 모델(consistency models)이 어떻게 단일 단계에서 빠르고 고품질의 생성형 AI를 가능하게 하는지 알아보십시오. 실시간 추론을 위해 확산 모델(diffusion models)과 어떻게 다른지 배우십시오.

Generative artificial intelligence has made massive leaps in visual fidelity, but processing speed often remains a bottleneck. Consistency models are an advanced family of generative AI architectures designed to create high-quality data in a single step or very few steps, bypassing the computationally expensive sampling processes required by earlier probabilistic frameworks. Originally introduced in foundational machine learning research by OpenAI, this approach establishes a new standard for rapid data synthesis.

이 네트워크들은 수백 단계에 걸쳐 점진적으로 노이즈를 제거하는 대신, 노이즈가 있는 데이터 포인트를 깨끗한 원본 형태로 직접 연결하는 수학적 매핑을 학습합니다. 모델은 특정 노이즈 궤적을 따라 상미분 방정식(ODEs)을 해결함으로써, 해당 경로상의 모든 포인트가 정확히 동일한 최종 출력값으로 매핑되도록 보장합니다. 이러한 "일관성(consistency)" 속성 덕분에 실무자는 중간 단계를 완전히 건너뛸 수 있습니다. Google DeepMind의 발전과 같은 폭넓은 혁신에서 영감을 받은 최근의 Latent Consistency Models (LCMs)와 같은 돌파구들은 이 과정을 더욱 최적화했습니다. 압축된 잠재 공간(latent space)에서 작동함으로써, LCMs는 메모리 요구 사항을 획기적으로 줄이고 text-to-image 생성 파이프라인을 가속화합니다.

Link to this sectionConsistency Models 대 Diffusion Models#

이 아키텍처를 Diffusion Models와 비교할 때, 가장 큰 차이점은 생성 타임라인에 있습니다. 기존의 diffusion 프레임워크는 이미지를 구성하기 위해 점진적이고 반복적인 노이즈 제거 루프에 의존하는 반면, consistency models는 실시간 추론을 위해 명시적으로 설계되었습니다. Diffusion은 놀라운 세부 묘사를 제공하지만 실시간 사용자 대면 애플리케이션에는 너무 느린 경우가 많으므로, 낮은 추론 지연 시간이 엄격한 프로젝트 제약 조건일 때 더 최신인 consistency 기반 접근 방식이 선호됩니다.

Link to this section실제 애플리케이션 사례#

고성능 출력을 즉각적으로 생성하는 능력은 빠르게 변화하는 다양한 산업 전반에서 새로운 가능성을 열어줍니다.

인터랙티브 미디어 및 비디오 게임: 게임 개발자들은 이러한 초고속 네트워크를 사용하여 동적이고 즉각적인 텍스처와 시각적 에셋을 생성하며, 렌더링 엔진을 멈추지 않고도 반응형 가상 환경을 구현합니다.
합성 데이터 생성: 의료 영상 분석과 같은 전문 분야에서 엔지니어들은 이러한 아키텍처를 배포하여 다양한 학습 데이터를 신속하게 합성합니다. 이는 계산 예산이 엄격히 제한된 에지 컴퓨팅 하드웨어 및 에지 AI 환경에서 특히 유용합니다.

Link to this section현대 컴퓨터 비전에서의 속도#

The pursuit of low-latency execution isn't limited to generative media; it is a universal goal across all forms of computer vision. For instance, Ultralytics YOLO26 is engineered entirely for native end-to-end efficiency. By eliminating post-processing bottlenecks, it enables real-time computing for both object detection and complex image segmentation tasks. For broader model optimization, developers can effortlessly manage datasets, train rapid models, and deploy them using the Ultralytics Platform.

The following code example demonstrates how to perform high-speed, single-pass inference using the highly optimized yolo26n.pt model, utilizing hardware acceleration via PyTorch to mirror the modern industry demand for rapid machine learning operations:

from ultralytics import YOLO

# Load the lightning-fast YOLO26 nano model for low-latency visual tasks
model = YOLO("yolo26n.pt")

# Perform a rapid, single-step prediction on an input image using GPU acceleration
results = model.predict(source="image.jpg", conf=0.5, device="cuda")

Explore solutions

로봇 공학에서의 AI

Ultralytics YOLO 모델로 더 스마트한 기기를 구동하십시오. 로봇 공학의 비전 AI는 자율 주행, 인식, 객체 추적 및 실시간 제어를 촉진합니다.

Consistency Models

Link to this sectionConsistency Models 대 Diffusion Models#

Link to this section실제 애플리케이션 사례#

Link to this section현대 컴퓨터 비전에서의 속도#

Explore solutions

로봇 공학에서의 AI

물류 분야의 AI

소매업에서의 AI

의료 분야의 AI

제조 분야의 AI

자동차 분야의 AI

농업 분야의 AI

로봇 공학에서의 AI

물류 분야의 AI

소매업에서의 AI

의료 분야의 AI

제조 분야의 AI

자동차 분야의 AI

농업 분야의 AI

로봇 공학에서의 AI

물류 분야의 AI

소매업에서의 AI

의료 분야의 AI

제조 분야의 AI

자동차 분야의 AI

농업 분야의 AI

미래의 AI를 함께 구축합시다!