Metrics#

Anomalib provides a comprehensive set of metrics for evaluating anomaly detection model performance. All metrics extend TorchMetrics’ functionality with Anomalib-specific features.

Available Metrics#

Area Under Curve Metrics#

AUROC

Area Under the Receiver Operating Characteristic curve. Measures the model’s ability to distinguish between normal and anomalous samples.

anomalib.metrics.AUROC
AUPR

Area Under the Precision-Recall curve. Particularly useful for imbalanced datasets.

anomalib.metrics.AUPR
AUPRO

Area Under the Per-Region Overlap curve. Evaluates pixel-level anomaly localization performance.

anomalib.metrics.AUPRO
AUPIMO

Area Under the Per-Image Missed Overlap curve. Advanced metric for evaluating localization quality.

anomalib.metrics.AUPIMO

F1 Score Metrics#

F1Score

Standard F1 score for binary classification. Harmonic mean of precision and recall.

anomalib.metrics.F1Score
F1Max

Maximum F1 score across all possible thresholds. Useful for finding optimal operating points.

anomalib.metrics.F1Max

Threshold Metrics#

F1AdaptiveThreshold

Automatically determines the optimal threshold by maximizing F1 score.

anomalib.metrics.F1AdaptiveThreshold
ManualThreshold

Uses a manually specified threshold for classification.

anomalib.metrics.ManualThreshold

Other Metrics#

PRO

Per-Region Overlap score for evaluating pixel-level localization.

anomalib.metrics.PRO
PIMO

Per-Image Missed Overlap for assessing localization errors.

anomalib.metrics.PIMO
PGn

Presorted Good with n% bad samples missed. Measures false negative rate at specific operating points.

anomalib.metrics.PGn
PBn

Presorted Bad with n% good samples misclassified. Measures false positive rate at specific operating points.

anomalib.metrics.PBn
MinMax

Normalizes anomaly scores to [0, 1] range using min-max scaling.

anomalib.metrics.MinMax
AnomalyScoreDistribution

Analyzes and tracks the distribution of anomaly scores for model diagnostics.

anomalib.metrics.AnomalyScoreDistribution

Utility Classes#

AnomalibMetric

Base class for all Anomalib metrics. Extends TorchMetrics with field-based updates.

anomalib.metrics.AnomalibMetric
Evaluator

Orchestrates multiple metrics for comprehensive model evaluation.

anomalib.metrics.Evaluator
BinaryPrecisionRecallCurve

Computes precision-recall curves for binary classification tasks.

anomalib.metrics.BinaryPrecisionRecallCurve

API Reference#