UniNet

UniNet#

Lightning model for UniNet.

class anomalib.models.image.uninet.lightning_model.UniNet(student_backbone='wide_resnet50_2', teacher_backbone='wide_resnet50_2', temperature=0.1, pre_processor=True, post_processor=True, evaluator=True, visualizer=True)#

Bases: AnomalibModule

UniNet model for anomaly detection.

Parameters:

student_backbone (str) – The backbone model to use for the student network. Defaults to “wide_resnet50_2”.
teacher_backbone (str) – The backbone model to use for the teacher network. Defaults to “wide_resnet50_2”.
temperature (float) – Temperature parameter used for contrastive loss. Controls the temperature of the student and teacher similarity computation. Defaults to 0.1.
pre_processor (PreProcessor | bool, optional) – Preprocessor instance or bool flag. Defaults to True.
post_processor (PostProcessor | bool, optional) – Postprocessor instance or bool flag. Defaults to True.
evaluator (Evaluator | bool, optional) – Evaluator instance or bool flag. Defaults to True.
visualizer (Visualizer | bool, optional) – Visualizer instance or bool flag. Defaults to True.

configure_optimizers()#

Configure optimizers for training.

Returns:: Optimizers for student and target teacher.
Return type:: tuple[torch.optim.Optimizer, torch.optim.Optimizer]

property learning_type: LearningType#: The model uses one-class learning.

property trainer_arguments: dict[str, Any]#: Does not require any trainer arguments.

training_step(batch, *args, **kwargs)#

Perform a training step of UniNet.

Return type:: Union[Tensor, Mapping[str, Any], None]

validation_step(batch, *args, **kwargs)#

Perform a validation step of UniNet.

Return type:: Union[Tensor, Mapping[str, Any], None]

PyTorch model for UniNet.

See also

anomalib.models.image.uninet.lightning_model.UniNet:: UniNet Lightning model.

class anomalib.models.image.uninet.torch_model.Teachers(teacher_backbone)#

Bases: Module

Teachers module for UniNet.

Parameters:

source_teacher (nn.Module) – Source teacher model.
target_teacher (nn.Module | None) – Target teacher model.

forward(images)#

Forward pass of the teachers.

Parameters:: images (torch.Tensor) – Input images.
Returns:: Source features or source and target features.
Return type:: torch.Tensor | tuple[torch.Tensor, list[torch.Tensor]]

class anomalib.models.image.uninet.torch_model.UniNetModel(student_backbone, teacher_backbone, loss)#

Bases: Module

UniNet PyTorch model.

It consists of teachers, student, and bottleneck modules.

Parameters:

student_backbone (str) – Student backbone model.
teacher_backbone (str) – Teacher backbone model.
loss (nn.Module) – Loss function.

forward(images, masks=None, labels=None)#

Forward pass of the UniNet model.

Parameters:

images (torch.Tensor) – Input images.
masks (torch.Tensor | None) – Ground truth masks.
labels (torch.Tensor | None) – Ground truth labels.

Returns:

Loss or InferenceBatch.

Return type:

torch.Tensor | InferenceBatch

Loss functions for UniNet.

class anomalib.models.image.uninet.components.loss.UniNetLoss(lambda_weight=0.7, temperature=2.0)#

Bases: Module

Loss function for UniNet.

Parameters:

lambda_weight (float) – Hyperparameter for balancing the loss. Defaults to 0.7.
temperature (float) – Temperature for contrastive learning. Defaults to 2.0.

forward(student_features, teacher_features, margin=1, mask=None, stop_gradient=False)#

Compute the loss.

Parameters:

student_features (list[torch.Tensor]) – Student features.
teacher_features (list[torch.Tensor]) – Teacher features.
margin (int) – Hyperparameter for controlling the boundary.
mask (torch.Tensor | None) – Mask for the prediction. Mask is of shape Bx1xHxW
stop_gradient (bool) – Whether to stop the gradient into teacher features.

Return type:

Tensor

Utilities for computing anomaly maps.

anomalib.models.image.uninet.components.anomaly_map.weighted_decision_mechanism(batch_size, output_list, alpha, beta, output_size=(256, 256))#

Compute anomaly maps using weighted decision mechanism.

Parameters:

batch_size (int) – Batch size.
output_list (list[torch.Tensor]) – List of output tensors, each with shape [batch_size, H, W].
alpha (float) – Alpha parameter. Used for controlling the upper limit
beta (float) – Beta parameter. Used for controlling the lower limit
output_size (tuple[int, int]) – Output size.

Returns:

Anomaly score and anomaly map.

Return type:

tuple[torch.Tensor, torch.Tensor]

Attention Bottleneck for UniNet.

class anomalib.models.image.uninet.components.attention_bottleneck.AttentionBottleneck(inplanes, planes, stride=1, downsample=None, groups=1, base_width=64, norm_layer=None, attention=True, halve=1)#

Bases: Module

Attention Bottleneck block for UniNet with dual-branch processing.

This module implements a specialized bottleneck block that can operate in two modes: - Standard bottleneck (when halve=1) - Dual-branch attention mechanism (when halve=2)

The dual-branch mode splits input channels and processes them through different kernel sizes (3x3 and 7x7) to capture features at different receptive field scales, then fuses them before the final expansion layer.

Architecture:

Input: inplanes channels
Intermediate: width channels (compressed for efficiency)
Output: planes * expansion channels (expanded representation)

Parameters:

inplanes (int) – Number of input channels.
planes (int) – Base number of channels for intermediate processing. Final output will have planes * expansion channels.
stride (int, optional) – Stride for convolution layers. Defaults to 1.
downsample (nn.Module | None, optional) – Module for downsampling the residual connection when dimensions don’t match. Defaults to None.
groups (int, optional) – Number of blocked connections from input to output channels. Defaults to 1.
base_width (int, optional) – Base width for calculating intermediate channel width. Defaults to 64.
norm_layer (Callable[..., nn.Module] | None, optional) – Normalization layer to use. If None, uses BatchNorm2d. Defaults to None.
attention (bool, optional) – Whether to use attention mechanism. Defaults to True.
halve (int, optional) – Controls processing mode: - 1: Standard bottleneck processing - 2: Dual-branch processing with 3x3 and 7x7 kernels Defaults to 1.

channel_expansion#

Channel expansion factor. Final output channels will be planes * channel_expansion. Set to 4 following ResNet conventions.

Type:: int

Example

>>> import torch
>>> from anomalib.models.image.uninet.components.attention_bottleneck import (
...     AttentionBottleneck
... )
>>> # Standard bottleneck
>>> block = AttentionBottleneck(256, 64, halve=1)
>>> x = torch.randn(32, 256, 28, 28)
>>> output = block(x)
>>> output.shape  # Output: 32, 256, 28, 28 (64 * 4 = 256)
torch.Size([32, 256, 28, 28])

>>> # Dual-branch attention bottleneck
>>> block = AttentionBottleneck(512, 128, halve=2)
>>> x = torch.randn(32, 512, 14, 14)
>>> output = block(x)
>>> output.shape  # Output: 32, 512, 14, 14 (128 * 4 = 512)
torch.Size([32, 512, 14, 14])

Notes

When halve=2, input is split into two branches processed by 3x3 and 7x7 convolutions respectively to capture multi-scale features
The merge_kernel method can fuse batch norm parameters into convolution weights for inference optimization
Residual connections are used following ResNet design principles
The width calculation: int(planes * (base_width / 64.0)) * groups
The channel_expansion is set to 4 following ResNet conventions.

UniNet

Contents

UniNet#