all projects
Inspection
ViT-AnomalyDetection
A Vision-Transformer autoencoder for industrial inspection: reconstruct the image, then localize defects from patch-wise reconstruction error.
An unsupervised industrial-inspection model: train only on “good” parts, then flag anything the network cannot reconstruct well.
Approach
- ViT autoencoder. A pretrained Vision-Transformer encoder feeds a custom decoder that reconstructs the input image.
- Patch-wise error → localization. Anomalies are found where reconstruction error is high, producing a per-patch heatmap rather than just an image-level score.
- Why it matters. This maps directly to the kind of defect-detection work I did as a Computer Vision Engineer at Schwarz IT.