Synthetic Defect Generation Comprehensive Survey Report
Executive Summary
Anomaly detection has the advantage that it can be trained using only good-product images; however, NG images are indispensable for threshold adjustment and performance evaluation, and when only one or two NG images can be obtained in the field, sufficient operation is difficult. This report comprehensively analyzes the latest research trends in Synthetic Defect Generation, which solves this issue, from three perspectives: web research, academic papers, and community practice, covering 2023 to 2026.
Most important finding: Since 2023, synthetic defect generation has diverged into four technical trends. (1) Diffusion-based methods (AnomalyDiffusion, SeaS, DefectFill, etc.) have become mainstream in terms of image quality, (2) feature-space synthesis methods (GLASS, CRAS) are the strongest in compatibility with PatchCore, (3) copy-paste methods (DRAEM, NSA) have become established as lightweight implementation baselines, and (4) few-shot-driven methods (TF-IDG, etc.) pursue practicality by generating from a single NG image. In particular, GLASS, presented at ECCV 2024, synthesizes anomalies in feature space using only normal images and achieved AUROC 99.9% on MVTec AD. It received the highest evaluation in two out of three sources as the method that can be most directly used for PatchCore threshold design.
Implications for practical operation: A workflow for designing thresholds using synthetic NG is promising, but there is a fundamental issue that the distribution gap between synthetic NG and real NG has not been systematically verified (explicitly pointed out only by academic sources). Thresholds determined using synthetic NG must always undergo final validation with real NG. As a recommended step-by-step introduction flow, this report proposes Phase 1: GLASS (no NG required) → Phase 2: TF-IDG/AnomalyDiffusion (expansion with one to two NG images) → Phase 3: SeaS + AnomalousPatchCore (advanced use with five or more NG images).
Risks to note: The domain gap problem, in which high accuracy on MVTec AD (97 to 99%) does not directly translate to actual factories, was pointed out by all three sources, and quality control of synthetic defects and review by domain experts are indispensable.
1. Overall Picture of Research Trends
1.1 Technical Transition from 2021 to 2026
Confidence: High (agreement across all three sources)
From 2021 to 2022, copy-paste-based methods such as CutPaste, NSA, and DRAEM appeared, establishing a framework that uses synthetic defects for learning from only good-product images. Since 2023, high-quality generation using Diffusion Models has become mainstream, and at the same time Feature-space synthesis (anomaly synthesis in feature space) has emerged as a method for direct integration with PatchCore.
2021 ── CutPaste (CVPR), DRAEM (ICCV): Establishment of copy-paste methods
2022 ── PatchCore (CVPR): Establishment of memory-bank-based AD SOTA, NSA (ECCV)
2023 ── Rise of diffusion-based methods, Optimizing PatchCore (few-shot)
2024 ── GLASS (ECCV), RealNet (CVPR), AnomalyDiffusion (AAAI),
DualAnoDiff, AnomalousPatchCore, COFT-AD (IEEE TIP),
AnoGen (ECCV)
2025 ── SeaS (ICCV), TF-IDG (ICCV), DefectFill (CVPR),
CRAS (TII), Sequential PatchCore
2026 ── Transition period toward zero-shot use of foundation models
(SAM, CLIP, DINOv2)
1.2 Four Technical Trends
Confidence: High (agreement across all three sources)
| Trend | Representative methods | Maturity level | Compatibility with anomaly detection |
|---|---|---|---|
| Diffusion-based | AnomalyDiffusion, SeaS, DefectFill, TF-IDG | Rapidly developing | Indirect (image level) |
| Feature-space synthesis | GLASS, CRAS, COFT-AD | Promising at research level | Highest (direct operation in feature space) |
| Copy-paste-based | CutPaste, DRAEM, NSA, Sequential PatchCore | Mature | Medium (as preprocessing) |
| Few-shot-driven | AnomalousPatchCore, AnoGen | Developing | High (including direct PatchCore extension) |
1.3 Decline of GAN-based Methods
Confidence: High (agreement across all three sources)
GAN-based methods were mainstream before 2022, but due to the rise of diffusion-based methods, new research since 2023 has decreased significantly. Fundamental problems such as training instability, mode collapse, and difficulty in training with a small number of NG images have not been resolved, and the shift to diffusion-based methods is proceeding irreversibly. Their use is limited to cases such as CycleGAN-based domain transformation that continues to be used in semiconductor and PCB inspection.
2. Analysis by Method Category
2.1 Diffusion-based Methods (Mainstream from 2023 to 2025)
Confidence: High
A group of methods capable of generating high-quality and highly diverse defect images from a small number of NG images. They are based on Stable Diffusion / Latent Diffusion Models and specialize them for industrial defects by combining techniques such as spatial control, text control, and intensity control.
Strengths:
- Can generate visually natural defect images, making quality confirmation by humans easy [Community]
- Since they are generated with masks, they can also be used for segmentation evaluation [Academic]
- Many methods are explicitly designed to work in few-shot settings (one to five NG images) [Web, Academic]
Weaknesses:
- Fine-tuning requires high GPU cost (A100/V100 equivalent, several hours to one day) [Community]
- They do not directly operate the feature space of PatchCore, and their effect is indirect [Academic, Cross-validation]
- Generation of minute defects (one to five pixels) can become unstable during the denoising process [Academic]
| Method | Characteristics | Recommended scenario |
|---|---|---|
| AnomalyDiffusion (AAAI 2024) | Position control through spatial embedding. Highly reliable | When defect types are clear |
| DualAnoDiff (2024) | Strengthens background consistency with two branches | When high-quality generation is prioritized |
| SeaS (ICCV 2025) | Synthesizes unknown defects through attribute decomposition | When defect attributes are diverse |
| RealNet (CVPR 2024) | Synthesis with intensity control | Continuous generation from weak to obvious defects |
| DefectFill (CVPR 2025) | Precise local defects through inpainting | High-quality generation from a small number of reference defects |
| TF-IDG (ICCV 2025) | Training-free, one-shot | Lowest introduction barrier |
2.2 Feature-space Anomaly Synthesis Methods (Highest Compatibility with PatchCore)
Confidence: High
An approach that directly synthesizes anomalies in feature-vector space rather than image space. Since it operates in the same feature representation space as PatchCore’s memory bank, it can be used most directly for threshold design. It has been increasing rapidly from 2024 to 2025.
Strengths:
- Operates in the same space as PatchCore’s memory bank and directly connects to threshold calibration [Web, Academic, Community]
- Many methods work even without NG images (normal images only) [Academic]
- Can target and generate “weak defects” near the boundary of the normal distribution, making it strong for minute defects [Academic]
Weaknesses:
- Since the generated results are feature vectors, “visualization” is difficult, making intuitive quality confirmation by humans difficult [Academic]
- Hyperparameter adjustment for Gradient Ascent is necessary, and domain-specific tuning is required [Community]
| Method | Characteristics | Recommended scenario |
|---|---|---|
| GLASS (ECCV 2024) | Gradient Ascent + GAS/LAS. SOTA for minute defects | Best for PatchCore threshold design |
| CRAS (TII 2025) | Residual-based synthesis. Multi-class support | Unified management of multiple product categories |
| COFT-AD (IEEE TIP 2024) | Feature-space shaping through contrastive learning | Backbone enhancement without NG |
Relationship between GLASS and CRAS: Both methods are studies by the same author group (cqylunlun). GLASS is the representative feature-space synthesis method presented at ECCV 2024, and CRAS can be positioned as a version extended for multi-class support in TII 2025. When selecting an implementation, GLASS is suitable for single-class inspection, and CRAS is suitable for unified management of multiple classes.
2.3 Copy-paste Methods (Lightweight and Easy to Implement)
Confidence: High
The simplest approach for generating synthetic defects by cutting and pasting patches from good-product images or by overlaying external textures.
Strengths: Easy implementation, no additional data required, low computational cost [all three sources]
Weaknesses: Synthetic defects tend to diverge from actual industrial defects (scratches, dents, contamination, cracks) in terms of appearance and feature distribution [agreement across all three sources]
“Random texture pasting in DRAEM-style methods works on paper benchmarks, but actual metal dents and cracks have natural boundaries with the background, and this kind of synthesis produces a completely different feature distribution.” — Practitioner community [Community]
2.4 Direct PatchCore Extension Methods
Confidence: High
A group of methods that strengthen PatchCore’s architecture and pipeline with minimal changes.
| Method | Approach | NG image requirement |
|---|---|---|
| AnomalousPatchCore (ECCV WS 2024) | Fine-tuning the feature extractor with NG images | One to several NG images |
| Sequential PatchCore (ECCV WS 2024) | Pre-training with synthetic impurities + fine-tuning with real data | Not required (replaced with synthesis) |
| Optimizing PatchCore (2023) | Hyperparameter tuning + leave-one-out threshold calibration | Not required |
3. Integrated Comparison Table of Major Papers
The following integrated evaluation reflects cross-validation results and corrects overestimation.
| # | Paper title | Year / Conference | Category | Paper link | GitHub link | Minute defects | Few NG | PatchCore compatibility | Evaluation dataset |
|---|---|---|---|---|---|---|---|---|---|
| 1 | GLASS | 2024 / ECCV | Feature-space | arXiv | GitHub | ★★★ | ★★★ | ★★★ | MVTec AD, VisA, MPDD |
| 2 | AnomalyDiffusion | 2024 / AAAI | Diffusion | arXiv | GitHub | ★★☆ | ★★★ | ★★☆ | MVTec AD |
| 3 | SeaS | 2025 / ICCV | Diffusion | arXiv | GitHub | ★★★ | ★★★ | ★★☆ | MVTec AD, MVTec 3D AD |
| 4 | RealNet | 2024 / CVPR | Diffusion+Feature | arXiv | GitHub | ★★★ | ★★☆ | ★★☆ | MVTec AD, VisA, MPDD, BTAD |
| 5 | DefectFill | 2025 / CVPR | Diffusion (Inpainting) | arXiv | Unreleased | ★★★ | ★★★ | ★★☆ | MVTec AD |
| 6 | TF-IDG | 2025 / ICCV | Diffusion (Training-free) | CVF | Unconfirmed | ★★★ | ★★★ | ★★☆ | MVTec AD, VisA |
| 7 | AnomalousPatchCore | 2024 / ECCV WS | PatchCore extension | arXiv | Unreleased | ★★☆ to ★★★ | ★★★ | ★★★ | MVTec AD |
| 8 | DualAnoDiff | 2024 / arXiv | Diffusion | arXiv | GitHub | ★★★ | ★★★ | ★★☆ | MVTec AD, VisA |
| 9 | CRAS | 2025 / TII | Feature-space | arXiv | GitHub | ★★★ | ★★☆ | ★★★ | MVTec AD, VisA, MPDD, ITDD |
| 10 | Sequential PatchCore | 2025 / ECCV WS | Copy-paste+PatchCore | arXiv | Unreleased | ★★☆ | ★★★ | ★★★ | Proprietary surface inspection data |
| 11 | COFT-AD | 2024 / IEEE TIP | Feature-space | arXiv | Unreleased | ★★☆ | ★★★ | ★★★ | MVTec AD |
| 12 | AnoGen | 2024 / ECCV | Diffusion+Few-shot | arXiv | GitHub | ★★☆ | ★★★ | ★★☆ | MVTec AD |
| 13 | Optimizing PatchCore | 2023 | PatchCore extension | arXiv | GitHub | ★★☆ | ★★★ | ★★★ | MVTec AD, VisA |
| 14 | AdaBLDM | 2024 | Diffusion | arXiv | GitHub | ★★☆ | ★★☆ | ★★☆ | MVTec AD |
Note on evaluation criteria: Through cross-validation, items overestimated in the Community report (RealNet’s few-NG support, SeaS/TF-IDG’s PatchCore compatibility, etc.) have been corrected. ★★★ means strongest class for the relevant perspective, ★★☆ means practical but with constraints, and ★☆☆ means baseline level.
4. Detailed Explanation of Important Papers
4.1 GLASS (ECCV 2024) — Strongest Feature-space Synthesis
Confidence: High (Academic/Community agreement, highest evaluation through cross-validation)
Official title: A Unified Anomaly Synthesis Strategy with Gradient Ascent for Industrial Anomaly Detection and Localization
Synthesis method: Adopts a two-stage unified synthesis strategy.
- Global Anomaly Synthesis (GAS): Generates near-in-distribution anomaly vectors near the manifold boundary of normal features in feature space through Gradient Ascent + Gaussian noise. Truncated Projection prevents excessive distribution deviation.
- Local Anomaly Synthesis (LAS): Operates local textures/patterns at the image level and synthesizes realistic minute defects.
Support for minute defects: ★★★ Strongest class. Designed with “weak defect” detection as the core issue. MVTec AD AUROC 99.9% is a result of this.
Few-NG support: ★★★ Fully supported. Operates only with normal images. Can generate a large amount of synthetic NG for threshold calibration without requiring NG images.
PatchCore compatibility: ★★★ Highest. Operates in the same feature space as PatchCore’s memory bank and can be directly used for threshold design.
Advantages: Can operate with zero NG images. Strong for minute defects. Official code available.
Weaknesses: Since it is feature-space synthesis, “visualizing” the generated results is difficult. Hyperparameter adjustment for Gradient Ascent is necessary [Community].
4.2 AnomalyDiffusion (AAAI 2024) — Representative Diffusion-based Method
Confidence: High (agreement across all three sources)
Official title: AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model
Synthesis method: Based on a Latent Diffusion Model, it introduces Spatial Anomaly Embedding (separately encoding the appearance and location of anomalies) and Adaptive Attention Re-weighting (dynamic enhancement of attention during generation).
Support for minute defects: ★★☆ Medium. Small defects can also be generated through mask-based spatial control, but minute details tend to be lost during the denoising process of diffusion. (Note: The Web report evaluated it as ◎, but ★★☆ from both Academic and Community sources is more technically accurate.)
Few-NG support: ★★★ Few-shot design is an explicit objective. It fine-tunes an LDM from several NG images and generates diverse defect patterns.
Advantages: Since NG data is obtained as images, human confirmation and selection are easy. Masked generation can also be used for segmentation evaluation.
Weaknesses: LDM fine-tuning requires GPU time. Generation diversity depends on the number of few-shot images.
4.3 SeaS (ICCV 2025) — Latest Few-shot Generation through Attribute Decomposition
Confidence: High (Academic/Community agreement)
Official title: SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning
Synthesis method: Stable Diffusion fine-tuning based. Combines Separation (separate learning of anomaly tokens and normal tokens) and Sharing (generating both normal and anomalous images with the same U-Net). With Decoupled Anomaly Alignment Loss, anomaly attributes are decomposed and recombined into multiple tokens, enabling synthesis of unknown anomaly patterns.
Support for minute defects: ★★★ Synthesizes diverse minute defects such as small scratches, contamination, and unevenness through attribute decomposition. Generates high-resolution masks.
Few-NG support: ★★★ Strongest class. Can extract and recombine diverse defect attributes from several images.
PatchCore compatibility: ★★☆ Indirect. It is reported that generated normal/anomaly pairs improve unsupervised AD (including PatchCore) by +1.10% image-level AP and +8.66% pixel AP.
Advantages: Since normal images can also be generated at the same time, it can be used to augment the entire training dataset.
Weaknesses: Because it is based on Stable Diffusion, it requires a large amount of VRAM.
4.4 DefectFill (CVPR 2025) — Inpainting-type Precise Defect Generation
Confidence: Medium-High (Web/Community agreement, not mentioned in Academic)
Official title: DefectFill: Realistic Defect Generation with Inpainting Diffusion Model for Visual Inspection
- Paper: https://arxiv.org/abs/2503.13985
- GitHub: Unreleased (because it was presented in 2025)
Synthesis method: Fine-tunes an Inpainting Diffusion Model specifically for defects. Improves generation quality through a custom loss function combining defect loss, object loss, and attention loss, plus Low-Fidelity Selection.
Few-NG support: ★★★ Explicitly designed for generation from a small number of reference defect samples.
Weaknesses: Presented at CVPR 2025 and the official GitHub has not been released. Reproducibility has not yet been verified.
4.5 TF-IDG (ICCV 2025) — Training-free One-shot Generation
Confidence: Medium-High (Web/Community agreement)
Official title: Training-Free Industrial Defect Generation with Diffusion Models
- Paper: CVF PDF
- GitHub: https://github.com/rubymiaomiao/TF-IDG
Synthesis method: A training-free defect generation framework. Operates in a one-shot setting with a single NG image.
Greatest advantage: Since fine-tuning is unnecessary, GPU cost is low, and the barrier to field introduction is the lowest.
Weaknesses: GitHub unconfirmed, and community track record is limited.
4.6 AnomalousPatchCore (ECCV 2024 Workshop) — Direct PatchCore Extension
Confidence: High (Academic/Community agreement)
Official title: AnomalousPatchCore: Exploring the Use of Anomalous Samples in Industrial Anomaly Detection
- Paper: https://arxiv.org/abs/2408.15113
- GitHub: Unreleased
Synthesis method/approach: Fine-tunes PatchCore’s feature extractor through contrastive learning using normal + anomalous samples. Supports three auxiliary tasks: anomaly classification, localization, and class imbalance correction.
PatchCore compatibility: ★★★ Directly compatible. Can strengthen an existing PatchCore pipeline with minimal changes.
Weaknesses: Code has not been released, so reproduction implementation is required. Countermeasures against fine-tuning instability with few NG images are limited.
4.7 RealNet (CVPR 2024) — Intensity-controllable Diffusion Synthesis
Confidence: High (mentioned by all three sources, but evaluations conflict)
Official title: RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly Detection
Synthesis method: Synthesizes anomalies by continuously controlling anomaly intensity using SDAS (Strength-controllable Diffusion Anomaly Synthesis). It has a three-component structure combining AFS (feature selection) and RRS (residual selection).
Important note: The Community report evaluated RealNet as “no NG required (★★★),” but as a result of cross-validation, the Web report’s “large-scale synthesis is assumed (★★☆)” is more accurate. RealNet is designed to use 360k pre-synthesized images, making it difficult to use as a direct generation flow from a small number of NG images.
5. Compatibility Analysis with AI Anomaly Detection and Threshold Design
5.1 Three Roles of NG Images in AI Anomaly Detection
Confidence: High (agreement across all three sources)
- Threshold setting: Check the distribution of anomaly scores and adjust the balance between FP/FN
- Performance evaluation: Calculate AUROC, F1 score, AU-PR, etc.
- Defect pattern analysis: Identify which kinds of defects are difficult to detect
5.2 Compatibility Mapping with AI Anomaly Detection by Method
Confidence: High (cross-validation corrected)
Tier 1: Directly Integrable with AI Anomaly Detection (★★★)
| Method | Integration method | NG data requirement |
|---|---|---|
| GLASS | Normal images only → synthetic NG generation in feature space → direct use for PatchCore threshold design | Not required |
| CRAS | Multi-class feature-space synthesis → memory bank expansion | Not required |
| AnomalousPatchCore | Fine-tune the feature extractor with NG images | One to several images |
| Sequential PatchCore | Pre-training with synthetic impurities → coreset update with real data | Not required (replaced with synthesis) |
| Optimizing PatchCore | Leave-one-out + statistical threshold calibration | Not required |
| COFT-AD | Directly insert the fine-tuned backbone into PatchCore | Not required |
Tier 2: Indirectly Effective as Data Augmentation (★★☆)
| Method | Usage | Notes |
|---|---|---|
| AnomalyDiffusion | Synthetic NG images → threshold adjustment data | High fine-tuning cost |
| SeaS | High-quality synthetic NG + masks → evaluation data augmentation | Requires a large amount of VRAM |
| DefectFill | Generate realistic defect images from a small number of reference defects | GitHub unreleased |
| TF-IDG | Mass-generate synthetic NG with one-shot, training-free → immediate construction of evaluation data | Not specialized for PatchCore |
| DualAnoDiff | Synthetic NG with high background consistency → evaluation data | High computational cost due to two branches |
| RealNet | Generate boundary samples with intensity control | Requires 360k pre-synthesized images |
5.3 Fundamental Caution Regarding Threshold Design
Confidence: High (unique academic-source point, extremely important)
The extent to which a threshold determined using synthetic NG is effective for the actual NG distribution has not been systematically verified. Threshold determination methods are not standardized across papers, making comparison difficult. Synthetic-NG-based thresholds must always undergo final verification with real NG. [Academic]
This issue is directly connected to the core issue of this survey. All three sources recommend “threshold setting with synthetic NG,” but only the Academic report explicitly points out the fragility of this premise itself, so sufficient caution is required in practical operation.
5.4 Best Practices for Threshold Design
Confidence: Medium-High (integration of insights from multiple sources)
- Leave-one-out estimation: Estimate a beta distribution using only normal images and determine the initial threshold [Web, Community]
- Percentile-based: Set the 99.9th percentile of the anomaly score distribution of normal images as a provisional threshold [Community]
- ROC construction with synthetic NG: Generate 20 to 50 synthetic NG images with GLASS/TF-IDG and draw an ROC curve [Web]
- Final verification with real NG: Confirm and fine-tune the threshold derived from synthetic NG using one to two real NG images on hand [Academic]
- Review by domain experts: Inspectors perform final confirmation of the quality of synthetic NG and the validity of the threshold [Community]
6. Quantitative Benchmark Results
6.1 MVTec AD (Image-level AUROC)
Confidence: Medium-High (reported values from each paper. Note differences in experimental settings)
| Method | MVTec AD AUROC | Notes |
|---|---|---|
| PatchCore (baseline) | 99.1% | CVPR 2022 |
| GLASS | 99.9% | ECCV 2024, specialized for weak defects |
| RealNet | SOTA class | CVPR 2024 |
| SeaS (contribution to unsupervised AD) | +1.10% AP improvement | ICCV 2025 |
| AnoGen (contribution to DRAEM) | +5.8% AU-PR | ECCV 2024 |
| CRAS | 98.3% (Image), 98.0% (Pixel) | TII 2025 |
Note: MVTec AD is almost saturated with AUROC in the 97 to 99% range. The field is moving toward more difficult benchmarks (MVTec AD 2, VisA, MPDD, etc.). [Web, Academic]
6.2 List of Major Datasets
| Dataset | Characteristics | Number of categories | Difficulty |
|---|---|---|---|
| MVTec AD | Industrial parts and textures. Industry standard | 15 | Somewhat saturated |
| MVTec AD 2 | Eight new scenarios, 8000+ images | 8+ | High (released in 2025) |
| VisA | High-resolution industrial products | 12 | Higher than MVTec |
| MPDD | Dents and scratches on metal parts | 6 | Many minute defects |
| MVTec LOCO | Logical anomalies (misplacement, etc.) | 5 | Special |
| BTAD | Industrial products | 3 | Medium |
7. Practitioner Insights and Best Practices
7.1 Voices from the Field
Confidence: Medium-High (Community report only, but based on examples)
SEA Vision (IEEE ETFA 2025):
“By using Anti-Aliased WRN50 as the backbone, setting the input resolution to 448×448, and combining dedicated data augmentation, practical anomaly detection became possible even with only one normal image.”
DataRoots (PatchCore implementation blog 2024):
“PatchCore’s greatest strength is that it can be trained using only normal images. However, NG images are still required for threshold setting, which becomes a headache in the field from the beginning.”
Issue trends in the official PatchCore repository (amazon-science):
- “Is there a way to automatically determine the threshold?” — most common question category
- “When there are few NG images, the ROC curve cannot be drawn accurately” — common concern
7.2 Implementation Tuning Knowledge
Confidence: Medium-High [Community]
- Backbone: WideResNet50 is the most stable. Switching to DINOv2/ViT-based models is a recent trend
- Coreset ratio: 1 to 10% is sufficient (using all features is excessive)
- Input resolution: Adjust in the range of 224 to 448 px according to product size
- Integrated framework: anomalib is the most practical because it supports multiple methods including PatchCore
7.3 Typical Failure Patterns When Introducing Synthetic Defects
Confidence: Medium [Community]
- “Implemented the paper method as-is, but it did not work” → Domain adaptation to the company’s own product category is required
- “Using synthetic defects for training increased false positives” → Quality control and filtering of synthetic defects are essential
- “Tried diffusion fine-tuning but gave up due to insufficient GPU” → Training-free methods (TF-IDG, etc.) are realistic alternatives
8. Recommended Classification by Three Perspectives
8.1 Methods Strong in Minute Defect Generation (Scratches, Dents, Cracks, Foreign Matter, Unevenness)
Confidence: High
| Priority | Method | Category | Reason |
|---|---|---|---|
| ★★★ First recommendation | GLASS | Feature-space | Targets anomalies at the normal boundary through Gradient Ascent. Countermeasures for weak defects are the core of the design. Official code available [Academic, Community] |
| ★★★ Second recommendation | SeaS | Diffusion | Generates fine scratches, contamination, and unevenness with detailed control through attribute decomposition. High-resolution masks included [Academic] |
| ★★★ Third recommendation | DefectFill | Diffusion (Inpainting) | Fine defect generation through inpainting, quality assurance through Low-Fidelity Selection [Web] |
| ★★☆ Complementary | DualAnoDiff | Diffusion | Generates minute defects that do not “float” due to background consistency. High mask accuracy [Academic] |
| ★★☆ Complementary | RealNet | Diffusion+Feature | Continuous generation from weak to obvious defects through intensity-variable Diffusion [Academic, Web] |
Methods unsuitable for minute defects: Copy-paste methods (CutPaste, DRAEM) are often unsuitable for reproducing minute defects [agreement across all three sources]
8.3 Methods Suitable for AI Anomaly Detection Evaluation and Threshold Adjustment
Confidence: High
| Priority | Method | Specific usage |
|---|---|---|
| ★★★ | GLASS | Normal images only → synthetic NG generation in feature space → directly design PatchCore threshold |
| ★★★ | Optimizing PatchCore | Leave-one-out + statistical fitting for threshold calibration with almost no NG data required |
| ★★★ | AnomalousPatchCore | Fine-tune the feature extractor with one to two NG images → immediately strengthen PatchCore |
| ★★☆ | CRAS | Multi-class support. Unified threshold management for multiple product categories |
| ★★☆ | TF-IDG | One-shot, training-free mass generation of synthetic NG → immediate construction of evaluation data |
| ★★☆ | AnomalyDiffusion | Several NG images → generation of diverse synthetic NG → use as threshold adjustment data |
9 Reference Surveys and Awesome Lists
| Resource | URL | Content |
|---|---|---|
| A Survey on Industrial Anomalies Synthesis (2025) | arXiv | IAS taxonomy covering about 40 methods |
| Anomaly Detection with Diffusion Models Survey (2025) | arXiv / GitHub | Survey specialized in diffusion-based methods |
| awesome-industrial-anomaly-detection | GitHub | Comprehensive paper list for industrial AD |
| awesome-anomaly-synthesis | GitHub | List specialized in synthesis methods |
| Awesome-Few-Shot-Defect-Image-Generation | GitHub | Few-shot defect generation list |
| Awesome-Anomaly-Generation | GitHub | Hierarchical taxonomy of anomaly generation methods |
10. Unresolved Issues and Risks
10.1 Real-environment Effectiveness of Synthetic NG Thresholds (Most Important)
Confidence: High (unique academic-source point but a fundamental issue)
All sources recommend the flow of “adjusting PatchCore thresholds with synthetic NG,” but there is no systematic validation data on how effective thresholds determined using synthetic NG are for the actual NG distribution. This is the most important knowledge gap identified in this survey and concerns the reliability of the entire field of synthetic defect research.
Countermeasure: Treat synthetic NG thresholds as “initial setting values” and incorporate verification with real NG as a mandatory process.
10.2 Domain Gap Problem
Confidence: High (Academic/Community agreement)
There are many reported cases in which methods that achieved high accuracy on MVTec AD/VisA suffer a significant performance drop when applied to actual factory data.
“Even if it reaches 99% on MVTec, it can become only 60% on our own products.” — Practitioner [Community]
Beyond Academic Benchmarks (arXiv 2503.23451) analyzes this problem in detail.
10.3 Generation Quality Limitations for Ultra-minute Defects
Confidence: Medium (Academic only)
Scratches and dents on the scale of one to five pixels are unstable to generate even with diffusion models. Generation resolution and denoising randomness become inhibiting factors. Feature-space synthesis (GLASS, etc.) may be more suitable for this type of minute defect.
10.4 Lack of Quantitative Data on Computational Cost
Confidence: Medium
All sources lack quantitative comparisons of GPU time, memory usage, and inference speed for each method. Diffusion-based fine-tuning requires a GPU equivalent to A100/V100, and depending on the method, training time can take several hours to one day, but concrete benchmarks are not yet established.
10.5 Absence of a Standard Definition of Minute Defects
Confidence: Medium
The definition of “minute defect” is not standardized. There are no quantitative criteria such as definitions by pixel size or signal-to-noise ratio (SNR) relative to normal regions, making fair comparison between methods difficult.
10.6 Lack of Introduction Cases in Actual Factories
Confidence: Medium
There are case reports from SEA Vision, Edge Impulse, and others, but concrete accuracy values and actual threshold-adjustment performance data have not been made public.
Limitations:
- The investigation of each source is based on information as of April 17, 2026
- Self-reported benchmark values in papers are difficult to compare directly because of differences in experimental settings (backbone, resolution, etc.)
- The Community report tended to overestimate some methods, and corrections were made through cross-validation
- For the latest methods presented in 2025 (DefectFill, TF-IDG, SeaS), community track records are insufficient
- Quantitative success-case data in actual factories is limited
This report is based on multifaceted literature and practical research as of 2026-04-17. Because the technology is advancing rapidly, please check arXiv, GitHub, and Papers with Code as needed for the latest information.

