Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.3%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: Rossi-Laboratory
- License: mit
- Language: Python
- Default Branch: main
- Size: 93.8 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Implicit Knowledge Distillation Diffusion Transformer (IKDDiT)
Paper | Project Page | Video | Code
This project implements IKDDiT: a Teacher–Student Diffusion Transformer based on an implicit discriminator, designed for photolithography overlay map generation with computational acceleration advantages.
🎉 ICCV 2025 Accepted Paper
This paper has been accepted for presentation at the International Conference on Computer Vision 2025, Honolulu, Hawaii.
Key Features
- Implicit Discriminator: During training, the Teacher DiT sees all image patches, while the Student DiT sees only visible patches (others are masked), with token alignment guided by an implicit discriminator.
- Inference Acceleration: Only the Student DiT is used during inference, denoising and reconstructing a small number of non-masked patches, significantly reducing inference time.
- Unified Contrastive Embedding: Embeds image data, equipment logs, and barcode IDs into a shared space, aligned via InfoNCE loss.
- Gated Cross-Attention: Fuses condition tokens with latent maps to improve reconstruction quality.
Repository Structure
See the directory tree for full structure details.
Installation
bash
conda env create -f environment.yml
conda activate ikddit
or
bash
pip install -r requirements.txt
Training
- Download the dataset:
bash cd data && bash download_mpom.sh && cd .. - Train the model (Teacher + Student + Implicit Discriminator):
bash python src/train.py --config configs/ikddit_s.yaml
Hyperparameters
mask_ratio(float): percentage of patches masked during Student encoding (default: 0.5).- Ablation study results for
mask_ratio(FID-15k):
| Mask Ratio | FID-15k | | ---------- | ------- | | 0% | 27.46 | | 25% | 26.06 | | 50% | 24.66 | | 70% | 123.85 |
Optimal performance is achieved at a 50% mask ratio.
Loss Function
- Eq.8: LIKDDiT = LDSM + λ1 * LMAE + λ2 * LD
Loss components: 1. DSM: Denoising Score Matching. 2. MAE: Mean Absolute Error (L1), for reconstruction error. 3. Discriminator Loss: Implicit discriminator-guided alignment.
Hyperparameters:
- lambda1 (float): Weight for MAE.
- lambda2 (float): Weight for discriminator.
All parameters can be set in configs/ikddit_s.yaml.
Inference
Run only the Student DiT Encoder + Decoder:
bash
python src/inference.py --model checkpoints/student_ikddit.pth --mask_ratio 0.5
Visualization
notebooks/demo.ipynb: Demonstrates alignment loss during training, σ heatmap during inference, and speed-up benchmarks.
Citation
bibtex
@inproceedings{anonymous2025ikddit,
title={Photolithography Overlay Map Generation with Implicit Knowledge Distillation Diffusion Transformer},
author={Anonymous},
booktitle={ICCV},
year={2025}
}
Owner
- Name: Rossi Lab
- Login: Rossi-Laboratory
- Kind: organization
- Repositories: 1
- Profile: https://github.com/Rossi-Laboratory
Rossi Lab @ NYCU
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite as below:"
title: "Photolithography Overlay Map Generation with Implicit Knowledge Distillation Diffusion Transformer (IKDDiT)"
authors:
- family-names: "Yang"
given-names: "YuanFu"
date-released: "2025-05-01"
version: "1.0.0"
doi: ""
url: "https://github.com/Rossi-Laboratory/IKDDiT"
GitHub Events
Total
- Public event: 1
- Push event: 51
- Create event: 2
Last Year
- Public event: 1
- Push event: 51
- Create event: 2
Dependencies
- einops *
- Pillow *
- einops *
- matplotlib *
- numpy *
- pyyaml *
- tensorboard *
- torch >=1.10.0
- torchvision *
- tqdm *