https://github.com/aiformedresearch/synthetic_images_metrics_toolkit
SIM Toolkit: 2D and 3D synthetic image quality assessment using quantitative and qualitative metrics.
Science Score: 49.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file (found codemeta.json file)
- ✓ .zenodo.json file (found .zenodo.json file)
- ✓ DOI references (found 3 DOI reference(s) in README)
- ✓ Academic publication links (links to: arxiv.org, zenodo.org)
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity (low similarity (14.1%) to scientific vocabulary)
Repository
Basic Info
Statistics
- Stars: 5
- Watchers: 0
- Forks: 1
- Open Issues: 0
- Releases: 3
Metadata Files
README.md
Synthetic_Images_Metrics_Toolkit
The Synthetic Images Metrics (SIM) Toolkit provides a comprehensive collection of state-of-the-art metrics for evaluating the quality of 2D and 3D synthetic images.
These metrics enable the assessment of:
- Fidelity: the realism of synthetic data;
- Diversity: the coverage of the real data distribution;
- Generalization: the generation of authentic, non-memorized images.
📊 Automated Report Generation
This repository produces a comprehensive report as output, summarizing key findings and visualizations in a structured format.
Check out an example report here: 📄 reportmetricstoolkit.pdf
Installation
Before proceeding, ensure that CUDA is installed. CUDA 11.0 or later is recommended.
🔧 Option 1: Install via Anaconda
- Install Anaconda for your operating system.
- Clone the repository:
```bash
cd /path_to/sim_toolkit_repo
git clone https://github.com/aiformedresearch/Synthetic_Images_Metrics_Toolkit.git
cd Synthetic_Images_Metrics_Toolkit
```

- Create and activate the Conda environment:

```bash
conda create -n sim_toolkit python=3.10 -y
conda activate sim_toolkit
```

- Install dependencies:

```bash
pip install -r requirements.txt
```

📌 To install additional dependencies for the tutorials with the pre-trained generators:

```bash
pip install -r requirements_tutorial.txt
```

- Install any additional packages needed for your specific use case.
🐳 Option 2: Use Docker
Install Docker for your operating system.
Pull the Docker image:

```bash
docker pull aiformedresearch/metrics_toolkit:3.1
```

Run the Docker container:

```bash
docker run -it --gpus all \
  -v /absolute/path/to/real_data:/Synthetic_Images_Metrics_Toolkit/data \
  -v /absolute/path/to/config_file:/Synthetic_Images_Metrics_Toolkit/configs \
  -v /absolute/path/to/local_output_directory:/Synthetic_Images_Metrics_Toolkit/outputs \
  aiformedresearch/metrics_toolkit
```

- The `--gpus all` flag enables GPU support. Specify a GPU if needed, e.g., `--gpus 0`.
- The `-v` flag mounts the local directories into the working directory `Synthetic_Images_Metrics_Toolkit` inside the container.
Refer to the Usage section for detailed instructions about running the main script.
Usage
Step 1: Customize for your use case
To evaluate your generative model, tailor the configuration script to meet your specific needs.
📖 Find here a short tutorial to use the SIM Toolkit with your data.
Key configuration options include:
- Metrics: select metrics such as FID, KID, etc.
- Runtime configurations: set up your working directory; select CPU or GPU; define the data type (2D or 3D).
- Metrics configurations: adjust the hyperparameters for metrics computation, if needed.
- Real data configuration: define the function for data loading.
- Synthetic data configuration: choose one of the following options:
  1. Load synthetic images directly from files or a folder;
  2. Define functions to load a pre-trained generator and generate new samples.
📝 Detailed guide to build the configuration file
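The exact field names in `config.py` are defined by the toolkit's configuration guide linked above; the fragment below is only a hypothetical illustration of the kinds of choices listed here (all names, paths, and functions are assumptions for illustration, not the toolkit's actual API):

```python
# Hypothetical config.py sketch -- every name here is illustrative only.
import numpy as np

METRICS = ["fid", "kid", "prdc"]   # which quantitative metrics to compute
OUTPUT_DIR = "outputs"             # working directory for the generated report
USE_GPU = True                     # run metric computation on GPU if available
DATA_TYPE = "2D"                   # "2D" or "3D" data

def load_real_data():
    """Return real images as an array; replace with your own loader."""
    return np.load("data/real_images.npy")

# Option 1: load synthetic images directly from a folder of files.
SYNTH_SOURCE = "data/synthetic/"
```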
Step 2: Run the SIM Toolkit
Once the `config.py` script is customized, execute the main script with:

```bash
python calc_metrics.py --config configs/config.py
```
Metrics overview
Quantitative metrics
The following quantitative metrics are available:
| Metric flag | Description | Original implementation |
| :---------- | :---------- | :---------------------- |
| `fid` | Fréchet inception distance [1] against the full dataset | Karras et al. |
| `kid` | Kernel inception distance [2] against the full dataset | Karras et al. |
| `is_` | Inception score [3] against the full dataset (2D only) | Karras et al. |
| `prdc` | Precision, recall [4], density, and coverage [6] against the full dataset | Naeem et al. |
| `pr_auth` | $\alpha$-precision, $\beta$-recall, and authenticity [7] against the full dataset | Alaa et al. |
⚠️ 3D metrics rely on a 3D-ResNet50 feature extractor from MedicalNet, pre-trained on 23 medical imaging datasets. Before using these metrics, please verify that your data domain is represented among those datasets. If your dataset differs significantly from the pretraining domains, the resulting embeddings, and therefore the computed metrics, may not be meaningful.
References:
1. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium, Heusel et al., 2017
2. Demystifying MMD GANs, Bińkowski et al., 2018
3. Improved Techniques for Training GANs, Salimans et al., 2016
4. Improved Precision and Recall Metric for Assessing Generative Models, Kynkäänniemi et al., 2019
5. A Style-Based Generator Architecture for Generative Adversarial Networks, Karras et al., 2018
6. Reliable Fidelity and Diversity Metrics for Generative Models, Naeem et al., 2020
7. How Faithful is your Synthetic Data? Sample-level Metrics for Evaluating and Auditing Generative Models, Alaa et al., 2022
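To make the `fid` flag concrete, the Fréchet distance models each feature set as a multivariate Gaussian and compares their means and covariances. A minimal sketch of that standard formula (not the toolkit's internal code, which also handles feature extraction) could look like:

```python
import numpy as np
from scipy.linalg import sqrtm

def frechet_distance(feats_real, feats_synth):
    """Fréchet distance between two sets of feature vectors,
    each modeled as a multivariate Gaussian:
    ||mu_r - mu_s||^2 + Tr(C_r + C_s - 2 (C_r C_s)^(1/2))."""
    mu_r, mu_s = feats_real.mean(axis=0), feats_synth.mean(axis=0)
    cov_r = np.cov(feats_real, rowvar=False)
    cov_s = np.cov(feats_synth, rowvar=False)
    covmean = sqrtm(cov_r @ cov_s)
    if np.iscomplexobj(covmean):
        covmean = covmean.real  # discard tiny imaginary parts from numerical noise
    return float(np.sum((mu_r - mu_s) ** 2)
                 + np.trace(cov_r + cov_s - 2.0 * covmean))
```

Identical feature sets give a distance near zero, and a pure mean shift contributes exactly the squared shift, which is a quick sanity check for any FID implementation.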
Qualitative metrics
The SIM Toolkit automatically generates t-SNE and PCA visualizations of real and synthetic image embeddings for each metric, to qualitatively assess the fidelity and diversity of the synthetic images. These plots are included in the final report, which also provides detailed guidance to help users interpret the resulting metric scores. You can view an example report here.
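The idea behind these plots is simply to project real and synthetic embeddings into a shared 2D space so that overlap (fidelity) and spread (diversity) become visible. A minimal sketch with scikit-learn, independent of the toolkit's own plotting code:

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE

def embed_2d(real_feats, synth_feats, method="pca", seed=0):
    """Project real and synthetic feature vectors into a shared 2D space.

    Fitting both sets jointly keeps the two point clouds comparable;
    the caller can then scatter-plot them with different colors."""
    feats = np.vstack([real_feats, synth_feats])
    if method == "pca":
        coords = PCA(n_components=2, random_state=seed).fit_transform(feats)
    else:
        coords = TSNE(n_components=2, random_state=seed,
                      perplexity=min(30, len(feats) - 1)).fit_transform(feats)
    return coords[: len(real_feats)], coords[len(real_feats):]
```

Fitting PCA or t-SNE on the concatenated embeddings (rather than separately per set) is what makes the resulting clouds directly comparable.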
In addition, the toolkit supports k-NN analysis, enabling users to qualitatively evaluate how well synthetic samples generalize with respect to real data:
| Metric flag | Description | Original implementation |
| :---------- | :---------- | :---------------------- |
| `knn` | k-nearest neighbors (k-NN) analysis, to assess potential memorization of the model | Lai et al. |
The k-NN analysis identifies and visualizes the `top_n` real images most similar to any synthetic sample (from a set of 50,000 generated samples). For each of these real images, the visualization displays the top `k` synthetic images ranked by their cosine similarity to that real image.
By default, `k=5` and `top_n=3`. These parameters can be customized in the Metrics configurations section of the configuration file.
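The core retrieval step of such an analysis is a cosine-similarity nearest-neighbor lookup between the two embedding sets. A minimal sketch (illustrative only, not the toolkit's implementation):

```python
import numpy as np

def top_k_neighbours(query_feats, reference_feats, k=5):
    """For each query embedding, return the indices of the k most
    cosine-similar reference embeddings, in descending similarity."""
    q = query_feats / np.linalg.norm(query_feats, axis=1, keepdims=True)
    r = reference_feats / np.linalg.norm(reference_feats, axis=1, keepdims=True)
    sims = q @ r.T  # (n_query, n_reference) cosine similarities
    return np.argsort(-sims, axis=1)[:, :k]
```

If a synthetic sample's nearest real neighbor has similarity close to 1, that is the kind of evidence of memorization the k-NN visualization is designed to surface.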
🚧 To-do list:
- [x] Simplify the procedure for data loading;
- [x] Allow loading synthetic images from files (as well as from a pre-trained generator);
- [x] 3D data support;
- [x] Implement PCA and t-SNE to qualitatively assess diversity.
Licenses
This repository complies with the REUSE Specification. All source files are annotated with SPDX license identifiers, and full license texts are included in the LICENSES directory.
Licenses Used
- LicenseRef-NVIDIA-1.0: Applies to code reused from NVIDIA's StyleGAN2-ADA repository: https://github.com/NVlabs/stylegan2-ada-pytorch, under the NVIDIA Source Code License.
- MIT: For code reused from:
- https://github.com/vanderschaarlab/evaluating-generative-models;
- https://github.com/clovaai/generative-evaluation-prdc.
- BSD-3-Clause: Applies to two scripts reused from https://github.com/vanderschaarlab/evaluating-generative-models;
- NPOSL-3.0: Applies to the code developed specifically for this repository.
For detailed license texts, see the LICENSES directory.
Acknowledgments
This repository builds on NVIDIA's StyleGAN2-ADA repository: https://github.com/NVlabs/stylegan2-ada-pytorch.
Owner
- Name: AI for Medicine Research Group
- Login: aiformedresearch
- Kind: organization
- Repositories: 1
- Profile: https://github.com/aiformedresearch
GitHub Events
Total
- Release event: 3
- Watch event: 7
- Push event: 77
- Fork event: 1
- Create event: 2
Last Year
- Release event: 3
- Watch event: 7
- Push event: 77
- Fork event: 1
- Create event: 2
Dependencies
- pytorch/pytorch 1.7.1-cuda11.0-cudnn8-runtime build
- click ==7.1.2
- imageio-ffmpeg ==0.4.3
- matplotlib ==3.5.3
- nibabel ==4.0.2
- pandas ==1.3.5
- psutil ==5.7.2
- pyspng ==0.1.1
- requests ==2.24
- scikit-learn ==1.0.2
- scipy ==1.5.2
- tensorflow ==2.4.1
- torch ==1.7.1
- torchvision ==0.8.2
- tqdm ==4.64.1