https://github.com/annahedstroem/eval-project
Science Score: 46.0%
This score indicates how likely this project is to be science-related, based on the following indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file (found)
- ✓ .zenodo.json file (found)
- ○ DOI references
- ✓ Academic publication links (links to arxiv.org)
- ✓ Committers with academic emails (1 of 2 committers, 50.0%, from academic institutions)
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity (low similarity, 13.0%)
Repository
Basic Info
- Host: GitHub
- Owner: annahedstroem
- Language: Jupyter Notebook
- Default Branch: main
- Size: 208 MB
Statistics
- Stars: 4
- Watchers: 3
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
A method to estimate explanation quality
This repository contains the code and experiments for the paper "Evaluate with the Inverse: Efficient Approximation of Latent Explanation Quality Distribution" (AAAI 2025, Alignment Track) by Eiras-Franco et al.
Please note that this repository is under active development!
Citation
If you find this work interesting or useful in your research, use the following BibTeX entry to cite us:

```bibtex
@inproceedings{gqe2025,
  title={Evaluate with the Inverse: Efficient Approximation of Latent Explanation Quality Distribution},
  author={Carlos Eiras-Franco and Anna Hedström and Marina M.-C. Höhne},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={39},
  year={2025},
}
```
Paper highlights 📚
The Quality Gap Estimator (QGE) helps determine whether a given explanation of a machine learning model's prediction for a particular input is better or worse than alternative explanations, according to any quality metric.
Example:
For input $\mathbf{x}_0$, your model $f$ predicts output $\mathbf{y}_0$. You use an explanation method (for instance, Integrated Gradients) to get an explanation $\mathbf{e}$, consisting of an attribution value for each variable $x_i$ in $\mathbf{x}_0$.
**The problem:** Now you want to evaluate whether $\mathbf{e}$ is a good explanation. You use a function $\psi(\mathbf{x}_0,\mathbf{y}_0,f,\mathbf{e})$ to measure some quality of the explanation (for this example, let's say faithfulness, using FaithfulnessCorrelation). This yields a scalar value that measures the faithfulness of $\mathbf{e}$; let's say it's 4. Is your explanation more faithful than other explanations $\mathbf{e}'$? Maybe most explanations for $f(\mathbf{x}_0)=\mathbf{y}_0$ have a faithfulness of around 4. Or maybe 4 is an exceptional faithfulness value for explanations of $f(\mathbf{x}_0)=\mathbf{y}_0$, which usually have a faithfulness of around 0.5. How can you tell whether $\mathbf{e}$ is exceptionally faithful compared to the alternatives?
**The solution:** You could sample many alternative explanations to get a sense of the distribution of faithfulness values across explanations. However, this is very costly, since running FaithfulnessCorrelation is computationally expensive.
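The brute-force baseline just described can be sketched as follows. This is an illustrative Monte Carlo sketch, not the paper's code: `psi` stands for any quality metric with the signature $\psi(\mathbf{x}_0,\mathbf{y}_0,f,\mathbf{e})$, and random Gaussian attributions are assumed as stand-ins for "alternative explanations".

```python
import numpy as np

def psi_percentile_by_sampling(psi, f, x, y, e, n_samples=200, rng=None):
    """Brute-force estimate of where psi(e) falls in the quality
    distribution: evaluate psi on many randomly drawn alternative
    explanations e' (here, Gaussian attributions) and return the
    empirical percentile of psi(e). Costs n_samples extra psi calls."""
    rng = rng or np.random.default_rng(0)
    score = psi(x, y, f, e)
    alt_scores = [psi(x, y, f, rng.standard_normal(e.shape))
                  for _ in range(n_samples)]
    return float(np.mean([score > s for s in alt_scores]))
```

With a metric as expensive as FaithfulnessCorrelation, the `n_samples` extra calls in the loop are exactly the cost that QGE avoids.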
QGE lets you obtain a reliable estimate of how exceptional the quality of $\mathbf{e}$ is, for faithfulness or any other metric you may be using, by running FaithfulnessCorrelation (or your desired metric) just once more.
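Conceptually, the one-extra-evaluation idea might look like the sketch below. This is hypothetical illustration code, not the paper's implementation (for that, see the Quantus framework): the "inverse" explanation is assumed here to be the sign-flipped attributions, and the gap between the two metric values serves as the estimate.

```python
import numpy as np

def quality_gap_estimate(psi, f, x, y, e):
    """Conceptual QGE sketch: run psi exactly once more, on the
    inverse explanation (assumed: sign-flipped attributions), and
    report the gap. A large positive gap suggests psi(e) sits high
    in the distribution of qualities over alternative explanations."""
    return psi(x, y, f, e) - psi(x, y, f, -e)
```

Note the contrast with the sampling baseline: one additional call to the metric, independent of how many alternative explanations exist.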
How to use QGE 👩‍💻🧑‍💻
This project allows for the replication of the experiments in the paper.
Please note that this repository contains only the code for the experiments. To use QGE, please refer to its implementation in the Quantus framework.
Instructions to replicate the experiments 🧪
Replicating one of the results generally consists of four steps:

1. Get the data & train the model (notebooks in `nbs/0-Model training`)
2. Compute the target measures (`src/extract_measures.py`)
3. Compute the results for the paper (`src/2-measure-performance.py`, `src/get_deltas.py` & `src/get_qstds.py`)
4. Generate plots if necessary (`nbs/3-plot.ipynb`)
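From the repository root, the four generic steps might look like the following. This is a dry-run sketch under assumptions (headless notebook execution via `jupyter nbconvert`, scripts taking no arguments); `<dataset>` is a placeholder, and each script should be edited as described in the per-result instructions before a real run.

```shell
#!/usr/bin/env bash
set -eu
# Dry run: print each assumed command instead of executing it,
# since the exact invocations may differ per result.
run() { echo "+ $*"; }

# 1. Get the data & train the model (headless notebook execution)
run jupyter nbconvert --to notebook --execute "nbs/0-Model training/<dataset>-train.ipynb"
# 2. Compute the target measures
run python src/extract_measures.py
# 3. Compute the results for the paper
run python src/2-measure-performance.py
run python src/get_deltas.py
run python src/get_qstds.py
# 4. Generate plots if necessary
run jupyter nbconvert --to notebook --execute nbs/3-plot.ipynb
```

Drop the `run` prefix (or redefine `run() { "$@"; }`) to execute the steps for real once the data and per-result edits are in place.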
Details for each specific result
Section 4.1.a
1. **Get model & data:** `nbs/0-Model training/0-avila-train.ipynb` & `nbs/0-Model training/0-avila-train.ipynb`
2. **Compute measures:** `src/1-extract_measures.py` (line 90)
3. **Compute results:** `src/2-measure-performance.py` (line 72)
4. **Figures 2 & 3:** `nbs/3-plot.ipynb` (uncomment line in first cell)

Section 4.1.b
1. **Get models & data:**
   - **20newsgroups:** `nbs/0-Model training/0-20newsgroups-train.ipynb`
   - **MNIST:** `nbs/0-Model training/0-MNIST-train.ipynb`
   - **CIFAR:** `nbs/0-Model training/0-CIFAR-train.ipynb`
   - **Imagenet:** not needed
2. **Compute measures:** `src/1-extract_measures.py` (line 94)
3. **Compute results:** `src/2-measure-performance.ipynb` (line 76)
4. **Aggregate results:**
   - **Table 1:** `src/3-get_deltas.py` (line 8)

Section 4.1.c
1. **Get models & data:**
   - **ood-mean & ood-zeros:** `nbs/0-Model training/0-avila-train-ood.ipynb` & `nbs/0-Model training/0-avila-train-ood.ipynb`
   - **undertrained & untrained:** `nbs/0-Model training/0-avila-train.ipynb` & `nbs/0-Model training/0-avila-train.ipynb`
     - Stop training when test accuracy reaches 70% for undertrained
     - Save weights with no training for untrained
2. **Compute measures:** `src/1-extract_measures.py` (line 105)
3. **Compute results:** `src/2-measure-performance.ipynb` (line 91)
4. **Aggregate results:**
   - **Table 2:** `src/3-get_deltas.py` (line 30) & `src/3-get_stds.py`

Section 4.1.d
1. **Get models & data:** `nbs/0-Model training/0-cmnist-train.ipynb`
2. **Compute measures:**
   - **Faithfulness:** `src/1-extract_measures_only_quantus.py` (line 127)
   - **Localization:** `src/1-extract_measures_localization-cmnist.py` (line 122)
3. **Compute results:** `src/2-measure-performance.ipynb` (lines 106 & 115)
4. **Aggregate results:**
   - **Table 3:** `src/3-get_deltas.py` (line 47) (Pixel-Flipping results taken from Table 1)
   - **Table 4:** `src/3-get-deltas.py` (line 57)

Section 4.2
Use `meta_evaluation.py` in the "inverse-estimation" repository.

Appendix
- **A.1, Figure 4:** After obtaining the results for Section 4.1.a, run `nbs/plot-qge-distribution.ipynb`
- **A.2 & A.4, Figures 5 & 6:** After obtaining the results for Section 4.1.a, run `nbs/3-plot.ipynb` (uncomment line in first cell)
- **A.3, Table 5:**
  1. Compute measures: `src/1-extract_measures.py` (line 118)
  2. Compute results: `src/2-measure-performance.ipynb` (line 123)
  3. Aggregate results: `src/3-get_deltas.py` (line 66)
- **A.5, Figures 7 & 8:** Use `meta_evaluation.py` in the "inverse-estimation" repository.

Thank you
We hope our repository is beneficial to your work and research. If you have any feedback, questions, or ideas, please feel free to raise an issue in this repository. Alternatively, you can reach out to us directly via email for more in-depth discussions or suggestions.
📧 Contact us:
- Carlos Eiras-Franco: carlos.eiras.franco@udc.es
- Anna Hedström: hedstroem.anna@gmail.com
Thank you for your interest and support!
Owner
- Name: Anna Hedström
- Login: annahedstroem
- Kind: user
- Location: Berlin, Germany
- Twitter: anna_hedstroem
- Repositories: 29
- Profile: https://github.com/annahedstroem
ML PhD student @TU-Berlin
GitHub Events
Total
- Watch event: 2
- Push event: 2
- Public event: 1
Last Year
- Watch event: 2
- Push event: 2
- Public event: 1
Committers
Last synced: 10 months ago
Top Committers
| Name | Email | Commits |
|---|---|---|
| Carlos Eiras | c****o@u****s | 78 |
| Anna Hedström | a****m@t****e | 4 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0