https://github.com/aresio/fanfair
Semi-automatic assessment of datasets fairness
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.9%) to scientific vocabulary
Repository
Semi-automatic assessment of datasets fairness
Basic Info
- Host: GitHub
- Owner: aresio
- License: afl-3.0
- Language: Python
- Default Branch: main
- Size: 80.1 KB
Statistics
- Stars: 1
- Watchers: 3
- Forks: 1
- Open Issues: 1
- Releases: 1
Metadata Files
README.md
FanFAIR
Semi-automatic assessment of datasets fairness
What is FanFAIR
FanFAIR is a rule-based approach based on fuzzy logic able to calculate some fairness metrics over a dataset and combine them into a single score, enabling a semi-automatic evaluation of a dataset in algorithmic fairness research.
Using FanFAIR
FanFAIR is designed to be as automatic as possible. However, two metrics (quality, compliance) require human intervention. Here is an example of analysis performed with FanFAIR:
``` from fanfair import FanFAIR
FF = FanFAIR(dataset="myfile.csv", outputcolumn="output") FF.setcompliance( {"dataprotectionlaw": True, "copyrightlaw": True, "medicallaw": True, "nondiscriminationlaw": False, "ethics": False}) FF.setquality(0.9) FF.producereport() ```
The analysis is automatically performed by calling the produce_report method, which generates two main figures: the gauge with the overall fairness score (from 0% to 100%), and the plots of the linguistic variables of the fuzzy model, which provide a summary of the metrics for the dataset's fairenss features.
Citing FanFAIR
If you find FanFAIR useful for your research, please cite our project as follows:
Gallese C., Scantamburlo T., Manzoni L., Nobile M.S.: Investigating Semi-Automatic Assessment of Data Sets Fairness by Means of Fuzzy Logic, Proceedings of the 20th IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (IEEE CIBCB 2023), 2023
If you need additional information, or want to see additional metrics implemented in FanFAIR, please feel free to contact Dr. Chiara Gallese (chiara.gallese@unito.it).
Acknowledgements

Owner
- Name: Marco S. Nobile
- Login: aresio
- Kind: user
- Location: Venice, Italy
- Company: Ca' Foscari University
- Website: http://msnobile.it
- Twitter: aresio
- Repositories: 18
- Profile: https://github.com/aresio
I have a BS, MS and Ph.D. in Computer Science. I am a Associate Professor at the Ca' Foscari University of Venice
GitHub Events
Total
- Watch event: 1
Last Year
- Watch event: 1
Packages
- Total packages: 1
-
Total downloads:
- pypi 108 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 2
- Total maintainers: 1
pypi.org: fanfair
FanFAIR, semi-automatic assessment of datasets fairness
- Homepage: https://github.com/aresio/FanFAIR
- Documentation: https://fanfair.readthedocs.io/
- License: LICENSE.txt
-
Latest release: 1.0.2
published about 2 years ago
Rankings
Maintainers (1)
Dependencies
- actions/checkout v3 composite
- actions/setup-python v3 composite
- pypa/gh-action-pypi-publish 27b31702a0e7fc50959f5ad993c78deac1bdfc29 composite
- matplotlib *
- numpy *
- pandas *
- scipy *
- simpful *
- actions/checkout v4 composite
- actions/setup-python v4 composite
- pypa/gh-action-pypi-publish 27b31702a0e7fc50959f5ad993c78deac1bdfc29 composite