https://github.com/aresio/fanfair

Semi-automatic assessment of datasets fairness

https://github.com/aresio/fanfair

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.9%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Semi-automatic assessment of datasets fairness

Basic Info
  • Host: GitHub
  • Owner: aresio
  • License: afl-3.0
  • Language: Python
  • Default Branch: main
  • Size: 80.1 KB
Statistics
  • Stars: 1
  • Watchers: 3
  • Forks: 1
  • Open Issues: 1
  • Releases: 1
Created over 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License

README.md

FanFAIR

Semi-automatic assessment of datasets fairness

What is FanFAIR

FanFAIR is a rule-based approach based on fuzzy logic able to calculate some fairness metrics over a dataset and combine them into a single score, enabling a semi-automatic evaluation of a dataset in algorithmic fairness research.

Using FanFAIR

FanFAIR is designed to be as automatic as possible. However, two metrics (quality, compliance) require human intervention. Here is an example of analysis performed with FanFAIR:

``` from fanfair import FanFAIR

FF = FanFAIR(dataset="myfile.csv", outputcolumn="output") FF.setcompliance( {"dataprotectionlaw": True, "copyrightlaw": True, "medicallaw": True, "nondiscriminationlaw": False, "ethics": False}) FF.setquality(0.9) FF.producereport() ```

The analysis is automatically performed by calling the produce_report method, which generates two main figures: the gauge with the overall fairness score (from 0% to 100%), and the plots of the linguistic variables of the fuzzy model, which provide a summary of the metrics for the dataset's fairenss features.

Citing FanFAIR

If you find FanFAIR useful for your research, please cite our project as follows:

Gallese C., Scantamburlo T., Manzoni L., Nobile M.S.: Investigating Semi-Automatic Assessment of Data Sets Fairness by Means of Fuzzy Logic, Proceedings of the 20th IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (IEEE CIBCB 2023), 2023

If you need additional information, or want to see additional metrics implemented in FanFAIR, please feel free to contact Dr. Chiara Gallese (chiara.gallese@unito.it).

Acknowledgements

FanFAIR is funded by the European Union

Owner

  • Name: Marco S. Nobile
  • Login: aresio
  • Kind: user
  • Location: Venice, Italy
  • Company: Ca' Foscari University

I have a BS, MS and Ph.D. in Computer Science. I am a Associate Professor at the Ca' Foscari University of Venice

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 108 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 2
  • Total maintainers: 1
pypi.org: fanfair

FanFAIR, semi-automatic assessment of datasets fairness

  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 108 Last month
Rankings
Dependent packages count: 10.8%
Average: 35.7%
Dependent repos count: 60.7%
Maintainers (1)
Last synced: over 1 year ago

Dependencies

.github/workflows/python-publish.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v3 composite
  • pypa/gh-action-pypi-publish 27b31702a0e7fc50959f5ad993c78deac1bdfc29 composite
setup.py pypi
  • matplotlib *
  • numpy *
  • pandas *
  • scipy *
  • simpful *
.github/workflows/python-publish2.yml actions
  • actions/checkout v4 composite
  • actions/setup-python v4 composite
  • pypa/gh-action-pypi-publish 27b31702a0e7fc50959f5ad993c78deac1bdfc29 composite