https://github.com/alan-turing-institute/how-data-lies

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
○
.zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (8.7%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: alan-turing-institute
License: other
Default Branch: main
Size: 32.9 MB

Statistics

Stars: 5
Watchers: 3
Forks: 1
Open Issues: 1
Releases: 0

Created almost 4 years ago · Last pushed almost 3 years ago

Metadata Files

Readme License

How Learning to Lie with Data is Essential to Prevent AI being Sexist and Racist

Offering practical actionable support to data scientists who are making efforts to be responsible, while recognising why it is hard to do so.

The course materials are given on this GitHub page, with the course available here.

Course Summary

This course title "How learning to lie with data is essential to prevent AI from being sexist and racist." is intended to catch attention but also highlights the content of this course which intends to support data scientists looking to do responsible AI. The first part of the title comes from a book from 1954 titled "How to lie with statistics" which has been brought back into consciousness through another book "Rebooting AI". The first part of this course presents elements of how data can be misleading, while providing concrete tips to identify and address these data issues. The second part of the title refers to a series of recent scandals where it is argued that AI has not been used responsibly. These scandals, some of which are used as case studies in this course, are leading to the legislation coming in to ensure ethical uses of AI. The second half of this course is focussed on these ethical considerations needed for using AI responsibly. The course aims to support Data Scientists and their managers to increase their understanding of potential ethical challenges in the application of AI and provide concrete tips to support them to be responsible.

Who is it for?

This course is designed primarily for Data Scientists who are actively looking to be responsible in their work. Part of it is also intended to be appropriate for managers of data scientists or even their collaborators who may benefit from the broad discussions but skip some of the practical details.

Learning Objectives

By the end of this course, learners will have: * an awareness of some ethical considerations which are shaping the future of AI and why data scientists need to be responsible in their role. * been exposed to some common pitfalls where data mis-interpretation can arise and be presented with concrete advice to avoid them.

Learners may have: * gained practical experience working with data to draw correct conclusions in data containing complexities.

Course Details

Introduction
Data Considerations We have three approaches to consume the content in this section - A Content Approach, A Case Study Approach, A Practical Approach. All three approaches will cover the same case studies and content blocks, which are:
2a- Content
Module 1 - Definitions Matter
Module 2 - Data Matters
Module 3 - Variability Matters
Module 4 - Interactions Matter
2b- Case Studies
i) COMPAS Case Study
ii) Apple and Amazon Case Study
iii) Ofqual Case Study
iv) Protein Folding Case Study
2c- Practical Approach
Interactive example to consume the content using STACK.
Ethical Considerations
3a- Introducing ethics in AI
3b- Fairness and debiasing
3c- AI ethics beyond debiasing
3d- Accreditation
Conclusion

Behind the Course

This course was developed with The Alan Turing Institute and IDEMS International, in collaboration with partners from AI Ghana, Universitat Bonn, Center for Science and Thought, Zertifizierte KI, Lancaster University, and Caltech

Owner

Name: The Alan Turing Institute
Login: alan-turing-institute
Kind: organization
Email: info@turing.ac.uk

Website: https://turing.ac.uk
Repositories: 477
Profile: https://github.com/alan-turing-institute

The UK's national institute for data science and artificial intelligence.

GitHub Events

Total

Last Year

Issues and Pull Requests

Last synced: over 1 year ago

All Time

Total issues: 2
Total pull requests: 10
Average time to close issues: 23 days
Average time to close pull requests: less than a minute
Total issue authors: 1
Total pull request authors: 1
Average comments per issue: 0.0
Average comments per pull request: 0.0
Merged pull requests: 10
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science