https://github.com/1587causalai/awesome-causality-data
A data index for learning causality.
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
✓DOI references
Found 1 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (7.4%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
Repository
A data index for learning causality.
Basic Info
Statistics
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 0
Fork of rguo12/awesome-causality-data
Created about 7 years ago
· Last pushed about 7 years ago
https://github.com/1587causalai/awesome-causality-data/blob/master/
# awesome-causality-data
An index of datasets that can be used for learning causality.
Please cite our survey if this data index helps your research.
@article{guo2018survey,
title={A Survey of Learning Causality with Data: Problems and Methods},
author={Guo, Ruocheng and Cheng, Lu and Li, Jundong and Hahn, P. Richard and Liu, Huan},
journal={arXiv preprint arXiv:1809.09337},
year={2018}
}
*Updates coming soon*
## Datasets for Learning Causal Effects (Causal Inference)
### With Back-door Criterion
#### Standard Datasets for Causal Inference
Standard datasets for learning causal effects comes with each instance in the format of (**x**,d,y).
[IHDP1](https://github.com/AMLab-Amsterdam/CEVAE/tree/master/datasets/IHDP)
[How is IHDP1 (setting A) simulated](https://github.com/vdorie/npci/tree/master/examples/ihdp_sim)
[IHDP2](https://math.la.asu.edu/~prhahn/)
[Twins](https://github.com/AMLab-Amsterdam/CEVAE/tree/master/datasets/TWINS)
[Job Training](http://users.nber.org/~rdehejia/data/nswdata2.html) (Lalonde 1986)
[ACIC Benchmark](https://github.com/vdorie/aciccomp/tree/master/2016)
#### Non-i.i.d. Datasets for Causal Inference
[Amazon](https://drive.google.com/drive/u/1/folders/1Ff_GdfjhrDFbZiRW0z81lGJW-cUrYmo1)
### Without Back-door Criterion
#### Datasets with instrumental Variables (IV)
Standard datasets for learning causal effects, each instance has the format of (i,**x**,d,y).
[1980 Census Extract](https://economics.mit.edu/faculty/angrist/data1/data/angkru95)
[CPS Extract](https://economics.mit.edu/faculty/angrist/data1/data/angkru95)
#### Datasets for Regression Discontinuity Design
[Population Threshold RDD Datasets](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/PGXO5O)
## Datasets for Learning Causal Relationships (Causal Discovery)
#### Distinguishing Cause from Effect
[Database with cause-effect pairs](http://webdav.tuebingen.mpg.de/cause-effect/)
## Datasets for Connections to Machine Learning
### Datasets with randomized test set for recommendation systems
|Name|Paper|URL|
|---|---|---|
|Coat|Schnabel, Tobias, Adith Swaminathan, Ashudeep Singh, Navin Chandak, and Thorsten Joachims. "Recommendations as treatments: Debiasing learning and evaluation." arXiv preprint arXiv:1602.05352 (2016).|[download](http://www.cs.cornell.edu/~schnabts/mnar/index.html)|
|Yahoo! R3| |[download](https://webscope.sandbox.yahoo.com/catalog.php?datatype=r)|
Owner
- Name: Heyang Gong
- Login: 1587causalai
- Kind: user
- Repositories: 1
- Profile: https://github.com/1587causalai
1587causalai
GitHub Events
Total
- Watch event: 1
Last Year
- Watch event: 1