https://github.com/1587causalai/awesome-causality-data

A data index for learning causality.

https://github.com/1587causalai/awesome-causality-data

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.4%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

A data index for learning causality.

Basic Info
  • Host: GitHub
  • Owner: 1587causalai
  • License: mit
  • Language: Jupyter Notebook
  • Default Branch: master
  • Homepage:
  • Size: 18.6 KB
Statistics
  • Stars: 2
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Fork of rguo12/awesome-causality-data
Created about 7 years ago · Last pushed about 7 years ago

https://github.com/1587causalai/awesome-causality-data/blob/master/

# awesome-causality-data
An index of datasets that can be used for learning causality.

Please cite our survey if this data index helps your research.

@article{guo2018survey,
title={A Survey of Learning Causality with Data: Problems and Methods},
author={Guo, Ruocheng and Cheng, Lu and Li, Jundong and Hahn, P. Richard and Liu, Huan},
journal={arXiv preprint arXiv:1809.09337},
year={2018}
} *Updates coming soon* ## Datasets for Learning Causal Effects (Causal Inference) ### With Back-door Criterion #### Standard Datasets for Causal Inference Standard datasets for learning causal effects comes with each instance in the format of (**x**,d,y). [IHDP1](https://github.com/AMLab-Amsterdam/CEVAE/tree/master/datasets/IHDP) [How is IHDP1 (setting A) simulated](https://github.com/vdorie/npci/tree/master/examples/ihdp_sim) [IHDP2](https://math.la.asu.edu/~prhahn/) [Twins](https://github.com/AMLab-Amsterdam/CEVAE/tree/master/datasets/TWINS) [Job Training](http://users.nber.org/~rdehejia/data/nswdata2.html) (Lalonde 1986) [ACIC Benchmark](https://github.com/vdorie/aciccomp/tree/master/2016) #### Non-i.i.d. Datasets for Causal Inference [Amazon](https://drive.google.com/drive/u/1/folders/1Ff_GdfjhrDFbZiRW0z81lGJW-cUrYmo1) ### Without Back-door Criterion #### Datasets with instrumental Variables (IV) Standard datasets for learning causal effects, each instance has the format of (i,**x**,d,y). [1980 Census Extract](https://economics.mit.edu/faculty/angrist/data1/data/angkru95) [CPS Extract](https://economics.mit.edu/faculty/angrist/data1/data/angkru95) #### Datasets for Regression Discontinuity Design [Population Threshold RDD Datasets](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/PGXO5O) ## Datasets for Learning Causal Relationships (Causal Discovery) #### Distinguishing Cause from Effect [Database with cause-effect pairs](http://webdav.tuebingen.mpg.de/cause-effect/) ## Datasets for Connections to Machine Learning ### Datasets with randomized test set for recommendation systems |Name|Paper|URL| |---|---|---| |Coat|Schnabel, Tobias, Adith Swaminathan, Ashudeep Singh, Navin Chandak, and Thorsten Joachims. "Recommendations as treatments: Debiasing learning and evaluation." arXiv preprint arXiv:1602.05352 (2016).|[download](http://www.cs.cornell.edu/~schnabts/mnar/index.html)| |Yahoo! R3| |[download](https://webscope.sandbox.yahoo.com/catalog.php?datatype=r)|

Owner

  • Name: Heyang Gong
  • Login: 1587causalai
  • Kind: user

1587causalai

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1