https://github.com/atrisovic/space

https://github.com/atrisovic/space

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.8%) to scientific vocabulary
Last synced: 9 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: atrisovic
  • Language: Shell
  • Default Branch: master
  • Size: 7.81 KB
Statistics
  • Stars: 0
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 3 years ago · Last pushed over 3 years ago
Metadata Files
Readme

README.md

SpaCE: The Spatial Confounding (Benchmarking) Environment

🚀 Description

SpaCE: The Spatial Confounding Environment is a benchmarking dataset for causal inference that incorporartes spatial confounding. The SpaCE datasets contain real confounder and exposure/treatment data inspired by environmental health studies. The synthetic outcome and counterfactual are generated according to recommended practices for causal evaluation by mimicking the real outcome data distribution learned with machine learning and neural network methods. Spatial confounding is achieved by masking influential confounders in the learned model.

🛰️ Code and methods

The code for source data processing and outcome modeling can be found in the analysis directory. The code for the SpaCE benchmarking data retrieval and sampling is in the data directory. The code benchmarking dataset with all confounders can be viewed on the Harvard Dataverse repository for transparency and reproducibility.

🧑‍🚀 The API

To retrieve and generate the data, run the command below in the Terminal:

sh curl -sSL https://raw.githubusercontent.com/NSAPH-Projects/space/master/data/data.sh | bash -s NN|XGBOOST SEED PATH

The input parameters of the command above are:

| Parameter | Meaning | |--------------------|-------------------| | method | NN or XGBOOST | | random seed | integer | | path (optional) | output file path |

The command downloads the core data and uses it together with the user's input to generate a sample of the potential outcomes (factual and counterfactual) from the core data and the model predictions. The sampling mechanism is tuned to mimic the variability in the observed data.

🔭 Data dictionary

The data dictionary for the SpaCE benchmark data:

| Variable | Values | |--------------------|-------------------| | treatment | | | covariates | | | syntheticoutcome | | | models | xgboost, nnvcnet | | errorgpkernel | | | predictions | | | counterfactuals | | | spatial_smoothness | | | importance | |

👽 Contact

We welcome contributions and feedback about SpaCE. If you have any suggestions or ideas, please open an issue or submit a pull request. Thank you for your interest in our data.

Owner

  • Name: Ana Trisovic
  • Login: atrisovic
  • Kind: user
  • Location: Cambridge, USA
  • Company: Harvard University

Computer Scientist and #Reproducibility Researcher at @HarvardBiostats & @IQSS | previously with @UChicago, @Cambridge_Uni and @LHCbExperiment @CERN

GitHub Events

Total
Last Year

Committers

Last synced: about 1 year ago

All Time
  • Total Commits: 8
  • Total Committers: 1
  • Avg Commits per committer: 8.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Ana Trisovic a****c@g****m 8

Issues and Pull Requests

Last synced: about 1 year ago

All Time
  • Total issues: 0
  • Total pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: less than a minute
  • Total issue authors: 0
  • Total pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • atrisovic (1)
Top Labels
Issue Labels
Pull Request Labels