geodatasets

Synthetic datasets for geoscience (geo)statistical modeling

https://github.com/geostatsguy/geodatasets

Science Score: 77.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: scholar.google, zenodo.org
  • Committers with academic emails
    1 of 1 committers (100.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.3%) to scientific vocabulary

Keywords

data database spatial-data
Last synced: 6 months ago · JSON representation ·

Repository

Synthetic datasets for geoscience (geo)statistical modeling

Basic Info
  • Host: GitHub
  • Owner: GeostatsGuy
  • License: mit
  • Default Branch: master
  • Homepage:
  • Size: 11.9 MB
Statistics
  • Stars: 97
  • Watchers: 3
  • Forks: 120
  • Open Issues: 0
  • Releases: 1
Topics
data database spatial-data
Created about 8 years ago · Last pushed 8 months ago
Metadata Files
Readme License Citation

README.md

GeoDataSets: Synthetic Subsurface Data Repository (0.0.1)

Open-data multivariate, spatiotemporal datasets to support education and research!

To support education and repeatable research we need open-data, data that is openly accessible, exploitable, editable and shared by anyone for any purpose, licensed under an open license. For multivariate, spatiotemporal problems these datasets are not widely available. Also, it is very helpful to have access to the 'inaccessible', exhaustive truth model (the population from which samples are extracted). So I have used my geostatistics skills to make a wide variety of synthetic truth populations an sample datasets to support my educational content and research and in the spirit of open-data, I share it here for anyone to use.

Michael Pyrcz, Professor, The University of Texas at Austin, Data Analytics, Geostatistics and Machine Learning

Twitter | GitHub | Website | GoogleScholar | Book | YouTube | LinkedIn


Cite As:

Pyrcz, Michael J. (2021). GeoDataSets: Synthetic Subsurface Data Repository (0.0.1). Zenodo. https://doi.org/10.5281/zenodo.5564874

DOI


Repository Summary

A collection of synthetic subsurface datasets to support education, publications, and prototyping. This repository includes a wide variety of synthetic, subsurface datasets with a variety of:

Data Dimensionality

To support education with easy visualization and interactivity the datasets are 1D and 2D.

  • 1D cores from wells and 2D seismic maps.
Number of Features

For multivariate analysis some of the datasets include up to 6 features with a variety of structures.

  • linear and nonlinear, homoscedastic and heteroscedastic, and multivariate constraints
Data Issues

The datasets attempt to include typical issues such as non-physical values, random and structured noise

I hope this is helpful,

Michael

The Repository Author:

Michael Pyrcz, Professor, The University of Texas at Austin

Novel Data Analytics, Geostatistics and Machine Learning Subsurface Solutions

With over 17 years of experience in subsurface consulting, research and development, Michael has returned to academia driven by his passion for teaching and enthusiasm for enhancing engineers' and geoscientists' impact in subsurface resource development.

For more about Michael check out these links:

Twitter | GitHub | Website | GoogleScholar | Book | YouTube | LinkedIn

Want to Work Together?

I hope this content is helpful to those that want to learn more about subsurface modeling, data analytics and machine learning. Students and working professionals are welcome to participate.

  • Want to invite me to visit your company for training, mentoring, project review, workflow design and / or consulting? I'd be happy to drop by and work with you!

  • Interested in partnering, supporting my graduate student research or my Subsurface Data Analytics and Machine Learning consortium (co-PIs including Profs. Foster, Torres-Verdin and van Oort)? My research combines data analytics, stochastic modeling and machine learning theory with practice to develop novel methods and workflows to add value. We are solving challenging subsurface problems!

  • I can be reached at mpyrcz@austin.utexas.edu.

I'm always happy to discuss,

Michael

Michael Pyrcz, Ph.D., P.Eng. Professor, Cockrell School of Engineering and The Jackson School of Geosciences, The University of Texas at Austin

More Resources Available at: Twitter | GitHub | Website | GoogleScholar | Book | YouTube | LinkedIn

Owner

  • Name: Michael Pyrcz
  • Login: GeostatsGuy
  • Kind: user
  • Location: Austin, TX, USA
  • Company: @UTAustin

Full Professor at The University of Texas at Austin working on Spatial Data Analytics, Geostatistics and Machine Learning

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this data repository, please cite it as below."
authors:
  - family-names: Pyrcz
    given-names: Michael J.
    orcid:  https://orcid.org/0000-0002-5983-219X 
title: "GeoDataSets: Synthetic Subsurface Data Repository"
version: 0.0.1
doi: https://doi.org/10.5281/zenodo.5564874
date-released: 2021-10-12

GitHub Events

Total
  • Watch event: 14
  • Fork event: 2
Last Year
  • Watch event: 14
  • Fork event: 2

Committers

Last synced: over 1 year ago

All Time
  • Total Commits: 89
  • Total Committers: 1
  • Avg Commits per committer: 89.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 9
  • Committers: 1
  • Avg Commits per committer: 9.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Michael Pyrcz m****z@a****u 89
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels