pythonnumericaldemos

Well-documented Python demonstrations for spatial data analytics, geostatistical and machine learning to support my courses.

https://github.com/geostatsguy/pythonnumericaldemos

Science Score: 77.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: scholar.google, zenodo.org
  • Committers with academic emails
    1 of 1 committers (100.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.3%) to scientific vocabulary

Keywords

datanalytics geostatistics machine-learning python
Last synced: 6 months ago · JSON representation ·

Repository

Well-documented Python demonstrations for spatial data analytics, geostatistical and machine learning to support my courses.

Basic Info
  • Host: GitHub
  • Owner: GeostatsGuy
  • License: mit
  • Language: Jupyter Notebook
  • Default Branch: master
  • Homepage:
  • Size: 203 MB
Statistics
  • Stars: 1,446
  • Watchers: 62
  • Forks: 501
  • Open Issues: 15
  • Releases: 1
Topics
datanalytics geostatistics machine-learning python
Created over 8 years ago · Last pushed about 1 year ago
Metadata Files
Readme License Citation

README.md

PythonNumericalDemos: Educational Data Science Python Demonstrations Repository (0.0.1)

Well documented Python workflows to help you over the intellectual hurdles of data science!

To support my students in my *Data Analytics and Geostatistics, **Spatial Data Analytics and Machine Learning courses and anyone else learning data analytics and machine learning, I have developed a set of well-documented Python workflows. When students struggle with a concept I make and share a new well-documented so they can try out and repeat all the examples from my lectures to learn immediately how to apply the models or theoretical concepts!*

Michael Pyrcz, Professor, The University of Texas at Austin, Data Analytics, Geostatistics and Machine Learning

Twitter | GitHub | Website | GoogleScholar | Book | YouTube | LinkedIn


Cite As:

Pyrcz, Michael J. (2021). PythonNumericalDemos: Educational Data Science Python Demonstrations Repository (0.0.1). Zenodo. https://doi.org/10.5281/zenodo.5564966

DOI


Setup

A minimum environment includes:

  • Python 3.7.10 - due to the depdendency of GeostatsPy on the Numba package for code acceleration
  • MatPlotLib - plotting
  • NumPy - gridded data and array math
  • Pandas - tabulated data
  • SciPy - statistics module
  • ipywidgets - for plot interactivity
  • GeostatsPy - geostatistical algorithms and functions (Pyrcz et al., 2021)

The required datasets are available in the GeoDataSets repository and linked in the workflows.

Repository Summary

I use these demonstrations for:

  1. in-lecture walkthroughs
  2. hands-on experiential learning with specific tasks suggested in the accompanying lecture notes
  3. example workflows to support my students

Other resources that you might be interested in include:

While I have more than 20 years experience coding in FORTRAN, C++ and VBA, I just started with Python a couple of years ago. I love it. I code less and get more done! I recommend Python to any engineers or scientists. Here's some reasons to learn to code:

  1. Transparency – no compiler accepts hand waiving! Coding forces your logic to be uncovered for any other scientist or engineer to review.

  2. Reproducibility – run it, get an answer, hand it over, run it, get the same answer. This is a main principle of the scientific method.

  3. Quantification – programs need numbers. Feed the program and discover new ways to look at the world.

  4. Open-source – leverage a world of brilliance. Check out packages, snippets and be amazed with what great minds have freely shared.

  5. Break Down Barriers – don’t throw it over the fence. Sit at the table with the developers and share more of your subject matter expertise for a better product.

  6. Deployment – share it with others and multiply the impact. Performance metrics or altruism, your good work benefits many others.

  7. Efficiency – minimize the boring parts of the job. Build a suite of scripts for automation of common tasks and spend more time doing science and engineering!

  8. Always Time to Do it Again! – how many times did you only do it once? It probably takes 2-4 times as long to script and automate a workflow. Usually worth it.

  9. Be Like Us – it will change you. Users feel limited, programmers truly harness the power of their applications and hardware.

Alright, that is enough of my ranting. Especially, since I haven't even introduced myself yet!

The Repository Author:

Michael Pyrcz, Professor, The University of Texas at Austin

Novel Data Analytics, Geostatistics and Machine Learning Subsurface Solutions

With over 17 years of experience in subsurface consulting, research and development, Michael has returned to academia driven by his passion for teaching and enthusiasm for enhancing engineers' and geoscientists' impact in subsurface resource development.

For more about Michael check out these links:

Twitter | GitHub | Website | GoogleScholar | Book | YouTube | LinkedIn

Want to Work Together?

I hope this content is helpful to those that want to learn more about subsurface modeling, data analytics and machine learning. Students and working professionals are welcome to participate.

  • Want to invite me to visit your company for training, mentoring, project review, workflow design and / or consulting? I'd be happy to drop by and work with you!

  • Interested in partnering, supporting my graduate student research or my Subsurface Data Analytics and Machine Learning consortium (co-PIs including Profs. Foster, Torres-Verdin and van Oort)? My research combines data analytics, stochastic modeling and machine learning theory with practice to develop novel methods and workflows to add value. We are solving challenging subsurface problems!

  • I can be reached at mpyrcz@austin.utexas.edu.

I'm always happy to discuss,

Michael

Michael Pyrcz, Ph.D., P.Eng. Professor, Cockrell School of Engineering and The Jackson School of Geosciences, The University of Texas at Austin

More Resources Available at: Twitter | GitHub | Website | GoogleScholar | Book | YouTube | LinkedIn

Owner

  • Name: Michael Pyrcz
  • Login: GeostatsGuy
  • Kind: user
  • Location: Austin, TX, USA
  • Company: @UTAustin

Full Professor at The University of Texas at Austin working on Spatial Data Analytics, Geostatistics and Machine Learning

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this data repository, please cite it as below."
authors:
  - family-names: Pyrcz
    given-names: Michael J.
    orcid:  https://orcid.org/0000-0002-5983-219X 
title: "PythonNumericalDemos: Educational Data Science Demonstrations Repository"
version: 1.0.0
doi: https://zenodo.org/doi/10.5281/zenodo.5564966
date-released: 2021-10-12

GitHub Events

Total
  • Watch event: 57
  • Push event: 2
  • Fork event: 13
Last Year
  • Watch event: 57
  • Push event: 2
  • Fork event: 13

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 607
  • Total Committers: 1
  • Avg Commits per committer: 607.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 58
  • Committers: 1
  • Avg Commits per committer: 58.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Michael Pyrcz m****z@a****u 607
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 9 months ago

All Time
  • Total issues: 9
  • Total pull requests: 7
  • Average time to close issues: N/A
  • Average time to close pull requests: 12 days
  • Total issue authors: 9
  • Total pull request authors: 7
  • Average comments per issue: 0.22
  • Average comments per pull request: 0.0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • cpgonzal (1)
  • nhasan144 (1)
  • siddharthfcb (1)
  • filippo82 (1)
  • rodiegeology (1)
  • jfozard (1)
  • KGoyal01 (1)
  • cload-uog (1)
  • ADharaUTEXAS123007 (1)
Pull Request Authors
  • conordonihoo (1)
  • caf3676 (1)
  • pitmonticone (1)
  • AndrewAnnex (1)
  • ImranTayyab (1)
  • ThatcherT (1)
  • AhmedMerz (1)
Top Labels
Issue Labels
Pull Request Labels