ptype-ml-cookbook

A short cookbook that is a companion to Unidata's CyberTraining project.

https://github.com/projectpythia/ptype-ml-cookbook

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.6%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

A short cookbook that is a companion to Unidata's CyberTraining project.

Basic Info
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 3
  • Open Issues: 3
  • Releases: 0
Created over 1 year ago · Last pushed 6 months ago
Metadata Files
Readme License Citation

README.md

Precipitation Machine Learning Cookbook

thumbnail

nightly-build Binder DOI

This Project Pythia Cookbook covers an extremely basic precipitation classification project. This notebook will introduce learners to the scikit-learn API, basic exploratory data analysis (EDA), and evaluations. It is meant to be a very early and basic introduction to these concepts, it is not meant to be an in-depth intorduction to machine learning. It could be the first introduction to machine learning for learners familiar with weather data.

Motivation

This cookbook is meant to be a companion to Unidata's CyberTraining project.

Authors

Ana Castaneda Montoya, Thomas Martin

Contributors

Structure

This notebook has a few sections, from inital data loading to a end to end machine learning workflow.

Exploratory Data Analysis

This section gives some nice examples of pair plots in Seaborn, and Correlation Matricies.

Dataset Splitting

For machine learning, we need a testing, training, and validation dataset. This section covers how to do that, and gives some great refrences on the why.

Dataset Scaling

For (most) machine learning models, scaling is necessary. This sections covers how to do that.

Machine Learning (!!!)

The part where we actually train a model! We also see how good it is.

Running the Notebooks

You can either run the notebook using Binder or on your local machine.

Running on Binder

The simplest way to interact with a Jupyter Notebook is through Binder, which enables the execution of a Jupyter Book in the cloud. The details of how this works are not important for now. All you need to know is how to launch a Pythia Cookbooks chapter via Binder. Simply navigate your mouse to the top right corner of the book chapter you are viewing and click on the rocket ship icon, (see figure below), and be sure to select “launch Binder”. After a moment you should be presented with a notebook that you can interact with. I.e. you’ll be able to execute and even change the example programs. You’ll see that the code cells have no output at first, until you execute them by pressing {kbd}Shift+{kbd}Enter. Complete details on how to interact with a live Jupyter notebook are described in Getting Started with Jupyter.

Running on Your Own Machine

If you are interested in running this material locally on your computer, you will need to follow this workflow:

(Replace "cookbook-example" with the title of your cookbooks)

  1. Clone the https://github.com/ThomasMGeo/ptype-ml-cookbook repository:

bash git clone https://github.com/ThomasMGeo/ptype-ml-cookbook.git

  1. Create and activate your conda environment from the environment.yml file bash conda env create -f environment.yml conda activate cookbook-example
  2. Move into the notebooks directory and start up Jupyterlab bash cd notebooks/ jupyter lab

Owner

  • Name: Project Pythia
  • Login: ProjectPythia
  • Kind: organization
  • Email: projectpythia@ucar.edu
  • Location: United States of America

Community learning resource for Python-based computing in the geosciences

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this cookbook, please cite it as below."
authors:
  # add additional entries for each author -- see https://github.com/citation-file-format/citation-file-format/blob/main/schema-guide.md
  - family-names: Rose
    given-names: Brian E. J.
    orcid: https://orcid.org/0000-0002-9961-3821 # optional
    website: https://github.com/brian-rose # optional
    affiliation: University at Albany (State University of New York) # optional
  - family-names: Kent
    given-names: Julia
    orcid: https://orcid.org/0000-0002-5611-8986
    website: https://github.com/jukent
    affiliation: UCAR/NCAR
  - family-names: Tyle
    given-names: Kevin
    orcid: https://orcid.org/0000-0001-5249-9665
    website: https://github.com/ktyle
    affiliation: University at Albany (State University of New York)
  - family-names: Clyne
    given-names: John
    orcid: https://orcid.org/0000-0003-2788-9017
    website: https://github.com/clyne
    affiliation: UCAR/NCAR
  - family-names: Camron
    given-names: Drew
    orcid: https://orcid.org/0000-0001-7246-6502
    website: https://github.com/dcamron
    affiliation: UCAR/Unidata
  - family-names: Grover
    given-names: Maxwell
    orcid: https://orcid.org/0000-0002-0370-8974
    website: https://github.com/mgrover1
    affiliation: Argonne National Laboratory
  - family-names: Ford
    given-names: Robert R.
    orcid: https://orcid.org/0000-0001-5483-4965
    website: https://github.com/r-ford
    affiliation: University at Albany (State University of New York)
  - family-names: Paul
    given-names: Kevin
    orcid: https://orcid.org/0000-0001-8155-8038
    website: https://github.com/kmpaul
    affiliation: NVIDIA
  - name: "Cookbook Template contributors" # use the 'name' field to acknowledge organizations
    website: "https://github.com/ProjectPythia/cookbook-template/graphs/contributors"
title: "Cookbook Template"
abstract: "A sample cookbook description."

GitHub Events

Total
  • Delete event: 1
  • Issue comment event: 3
  • Push event: 51
  • Pull request review event: 1
  • Pull request event: 4
  • Create event: 2
Last Year
  • Delete event: 1
  • Issue comment event: 3
  • Push event: 51
  • Pull request review event: 1
  • Pull request event: 4
  • Create event: 2

Dependencies

.github/workflows/nightly-build.yaml actions
.github/workflows/publish-book.yaml actions
.github/workflows/trigger-book-build.yaml actions
.github/workflows/trigger-delete-preview.yaml actions
.github/workflows/trigger-link-check.yaml actions
.github/workflows/trigger-preview.yaml actions
.github/workflows/trigger-replace-links.yaml actions
  • actions/checkout v4 composite
  • jacobtomlinson/gha-find-replace v3 composite
  • stefanzweifel/git-auto-commit-action v5 composite
environment.yml pypi