https://github.com/boom-lab/crocolake-python

https://github.com/boom-lab/crocolake-python

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.6%) to scientific vocabulary
Last synced: 9 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: boom-lab
  • License: gpl-3.0
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 47.9 KB
Statistics
  • Stars: 1
  • Watchers: 3
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed about 1 year ago
Metadata Files
Readme License

README.md

CrocoLake-Python

CrocoLake-Python is a collection of Jupyter notebooks that shows how to interface with CrocoLake and Argo's parquet databases with Python.

Table of Contents

  1. Usage
  2. Examples
  3. Databases
  4. Contact

Usage

python

Create your local environment (creates a folder in your current directory) virtualenv crocolake

Activate the new python environment source crocolake/bin/activate

Install the required packages pip install .

Launch jupyter lab to access the notebooks: jupyter lab

If you don't have virtualenv in your machine, you can install it with pip install virtualenv

conda

Create the local environment conda env create -f environment.yml

Activate the environment: conda activate crocolake

Launch jupyter lab to access the notebooks: jupyter lab

Notebooks

You can then launch any notebook from the notebooks folder and execute it. Each example needs a specific dataset, and it contains code to download it to your local machine.

Note that there are a couple of ways to load parquet datasets in a dataframe in Python: using pyarrow and using dask. Example 1 and Example 2 show both, while the other examples use the one that in my experience is most efficient (i.e. dask).

Examples

The notebooks folder contains four examples/tutorials:

  1. Example 1 shows how to make a map of dissolved oxygen content in the North West Atlantic;
  2. Example 2 shows how to make a map of temperature measurements in the North West Atlantic, including information about the source (Argo, GLODAP, or Spray Gliders);
  3. Example 3 shows how to make temperature-salinity plots from Argo QC-ed measurements;
  4. Example 4 shows how to make an animation of Argo's fleet growth over time on a world map;
  5. Example 5 shows how to make a map of dissolved oxygen measurements in the Pacific off the coast of California.

Databases

The following databases are currently available:

  • CrocoLake: contains the best available data from Argo, GLODAP, and Spray Gliders. More details here. This example uses CrocoLake.
  • Argo 'QC': contains the best available data, that is real time values are reported only when delayed values are not available. This version is the same used in CrocoLake, and here you can find more details on how it is generated. This example uses Argo 'QC'.
  • Argo 'ALL': contains all real time and adjusted variables as reported in the core ('prof.nc') and synthetic ('<PLATFORMNUMBER>_Sprof.nc') profile files, for the physical and biogeochemical versions respectively. Both this and this examples use Argo 'ALL'.

Each database comes in 'PHY' and 'BGC' versions.

Contact

For any questions, bugs, missing information, etc, open an issue or get in touch!

Owner

  • Name: Woods Hole Oceanographic Institution Biogeochemical Ocean Observing and Modeling Lab
  • Login: boom-lab
  • Kind: organization

GitHub Events

Total
  • Watch event: 1
  • Member event: 1
  • Push event: 1
  • Public event: 1
  • Pull request event: 1
  • Fork event: 1
  • Create event: 1
Last Year
  • Watch event: 1
  • Member event: 1
  • Push event: 1
  • Public event: 1
  • Pull request event: 1
  • Fork event: 1
  • Create event: 1

Dependencies

setup.py pypi
  • argopy ==0.1.15
  • bokeh *
  • cartopy *
  • dask *