https://github.com/climate-ref/ref-sample-data

Sample datasets used for the REF test suite

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.0%) to scientific vocabulary
Last synced: 9 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: Climate-REF
  • License: apache-2.0
  • Language: Python
  • Default Branch: main
  • Size: 334 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 2
  • Open Issues: 2
  • Releases: 1
Created over 1 year ago · Last pushed 10 months ago
Metadata Files
Readme Changelog Notice

README.md

CMIP Rapid Evaluation Framework Sample Data

Status: This project is in active development. We expect to be ready for beta releases in Q2 2025.

The CMIP Rapid Evaluation Framework (REF) is a Python application that rapidly processes and evaluates CMIP data against a set of reference data. This repository contains sample data that can be used to test the REF.

CMIP REF is a community project, and we welcome contributions from anyone.

Getting started

The sample data is stored in the data directory, in a layout that mirrors the structure of the data on the ESGF, but each file contains only a subset of the original data. The registry.txt file lists the datasets included in the sample data together with their hashes. The REF's test suite fetches these data so that a consistent collection of input datasets is used for testing.
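As an illustration of how a registry of hashes can be checked against local files, here is a minimal sketch using only the Python standard library. It assumes that each non-comment line of registry.txt has the form `relative/path hash` with a SHA-256 hex digest, which is the convention used by pooch (one of this project's dependencies); the actual file format and the REF's own fetching code may differ. The function names `parse_registry`, `sha256_of` and `verify` are hypothetical.

```python
import hashlib
from pathlib import Path


def parse_registry(text: str) -> dict[str, str]:
    """Map each relative file path to its expected hash.

    Assumption: one "relative/path hash" pair per line, "#" starts a comment.
    """
    registry = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        name, expected = line.split()[:2]
        registry[name] = expected
    return registry


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large files are not loaded into memory at once."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(data_dir: Path, registry: dict[str, str]) -> list[str]:
    """Return the names of files that are missing or whose hash differs."""
    bad = []
    for name, expected in registry.items():
        path = data_dir / name
        if not path.is_file() or sha256_of(path) != expected:
            bad.append(name)
    return bad
```

Calling `verify(Path("data"), parse_registry(Path("registry.txt").read_text()))` would then return an empty list when every listed file is present and unchanged.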

The tracked datasets and the decimation process are described in scripts/fetchtestdata.py.

Development Environment

This project uses pixi as a package manager. To get started:

  1. Install pixi: https://pixi.sh/dev/install/
  2. Set up the development environment:

         # These commands can also be run via `make virtual-environment` if you have `make` installed.
         pixi install
         pixi run pre-commit install

Regenerating the sample data

The binary content of the generated sample data can vary by platform and with other dependencies. To keep the hashes consistent across platforms, they are generated in CI and stored in the registry.txt file. The CI pipeline verifies these hashes to ensure that the data is consistent.

If the hashes in the registry.txt file are out of date, you can regenerate the sample data by manually running the Regenerate workflow. This workflow regenerates the sample data and updates the registry.txt file with the new hashes. Be sure to pull the resulting commit before adding any further changes to your pull request.
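To make "out of date" concrete, the sketch below rebuilds a registry from a data directory and compares it with a stored one. It is not the Regenerate workflow itself, whose implementation is not shown here; the `relative/path hash` line format, the use of SHA-256, and the function names `build_registry` and `registry_is_current` are all assumptions made for illustration.

```python
import hashlib
from pathlib import Path


def build_registry(data_dir: Path) -> str:
    """Return registry text with one "relative/path sha256" line per file.

    POSIX-style relative paths, sorted by name, keep the output identical
    across platforms as long as the file contents themselves are identical.
    """
    entries = []
    for path in data_dir.rglob("*"):
        if path.is_file():
            name = path.relative_to(data_dir).as_posix()
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            entries.append((name, digest))
    return "".join(f"{name} {digest}\n" for name, digest in sorted(entries))


def registry_is_current(data_dir: Path, registry_file: Path) -> bool:
    """True if the stored registry matches the files currently in data_dir.

    The registry file is expected to live outside data_dir, otherwise it
    would be hashed along with the data.
    """
    if not registry_file.is_file():
        return False
    return registry_file.read_text() == build_registry(data_dir)
```

A check like `registry_is_current(Path("data"), Path("registry.txt"))` returning `False` would correspond to the situation in which the sample data and registry need to be regenerated.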

Owner

  • Name: Climate REF
  • Login: Climate-REF
  • Kind: organization
  • Email: cmip-ipo@esa.int

Climate Rapid Evaluation Framework

GitHub Events

Total
  • Create event: 31
  • Release event: 1
  • Issues event: 2
  • Delete event: 8
  • Issue comment event: 116
  • Push event: 108
  • Pull request review comment event: 4
  • Pull request review event: 11
  • Pull request event: 49
Last Year
  • Create event: 31
  • Release event: 1
  • Issues event: 2
  • Delete event: 8
  • Issue comment event: 116
  • Push event: 108
  • Pull request review comment event: 4
  • Pull request review event: 11
  • Pull request event: 49

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 2
  • Total pull requests: 26
  • Average time to close issues: about 1 month
  • Average time to close pull requests: 1 day
  • Total issue authors: 1
  • Total pull request authors: 3
  • Average comments per issue: 2.5
  • Average comments per pull request: 2.38
  • Merged pull requests: 16
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 2
  • Pull requests: 26
  • Average time to close issues: about 1 month
  • Average time to close pull requests: 1 day
  • Issue authors: 1
  • Pull request authors: 3
  • Average comments per issue: 2.5
  • Average comments per pull request: 2.38
  • Merged pull requests: 16
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • lewisjared (2)
Pull Request Authors
  • lewisjared (12)
  • bouweandela (10)
  • nocollier (4)
Top Labels
Issue Labels
good first issue (1)
Pull Request Labels

Dependencies

.github/actions/setup/action.yml actions
  • astral-sh/setup-uv v5 composite
.github/workflows/bump.yaml actions
  • ./.github/actions/setup * composite
  • actions/checkout v4 composite
.github/workflows/ci.yaml actions
  • ./.github/actions/setup * composite
  • actions/checkout v4 composite
  • tj-actions/changed-files v45 composite
pyproject.toml pypi
  • intake-esgf *
  • matplotlib >=3.10.0
  • pooch *
  • scipy >=1.15.0
  • xarray >=2024.10.0