stationbench

Benchmarking of weather forecasts based on station observations

https://github.com/juaai/stationbench

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 3 DOI reference(s) in README
✓
Academic publication links
Links to: zenodo.org
○
Committers with academic emails
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (13.4%) to scientific vocabulary

Keywords

benchmarking forecasting weather

Last synced: 9 months ago · JSON representation

Repository

Benchmarking of weather forecasts based on station observations

Basic Info

Host: GitHub
Owner: juaAI
License: mit
Language: Python
Default Branch: main
Homepage: https://pypi.org/project/stationbench/
Size: 660 MB

Statistics

Stars: 80
Watchers: 3
Forks: 3
Open Issues: 3
Releases: 2

Topics

benchmarking forecasting weather

Created over 1 year ago · Last pushed 11 months ago

Metadata Files

Readme Contributing License Codeowners

StationBench

StationBench is a Python library for benchmarking weather forecasts against weather station data. It provides tools to calculate metrics, visualize results, and compare different forecast models.

Features

Pre-processed ground truth data from 10,000+ weather stations around the world included in the package
Calculate RMSE and other metrics between forecasts and ground truth data
Support for multiple weather variables (temperature, wind speed, solar radiation)
Regional analysis capabilities (Europe, North America, Global, etc.)
Integration with Weights & Biases for experiment tracking

Installation

bash pip install stationbench

Documentation

Full documentation is available in the docs/ directory: - Setup - How to setup StationBench - Tutorial - Basic usage of StationBench

Quick Start

Data Format Requirements

Forecast Data

Must include dimensions: latitude, longitude, time
Variables should include:
- 10mwindspeed (or custom name)
- 2m_temperature (or custom name)

Ground Truth Data

Stationbench comes with ready-to-use weather stations from around the world. The benchmarking data is a subset of the Meteostat dataset. It contains weather data from 2018-2024 for 10m wind speed and 2m temperature. The data is provided by the following organizations: - Deutscher Wetterdienst - NOAA - Government of Canada - MET Norway - European Data Portal - Offene Daten Österreich

Source: Meteostat (CC BY-NC 4.0)

The benchmarking data can be accessed from https://opendata.jua.ai/stationbench/meteostat_benchmark.zarr.

Map of weather stations used for benchmarking

Number of stations reporting over time

Besides the provided benchmarking data, you can also use your own ground truth data. The ground truth data must be in zarr format and must include the following dimensions and coordinates: - Must include dimensions: station_id, time - Must include coordinates: latitude, longitude

Calculate Metrics

This script computes metrics by comparing forecast data against ground truth data for specified time periods and regions. Output are RMSE, MBE and skill scores for different variables and lead times in the format of the ground truth data.

Options

--forecast: Location of the forecast data (required)
--stations: Location of the ground truth data (defaults to https://opendata.jua.ai/stationbench/meteostat_benchmark.zarr)
--start_date: Start date for benchmarking (required)
--end_date: End date for benchmarking (required)
--output: Output path for benchmarks (required)
--region: Region to benchmark (see regions.py for available regions)
--name_10m_wind_speed: Name of 10m wind speed variable (optional)
--name_2m_temperature: Name of 2m temperature variable (optional)
--use_dask: Enable parallel computation with Dask (recommended for datasets >10GB)
--n_workers: Number of Dask workers to use (default: 4, only used if --use_dask is set and no client exists)

If variable name is not provided, no metrics will be computed for that variable.

Compare forecasts

After generating the metrics, you can use the compare_forecasts.py script to compute metrics, create visualizations, and log the results to Weights & Biases (W&B).

What it does

The compare_forecasts.py script: 1. Computes RMSE (Root Mean Square Error) and skill scores for different variables and lead time ranges. 2. Generates geographical scatter plots showing the spatial distribution of errors. 3. Creates line plots showing the temporal evolution of errors. 4. Saves all visualizations and metrics to a directory, optionally logs to Weights & Biases.

Options

--benchmark_datasets_locs: Dictionary of reference benchmark locations, the skill score is computed between the first and the second dataset (required)
--regions: Comma-separated list of regions, see regions.py for available regions (required)
--wandb_run_name: Weights & Biases run name (optional), if not provided, Weights & Biases will not be used
--output_dir: Output directory for results (optional, defaults to stationbench-results)

Usage

StationBench can be used either as a Python package or through command-line interfaces.

Python Package Usage

```python import stationbench

Calculate metrics

stationbench.calculatemetrics( forecast="path/to/forecast.zarr", startdate="2023-01-01", enddate="2023-12-31", output="path/to/forecastmetrics.zarr", region="europe", name10mwindspeed="10si", name2m_temperature="2t" )

Compare forecasts

stationbench.compareforecasts( benchmarkdatasetslocs={"HRES": "path/to/hresmetrics.zarr", "ENS": "path/to/ens_metrics.zarr"}, regions=["europe"] ) ```

Command-Line Usage

Calculate metrics for a forecast dataset:

bash stationbench-calculate \ --forecast path/to/forecast.zarr \ --start_date 2023-01-01 \ --end_date 2023-12-31 \ --output path/to/forecast_metrics.zarr \ --region europe \ --name_10m_wind_speed "10si" \ --name_2m_temperature "2t" [--use_dask] # Optional: Enable parallel computation with Dask [--n_workers 4] # Optional: Number of Dask workers to use For small datasets, it's recommended to run without Dask. For large datasets (>10GB), enabling Dask with --use_dask can improve performance.

Compare forecasts: bash stationbench-compare \ --benchmark_datasets_locs '{"HRES": "path/to/hres_metrics.zarr", "ENS": "path/to/ens_metrics.zarr"}' \ --regions europe \ [--wandb_run_name "run_name"] \ [--output_dir "path/to/output_dir"]

Contributing

We welcome contributions! Please see our CONTRIBUTING.md for details.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Metrics

StationBench calculates the following verification metrics:

RMSE (Root Mean Square Error): Measures the average magnitude of forecast errors, giving greater weight to larger errors
MBE (Mean Bias Error): Measures the average direction and magnitude of forecast bias. Positive values indicate the forecast tends to overpredict, while negative values indicate underprediction.

We plan to add more benchmarking metrics in the future...

Regional Analysis

StationBench supports several predefined regions and allows you to create custom regions.

For details on creating and using custom regions, see the Custom Regions Guide.

Owner

Name: Jua
Login: juaAI
Kind: organization
Location: Germany

Website: jua.ai
Repositories: 2
Profile: https://github.com/juaAI

next generation environmental insights platform

GitHub Events

Total

Create event: 22
Issues event: 17
Release event: 3
Watch event: 67
Delete event: 24
Issue comment event: 10
Public event: 1
Push event: 75
Pull request event: 38
Pull request review event: 64
Pull request review comment event: 42
Fork event: 3

Last Year

Create event: 22
Issues event: 17
Release event: 3
Watch event: 67
Delete event: 24
Issue comment event: 10
Public event: 1
Push event: 75
Pull request event: 38
Pull request review event: 64
Pull request review comment event: 42
Fork event: 3

Committers

Last synced: about 1 year ago

All Time

Total Commits: 37
Total Committers: 3
Avg Commits per committer: 12.333
Development Distribution Score (DDS): 0.459

Past Year

Commits: 37
Committers: 3
Avg Commits per committer: 12.333
Development Distribution Score (DDS): 0.459

Top Committers

Name	Email	Commits
leoniewgnr	4****r	20
Andreas Schlueter	a**r@j**i	16
Alexander Jakob Dautel	h****e	1

Committer Domains (Top 20 + Academic)

jua.ai: 1

Issues and Pull Requests

Last synced: 9 months ago

All Time

Total issues: 15
Total pull requests: 53
Average time to close issues: 27 days
Average time to close pull requests: 2 days
Total issue authors: 6
Total pull request authors: 5
Average comments per issue: 0.07
Average comments per pull request: 0.34
Merged pull requests: 49
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 15
Pull requests: 53
Average time to close issues: 27 days
Average time to close pull requests: 2 days
Issue authors: 6
Pull request authors: 5
Average comments per issue: 0.07
Average comments per pull request: 0.34
Merged pull requests: 49
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

roansong (4)
deepweather (4)
leoniewgnr (3)
aschl (2)
manmeet3591 (1)
kevinjuaai (1)

Pull Request Authors

leoniewgnr (34)
aschl (13)
howtodowtle (2)
kevinjuaai (2)
niasie (2)

stationbench

Science Score: 49.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

StationBench

Features

Installation

Documentation

Quick Start

Data Format Requirements

Forecast Data

Ground Truth Data

Calculate Metrics

Options

Compare forecasts

What it does

Options

Usage

Python Package Usage

Calculate metrics

Compare forecasts

Command-Line Usage

Contributing

License

Metrics

Regional Analysis

Owner

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Committer Domains (Top 20 + Academic)

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels