eurocropsml

EuroCropsML is a ready-to-use benchmark dataset for few-shot crop type classification using Sentinel-2 imagery.

https://github.com/dida-do/eurocropsml

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 13 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.5%) to scientific vocabulary

Keywords

agriculture crop-classification dataset deep-learning earth-observation machine-learning sentinel-2 torch
Last synced: 6 months ago · JSON representation

Repository

EuroCropsML is a ready-to-use benchmark dataset for few-shot crop type classification using Sentinel-2 imagery.

Basic Info
Statistics
  • Stars: 20
  • Watchers: 1
  • Forks: 1
  • Open Issues: 4
  • Releases: 6
Topics
agriculture crop-classification dataset deep-learning earth-observation machine-learning sentinel-2 torch
Created almost 2 years ago · Last pushed 6 months ago
Metadata Files
Readme Changelog Contributing License Code of conduct

README.md

EuroCropsML

Ready-to-use benchmark dataset for few-shot crop type classification using Sentinel-2 imagery.

Part of the PreTrainAppEO ("Pre-Training Applicability in Earth Observation") research project.

Read the Docs PyPI Version Python Version GitHub License Zenodo DOI <!-- badges end -->

EuroCropsML is a pre-processed and ready-to-use machine learning dataset for crop type classification of agricultural parcels in Europe. It consists of a total of 706,683 Sentinel-2 multi-class labeled data points with a total of 176 distinct classes. Each data point contains an annual time series of per parcel median pixel values of Sentinel-2 L1C (top-of-atmosphere) reflectance data for the year 2021. The dataset is based on Version 9 of EuroCrops, an open-source collection of remote sensing reference data.

For EuroCropsML, we acquired and aggregated data for the following countries:

| Country | Number of distinct classes | Total number of datapoints for Sentinel-2 | |--------------|----------------------------|-------------------------------------------| | Estonia | 127 | 175,906 | | Latvia | 103 | 431,143 | | Portugal | 79 | 99,634 | |

Spatial distribution of labels within Estland and Latvia. Spatial distribution of labels within Portugal.

The distribution of class labels differs substantially between the regions of Estonia, Latvia, and Portugal. This makes transferring knowledge gained in one region to another region quite challenging, especially if only few labeled data points are available. Therefore, this dataset is particularly suited to explore transfer-learning methods for few-shot crop type classification.

The data acquisition, aggregation, and pre-processing steps are schematically illustrated below. A more detailed description is given in the dataset section of our documentation.

Data Acquisition Pipeline. <!-- teaser-end -->

Getting Started

eurocropsml is a Python package hosted on PyPI.

Installation

The recommended installation method is pip-installing into a virtual environment:

console $ python -Im pip install eurocropsml

Usage Guide

The quickest way to interact with the eurocropsml package and get started is to use the EuroCropsML dataset is via the provided command-line interface (CLI).

For example, to get help on available commands and options, use console $ eurocropsml-cli --help

To show the currently used (default) configuration for the eurocropsml dataset CLI, use console $ eurocropsml-cli datasets eurocrops config

To download the EuroCropsML dataset as currently configured, use console $ eurocropsml-cli datasets eurocrops download

Alternatively, the dataset can also be manually downloaded from our Zenodo repository.

A comprehensive documentation of the CLI can be found in the CLI Reference section of our documentation.

For a complete example use-case demonstrating the ready-to-use EuroCropsML dataset in action, please refer to the project's associated official repository for benchmarking meta-learning algorithms.

Project Information

The eurocropsml code repository is released under the MIT License. Its documentation lives at Read the Docs, the code on GitHub and the latest release can by found on PyPI. It is tested on Python 3.10+.

If you would like to contribute to eurocropsml you are most welcome. We have written a short guide to help you get started.

Background

The EuroCropsML dataset and associated eurocropsml code repository are provided and developed as part of the joint PretrainAppEO research project by the chair of Remote Sensing Technology at Technical University Munich and dida. <!-- project-background-middle -->

The goal of the project is to investigate methods that rely on the approach of pre-training and fine-tuning machine learning models in order to improve generalizability for various standard applications in Earth observation and remote sensing.

The ready-to-use EuroCopsML dataset is developed for the purpose of improving and benchmarking few-shot crop type classification methods.

EuroCropsML is based on Version 9 of EuroCrops, an open-source collection of remote sensing reference data for agriculture from countries of the European Union. <!-- project-background-end -->

Citation

If you use the EuroCropsML dataset or eurocropsml code repository in your research, please cite our project as follows:

Plain text text Reuss, J., Macdonald, J., Becker, S. et al. The EuroCropsML time series benchmark dataset for few-shot crop type classification in Europe. Sci Data 12, 664 (2025). https://doi.org/10.1038/s41597-025-04952-7

text Reuss, J., & Macdonald, J. (2024). EuroCropsML [dataset]. Zenodo. https://doi.org/10.5281/zenodo.10629609 Bibtex text @article{reuss_macdonald_eurocropsml_nsd_2025, author = {Reuss, Joana and Macdonald, Jan and Becker, Simon and Richter, Lorenz and K{\"o}rner, Marco}, title = {The EuroCropsML time series benchmark dataset for few-shot crop type classification in Europe}, journal = {Scientific Data}, year = {2025}, volume = {12}, note = {664}, issn = {2052-4463}, doi = {10.1038/s41597-025-04952-7}, url = {https://doi.org/10.1038/s41597-025-04952-7} }

text @misc{reuss_macdonald_eurocropsml_zenodo_2024, author = {Reuss, Joana and Macdonald, Jan}, title = {EuroCropsML}, year = {2024}, publisher = {Zenodo}, doi = {10.5281/zenodo.10629609}, url = {https://doi.org/10.5281/zenodo.10629609} }

Acknowledgments & Funding

The PreTrainAppEO research project is funded by the German Space Agency at DLR on behalf of the Federal Ministry for Economic Affairs and Climate Action (BMWK). <!-- further-info-end -->

Owner

  • Name: dida
  • Login: dida-do
  • Kind: organization
  • Location: Berlin

GitHub Events

Total
  • Create event: 19
  • Release event: 2
  • Issues event: 47
  • Watch event: 14
  • Delete event: 20
  • Member event: 1
  • Issue comment event: 9
  • Push event: 164
  • Pull request review comment event: 43
  • Pull request review event: 57
  • Pull request event: 37
Last Year
  • Create event: 19
  • Release event: 2
  • Issues event: 47
  • Watch event: 14
  • Delete event: 20
  • Member event: 1
  • Issue comment event: 9
  • Push event: 164
  • Pull request review comment event: 43
  • Pull request review event: 57
  • Pull request event: 37

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 65
  • Total Committers: 3
  • Avg Commits per committer: 21.667
  • Development Distribution Score (DDS): 0.462
Past Year
  • Commits: 25
  • Committers: 3
  • Avg Commits per committer: 8.333
  • Development Distribution Score (DDS): 0.28
Top Committers
Name Email Commits
jmaces j****s@h****e 35
jsreuss 7****s 28
katya 6****o 2
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 58
  • Total pull requests: 75
  • Average time to close issues: 15 days
  • Average time to close pull requests: 13 days
  • Total issue authors: 3
  • Total pull request authors: 3
  • Average comments per issue: 0.07
  • Average comments per pull request: 0.17
  • Merged pull requests: 60
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 31
  • Pull requests: 44
  • Average time to close issues: 8 days
  • Average time to close pull requests: 17 days
  • Issue authors: 2
  • Pull request authors: 3
  • Average comments per issue: 0.06
  • Average comments per pull request: 0.11
  • Merged pull requests: 34
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • jsreuss (52)
  • jmaces (4)
  • katyagikalo (2)
Pull Request Authors
  • jsreuss (58)
  • jmaces (10)
  • katyagikalo (7)
Top Labels
Issue Labels
enhancement (22) bug (18) documentation (6) wontfix (1)
Pull Request Labels
bug (4) documentation (3) enhancement (2)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 44 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 6
  • Total maintainers: 1
pypi.org: eurocropsml

EuroCropsML is a ready-to-use benchmark dataset for few-shot crop type classification using Sentinel-2 imagery.

  • Versions: 6
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 44 Last month
Rankings
Dependent packages count: 9.5%
Forks count: 33.0%
Average: 36.9%
Stargazers count: 42.7%
Dependent repos count: 62.6%
Maintainers (1)
Last synced: 6 months ago