eurec4a-intake

Intake catalogue for EUREC4A field campaign datasets

https://github.com/eurec4a/eurec4a-intake

Science Score: 77.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
    6 of 17 committers (35.3%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.8%) to scientific vocabulary
Last synced: 6 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: eurec4a
  • Language: Python
  • Default Branch: master
  • Size: 426 KB
Statistics
  • Stars: 17
  • Watchers: 2
  • Forks: 18
  • Open Issues: 9
  • Releases: 1
Created over 5 years ago · Last pushed 8 months ago
Metadata Files
Readme Changelog Contributing Citation Zenodo

README.md

EUREC4A Intake catalogue

This repository contains an intake catalogue for accessing data from the EUREC4A field campaign. The data are stored at 1) AERIS, 2) Munich University (via OPeNDAP) and 3) NOAA's National Centers for Environmental Information (via OPeNDAP), and the catalogue also covers 4) data linked via IPFS.

Usage

To use the catalogue, you will need to install intake, xarray, intake-xarray, zarr, pydap, requests, s3fs and ipfsspec:

```bash
pip install "intake<2.0.0" xarray intake-xarray zarr pydap s3fs requests ipfsspec
```

Or, if you feel courageous (and want the newest updates), you can also install the requirements.txt directly:

```bash
pip install -r https://raw.githubusercontent.com/eurec4a/eurec4a-intake/master/requirements.txt
```
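Before opening the catalogue, it can be useful to confirm that every required package is visible to the interpreter. The following is a minimal sketch using only the standard library. The helper name `missing_packages` is hypothetical, and the import names are assumed to equal the PyPI names with hyphens replaced by underscores (so `intake-xarray` is looked up as `intake_xarray`).

```python
from importlib.util import find_spec

# Import names assumed for the packages listed in the install command
# (assumption: "intake-xarray" is imported as "intake_xarray").
REQUIRED = [
    "intake", "xarray", "intake_xarray", "zarr",
    "pydap", "requests", "s3fs", "ipfsspec",
]


def missing_packages(names=REQUIRED):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if find_spec(name) is None]


missing = missing_packages()
print("Missing packages:", ", ".join(missing) if missing else "none")
```

This only checks that the packages can be located; it does not verify the `intake<2.0.0` version pin.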

The catalogue (and underlying data) can then be accessed directly from Python:

```python
from intake import open_catalog

# Open the catalogue straight from the master branch of this repository
cat = open_catalog("https://raw.githubusercontent.com/eurec4a/eurec4a-intake/master/catalog.yml")
```
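For reproducible analyses it may help to open the catalogue at a fixed Git reference instead of the moving `master` branch. The sketch below only builds the URL. `catalog_url` is a hypothetical helper, and whether a given tag (for example `v1.0.0`, the version named in the citation section) exists in the repository is an assumption that should be checked first.

```python
RAW_BASE = "https://raw.githubusercontent.com/eurec4a/eurec4a-intake"


def catalog_url(ref="master"):
    """Build the raw GitHub URL of catalog.yml for a branch, tag or commit hash."""
    return f"{RAW_BASE}/{ref}/catalog.yml"


# Default: the master-branch URL used in this README.
print(catalog_url())
# Pinned to a tag (assumption: such a tag exists in the repository).
print(catalog_url("v1.0.0"))
```

The returned string can be passed to intake's catalogue-opening function in place of the hard-coded URL.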

You can list the available sources with:

```python
>>> list(cat)
['radiosondes', 'barbados', 'dropsondes', 'halo', 'p3', 'specmacs']

>>> list(cat.radiosondes)
['atalante_meteomodem', 'atalante_vaisala', 'bco', 'meteor', 'ms_merian', 'ronbrown']
```
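Because the catalogue is nested, listing one level at a time can become tedious. The sketch below walks a catalogue recursively and yields dotted names for all leaf entries. The function name `walk` is hypothetical, and it assumes that nested catalogues expose a `container` attribute equal to `"catalog"`, as catalogue objects do in intake versions below 2.0; other versions may behave differently.

```python
def walk(cat, prefix=""):
    """Yield dotted names of all non-catalogue entries below `cat`."""
    for name in list(cat):
        entry = cat[name]
        path = f"{prefix}.{name}" if prefix else name
        # Assumption: sub-catalogues report container == "catalog" (intake < 2).
        if getattr(entry, "container", None) == "catalog":
            yield from walk(entry, path)
        else:
            yield path


# Usage (needs network access and an opened catalogue `cat`):
# for dataset_name in walk(cat):
#     print(dataset_name)
```

Note that visiting an entry may trigger a download of the nested catalogue file it refers to, so a full walk can take a while.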

Then load up a dask-backed xarray.Dataset so that you have access to all the available variables and attributes in the dataset:

```python
>>> ds = cat.radiosondes.ronbrown.to_dask()
>>> ds
Dimensions:      (alt: 3100, nv: 2, sounding: 329)
Coordinates:
  * alt          (alt) int16 0 10 20 30 40 50 ... 30950 30960 30970 30980 30990
    flight_time  (sounding, alt) datetime64[ns] dask.array
    lat          (sounding, alt) float32 dask.array
    lon          (sounding, alt) float32 dask.array
    sounding_id  (sounding) |S1000 dask.array
Dimensions without coordinates: nv, sounding
Data variables:
    N_gps        (sounding, alt) float32 dask.array
    N_ptu        (sounding, alt) float32 dask.array
    alt_bnds     (alt, nv) int16 dask.array
    ...
```

You can then slice and access the data as if it were available locally.
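As an illustration of such slicing, the sketch below selects a single sounding and keeps only the lower part of the profile. The helper name `low_level_profile` and the default values are hypothetical. It relies only on the dimension names shown in the dataset summary above: `sounding` (no coordinate, so selected by position) and `alt` (an indexed coordinate, so selected by label).

```python
def low_level_profile(ds, sounding=0, max_alt=5000):
    """Select one sounding and keep levels with 0 <= alt <= max_alt.

    `max_alt` is given in the units of the `alt` coordinate.
    """
    # `sounding` has no coordinate values, so use positional indexing;
    # `alt` has an index, so a label-based slice (inclusive) works.
    return ds.isel(sounding=sounding).sel(alt=slice(0, max_alt))


# Usage (needs network access and an opened catalogue `cat`):
# ds = cat.radiosondes.ronbrown.to_dask()
# profile = low_level_profile(ds, sounding=0, max_alt=5000)
# profile.load()  # only at this point is the selected data fetched
```

Since the dataset is dask-backed, the selection itself is lazy; data are transferred only when values are actually needed, for example on `load()` or when plotting.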

Contributing

Please have a look at our contribution guide.

Owner

  • Name: eurec4a
  • Login: eurec4a
  • Kind: organization

Citation (CITATION.md)

# Citing

Please cite the catalog as follows when referring to a specific version of the catalog, e.g., in the data availability section:

```bibtex
@dataset{eurec4a_community_2023,
  title = {{{EUREC4A}}},
  author = {{community}, EUREC4A},
  date = {2023-10},
  publisher = {{Zenodo}},
  doi = {10.5281/zenodo.8422322},
  version = {v1.0.0}
}
```

If the catalog is cited within the main body of the manuscript, please reference the general version of the catalog, available at https://zenodo.org/doi/10.5281/zenodo.8422321.

> [!IMPORTANT]
> The catalog only links to datasets and has no influence or control over the data itself.
> To improve reproducibility and to give credit to the authors of the underlying datasets,
> it is recommended to cite each dataset individually in addition to the catalog itself.

## Example citation for data availability section

> The EUREC4A data used in this study is freely available and can be easily accessed via the EUREC4A-Intake catalog at
https://github.com/eurec4a/eurec4a-intake as described at howto.eurec4a.eu. The specific data sets used from
the catalog v1.0.0 (EUREC4A community, 2023) include the JOANNE dropsonde data set (George, 2021),
the EUREC4A radiosonde data set (Stephan et al., 2020),...

GitHub Events

Total
  • Issues event: 2
  • Issue comment event: 7
  • Push event: 4
  • Pull request review event: 1
  • Pull request event: 3
  • Create event: 2
Last Year
  • Issues event: 2
  • Issue comment event: 7
  • Push event: 4
  • Pull request review event: 1
  • Pull request event: 3
  • Create event: 2

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 363
  • Total Committers: 17
  • Avg Commits per committer: 21.353
  • Development Distribution Score (DDS): 0.529
Past Year
  • Commits: 14
  • Committers: 2
  • Avg Commits per committer: 7.0
  • Development Distribution Score (DDS): 0.071
Top Committers
Name                    Email           Commits
Hauke Schulz            4****s          171
Tobias Kölling          t****g@m****e   72
Robert Pincus           R****s@c****u   32
Leif Denby              l****f@d****u   23
Theresa Mieslinger      t****r@g****m   18
Johannes Röttenbacher   j****r@w****e   9
JuleRadtke              j****e@u****e   9
Marek Jacob             M****X          5
Fredrik Jansson         f****n@a****i   5
Lea Volkmer             L****r@p****e   4
Johannes Röttenbacher   4****r          3
Clau Clouds             3****s          3
Xuanyu Chen             x****n@c****u   2
Veronika Pörtge         v****e@g****m   2
lbariteau               7****u          2
leonie-villiger         l****r@e****h   2
h_campos                g****b@c****e   1
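
The all-time summary figures can be reproduced from the committer counts. The sketch below assumes the Development Distribution Score is defined as one minus the top committer's share of all commits. That definition is an assumption, not something stated on this page, although it is consistent with the reported values. The function name is hypothetical.

```python
def development_distribution_score(top_committer_commits, total_commits):
    """Assumed definition: 1 - (commits by top committer / total commits)."""
    return 1 - top_committer_commits / total_commits


# All-time figures: 363 commits, 17 committers, top committer with 171 commits.
print(round(363 / 17, 3))                                   # 21.353
print(round(development_distribution_score(171, 363), 3))   # 0.529
```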

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 17
  • Total pull requests: 95
  • Average time to close issues: 3 months
  • Average time to close pull requests: about 2 months
  • Total issue authors: 6
  • Total pull request authors: 12
  • Average comments per issue: 4.76
  • Average comments per pull request: 1.97
  • Merged pull requests: 79
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 4
  • Average time to close issues: N/A
  • Average time to close pull requests: about 6 hours
  • Issue authors: 1
  • Pull request authors: 1
  • Average comments per issue: 5.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • observingClouds (10)
  • suessspeise (2)
  • d70-t (2)
  • martinjanssens (1)
  • leifdenby (1)
  • franslql (1)
Pull Request Authors
  • observingClouds (62)
  • d70-t (18)
  • leifdenby (4)
  • RobertPincus (4)
  • lbariteau (2)
  • jroettenbacher (2)
  • leonie-villiger (2)
  • fjansson (2)
  • vpoertge (1)
  • suessspeise (1)
  • lvol08 (1)
  • ClauClouds (1)
Top Labels
Issue Labels
  • data version update (3)
  • data source changed (3)
  • bug (2)
  • enhancement (1)
  • good first issue (1)
  • upstream (1)
Pull Request Labels
  • data version update (15)
  • data source changed (14)
  • new dataset (8)
  • infrastructure (1)
  • bug (1)
  • enhancement (1)

Packages

  • Total packages: 1
  • Total downloads: unknown
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 1
proxy.golang.org: github.com/eurec4a/eurec4a-intake
  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 5.4%
Average: 5.6%
Dependent repos count: 5.8%
Last synced: 6 months ago

Dependencies

requirements.txt pypi
  • fsspec >=0.7.4
  • intake *
  • intake-xarray >=0.3.2
  • ipfsspec *
  • pydap *
  • requests *
  • s3fs *
  • xarray *
  • zarr *
.github/workflows/python-package.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
.github/workflows/weekly_tests.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v4 composite