https://github.com/arm-doe/arm-test-data

Test data for the atmospheric data community toolkit (ACT)

https://github.com/arm-doe/arm-test-data

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 4 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
    3 of 5 committers (60.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (4.0%) to scientific vocabulary

Keywords from Contributors

atmospheric-science meteorology corrections meteorological-data retrieval
Last synced: 7 months ago · JSON representation

Repository

Test data for the atmospheric data community toolkit (ACT)

Basic Info
  • Host: GitHub
  • Owner: ARM-DOE
  • License: mit
  • Language: Assembly
  • Default Branch: main
  • Size: 37.9 MB
Statistics
  • Stars: 5
  • Watchers: 2
  • Forks: 5
  • Open Issues: 0
  • Releases: 15
Created over 2 years ago · Last pushed 7 months ago
Metadata Files
Readme License

README.md

arm-test-data

CI PyPI Version Conda Version

A place to share atmospheric data with the community, shared throughout the Atmospheric Radiation Measurement user facility and beyond!

Sample data sets

These files are used as sample data in openradar examples/notebooks and are downloaded by arm-test-data package:

  • 201509021500.bi
  • AAFNAV_COR_20181104_R0.ict
  • AMF_US-CU1_BASE_HH_1-5.csv
  • AMF_US-CU1_BIF_20250318.xlsx
  • NEON.D18.BARR.DP1.00002.001.000.010.001.SAAT_1min.2022-10.expanded.20221107T205629Z.csv
  • NEON.D18.BARR.DP1.00002.001.sensor_positions.20221107T205629Z.csv
  • NEON.D18.BARR.DP1.00002.001.variables.20221201T110553Z.csv
  • anltwr_mar19met.data
  • ayp22199.21m
  • ayp22200.00m
  • brw21001.dat
  • brw_12_2020_hour.dat
  • brw_CCl4_Day.dat
  • co2_brw_surface-insitu_1_ccgg_MonthlyData.txt
  • ctd21125.15w
  • ctd22187.00t.txt
  • enametC1.b1.20221109.000000.cdf
  • gucmetM1.b1.20230301.000000.cdf
  • list_of_files.txt
  • maraosmetM1.a1.20180201.000000.nc
  • marirtsstM1.b1.20190320.000000.nc
  • marnavM1.a1.20180201.000000.nc
  • met_brw_insitu_1_obop_hour_2020.txt
  • met_lcl.nc
  • mosaossp2M1.00.20191216.000601.raw.20191216000000.ini
  • mosaossp2M1.00.20191216.130601.raw.20191216x193.sp2b
  • mosaossp2auxM1.00.20191217.010801.raw.20191216000000.hk
  • nsacloudphaseC1.c1.20180601.000000.nc
  • nsasurfspecalb1mlawerC1.c1.20160609.080000.nc
  • sgp30ebbrE13.b1.20190601.000000.nc
  • sgp30ebbrE32.b1.20191125.000000.nc
  • sgp30ebbrE32.b1.20191130.000000.nc
  • sgp30ecorE14.b1.20190601.000000.cdf
  • sgpaerich1C1.b1.20190501.000342.nc
  • sgpaosacsmE13.b2.20230420.000109.nc
  • sgpaosccn2colaE13.b1.20170903.000000.nc
  • sgpbrsC1.b1.20190705.000000.cdf
  • sgpceilC1.b1.20190101.000000.nc
  • sgpco2flx4mC1.b1.20201007.001500.nc
  • sgpdlppiC1.b1.20191015.120023.cdf
  • sgpdlppiC1.b1.20191015.121506.cdf
  • sgpirt25m20sC1.a0.20190601.000000.cdf
  • sgpmetE13.b1.20190101.000000.cdf
  • sgpmetE13.b1.20190102.000000.cdf
  • sgpmetE13.b1.20190103.000000.cdf
  • sgpmetE13.b1.20190104.000000.cdf
  • sgpmetE13.b1.20190105.000000.cdf
  • sgpmetE13.b1.20190106.000000.cdf
  • sgpmetE13.b1.20190107.000000.cdf
  • sgpmetE13.b1.20190508.000000.cdf
  • sgpmetE13.b1.20210401.000000.csv
  • sgpmetE13.b1.yaml
  • sgpmetE15.b1.20190508.000000.cdf
  • sgpmetE31.b1.20190508.000000.cdf
  • sgpmetE32.b1.20190508.000000.cdf
  • sgpmetE33.b1.20190508.000000.cdf
  • sgpmetE34.b1.20190508.000000.cdf
  • sgpmetE35.b1.20190508.000000.cdf
  • sgpmetE36.b1.20190508.000000.cdf
  • sgpmetE37.b1.20190508.000000.cdf
  • sgpmetE38.b1.20190508.000000.cdf
  • sgpmetE39.b1.20190508.000000.cdf
  • sgpmetE40.b1.20190508.000000.cdf
  • sgpmetE9.b1.20190508.000000.cdf
  • sgpmet_no_time.nc
  • sgpmet_test_time.nc
  • sgpmfrsr7nchE11.b1.20210329.070000.nc
  • sgpmmcrC1.b1.1.cdf
  • sgpmmcrC1.b1.2.cdf
  • sgpmplpolfsC1.b1.20190502.000000.cdf
  • sgprlC1.a0.20160131.000000.nc
  • sgpsebsE14.b1.20190601.000000.cdf
  • sgpsirsE13.b1.20190101.000000.cdf
  • sgpsondewnpnC1.b1.20190101.053200.cdf
  • sgpstampE13.b1.20200101.000000.nc
  • sgpstampE31.b1.20200101.000000.nc
  • sgpstampE32.b1.20200101.000000.nc
  • sgpstampE33.b1.20200101.000000.nc
  • sgpstampE34.b1.20200101.000000.nc
  • sgpstampE9.b1.20200101.000000.nc
  • sodar.20230404.mnd
  • twpsondewnpnC3.b1.20060119.050300.custom.cdf
  • twpsondewnpnC3.b1.20060119.112000.custom.cdf
  • twpsondewnpnC3.b1.20060119.163300.custom.cdf
  • twpsondewnpnC3.b1.20060119.231600.custom.cdf
  • twpsondewnpnC3.b1.20060120.043800.custom.cdf
  • twpsondewnpnC3.b1.20060120.111900.custom.cdf
  • twpsondewnpnC3.b1.20060120.170800.custom.cdf
  • twpsondewnpnC3.b1.20060120.231500.custom.cdf
  • twpsondewnpnC3.b1.20060121.051500.custom.cdf
  • twpsondewnpnC3.b1.20060121.111600.custom.cdf
  • twpsondewnpnC3.b1.20060121.171600.custom.cdf
  • twpsondewnpnC3.b1.20060121.231600.custom.cdf
  • twpsondewnpnC3.b1.20060122.052600.custom.cdf
  • twpsondewnpnC3.b1.20060122.111500.custom.cdf
  • twpsondewnpnC3.b1.20060122.171800.custom.cdf
  • twpsondewnpnC3.b1.20060122.232600.custom.cdf
  • twpsondewnpnC3.b1.20060123.052500.custom.cdf
  • twpsondewnpnC3.b1.20060123.111700.custom.cdf
  • twpsondewnpnC3.b1.20060123.171600.custom.cdf
  • twpsondewnpnC3.b1.20060123.231500.custom.cdf
  • twpsondewnpnC3.b1.20060124.051500.custom.cdf
  • twpsondewnpnC3.b1.20060124.111800.custom.cdf
  • twpsondewnpnC3.b1.20060124.171700.custom.cdf
  • twpsondewnpnC3.b1.20060124.231500.custom.cdf
  • twpvisstgridirtemp.c1.20050705.002500.nc
  • vdis.b1

Adding new datasets

To add a new dataset file, please follow these steps:

  1. Add the dataset file to the data/ directory
  2. From the command line, run python make_registry.py script to update the registry file residing in arm-test-data/registry.txt
  3. Commit and push your changes to GitHub

Using datasets in notebooks and/or scripts

  • Ensure the arm-test-data package is installed in your environment

```bash python -m pip install arm-test-data

# or

python -m pip install git+https://github.com/ARM-DOE/arm-test-data

# or

conda install -c conda-forge arm-test-data ```

  • Import DATASETS and inspect the registry to find out which datasets are available

```python In [1]: from armtestdata import DATASETS

In [2]: DATASETS.registryfiles Out[2]: ['samplefile.nc] ``

  • To fetch a data file of interest, use the .fetch method and provide the filename of the data file. This will

    • download and cache the file if it doesn't exist already.
    • retrieve and return the local path

```python In [4]: filepath = DATASETS.fetch('sample_data.nc')

In [5]: filepath Out[5]: '/Users/mgrover/Library/Caches/arm-test-data/samplesgpdata.nc' ```

  • Once you have access to the local filepath, you can then use it to load your dataset into pandas or xarray or your package of choice:

python In [6]: radar = pyart.io.read(filepath)

Changing the default data cache location

The default cache location (where the data are saved on your local system) is dependent on the operating system. You can use the locate() method to identify it:

python from arm_test_data import locate locate()

The location can be overwritten by the ACT_TEST_DATA_DIR environment variable to the desired destination.

References

Ameriflux data

AmeriFlux BASE: https://doi.org/10.17190/AMF/2531143 Citation: Bhupendra Raut, Sujan Pal, Paytsar Muradyan, Joseph R. O'Brien, Max Berkelhammer, Matthew Tuftedal, Max Grover, Scott Collis, Robert C. Jackson (2025), AmeriFlux BASE US-CU1 UIC Plant Research Laboratory Chicago, Ver. 1-5, AmeriFlux AMP, (Dataset). https://doi.org/10.17190/AMF/2531143

Owner

  • Name: ARM User Facility
  • Login: ARM-DOE
  • Kind: organization

GitHub Events

Total
  • Release event: 2
  • Watch event: 2
  • Delete event: 13
  • Issue comment event: 15
  • Push event: 11
  • Pull request review event: 12
  • Pull request event: 27
  • Create event: 12
Last Year
  • Release event: 2
  • Watch event: 2
  • Delete event: 13
  • Issue comment event: 15
  • Push event: 11
  • Pull request review event: 12
  • Pull request event: 27
  • Create event: 12

Committers

Last synced: almost 2 years ago

All Time
  • Total Commits: 27
  • Total Committers: 5
  • Avg Commits per committer: 5.4
  • Development Distribution Score (DDS): 0.444
Past Year
  • Commits: 27
  • Committers: 5
  • Avg Commits per committer: 5.4
  • Development Distribution Score (DDS): 0.444
Top Committers
Name Email Commits
mgrover1 m****x@g****m 15
zssherman s****1@g****m 8
AdamTheisen a****n@a****v 2
Bobby Jackson r****n@a****v 1
Ken Kehoe k****e@o****u 1
Committer Domains (Top 20 + Academic)
anl.gov: 2 ou.edu: 1

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 1
  • Total pull requests: 57
  • Average time to close issues: 8 minutes
  • Average time to close pull requests: about 12 hours
  • Total issue authors: 1
  • Total pull request authors: 6
  • Average comments per issue: 2.0
  • Average comments per pull request: 0.49
  • Merged pull requests: 54
  • Bot issues: 0
  • Bot pull requests: 28
Past Year
  • Issues: 0
  • Pull requests: 28
  • Average time to close issues: N/A
  • Average time to close pull requests: about 16 hours
  • Issue authors: 0
  • Pull request authors: 4
  • Average comments per issue: 0
  • Average comments per pull request: 0.68
  • Merged pull requests: 25
  • Bot issues: 0
  • Bot pull requests: 24
Top Authors
Issue Authors
  • AdamTheisen (1)
Pull Request Authors
  • dependabot[bot] (28)
  • zssherman (15)
  • AdamTheisen (5)
  • kenkehoe (4)
  • mgrover1 (4)
  • rcjackson (1)
Top Labels
Issue Labels
Pull Request Labels
dependencies (28) github_actions (3)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 3,675 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 15
  • Total maintainers: 2
pypi.org: arm-test-data

Provides utility functions for accessing data repository for ARM data examples/notebooks

  • Documentation: https://arm-test-data.readthedocs.io/
  • License: MIT License Copyright (c) 2023 ARM User Facility Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
  • Latest release: 0.1.1
    published 12 months ago
  • Versions: 15
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 3,675 Last month
Rankings
Dependent packages count: 9.9%
Average: 38.8%
Dependent repos count: 67.8%
Maintainers (2)
Last synced: 7 months ago

Dependencies

.github/workflows/ci.yaml actions
  • actions/checkout v3 composite
  • codecov/codecov-action v3.1.4 composite
  • fkirc/skip-duplicate-actions master composite
  • mamba-org/provision-with-micromamba main composite
  • styfle/cancel-workflow-action 0.12.0 composite
.github/workflows/pypi-release.yml actions
  • actions/checkout v3 composite
  • actions/download-artifact v3 composite
  • actions/setup-python v4 composite
  • actions/upload-artifact v3 composite
  • pypa/gh-action-pypi-publish v1.8.10 composite
pyproject.toml pypi
requirements.txt pypi
  • pooch *
ci/environment.yml pypi