Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: sciencedirect.com, nature.com
  • Committers with academic emails
    4 of 11 committers (36.4%) from academic institutions
  • Institutional organization owner
    Organization niehs has institutional domain (www.niehs.nih.gov)
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.8%) to scientific vocabulary
Last synced: 6 months ago · JSON representation

Repository

Basic Info
Statistics
  • Stars: 10
  • Watchers: 3
  • Forks: 2
  • Open Issues: 22
  • Releases: 0
Created about 2 years ago · Last pushed 6 months ago
Metadata Files
Readme Changelog License

README.md

amadeus amadeus website

R-CMD-check cov lint pkgdown Project Status: Active  The project has reached a stable, usable state and is being actively developed. CRAN downloads

amadeus is a mechanism for data, environments, and user setup for common environmental and weather datasets in R. amadeus has been developed to improve access to and utility with large scale, publicly available environmental data in R.

See the peer-reviewed publication, Amadeus: Accessing and analyzing large scale environmental data in R, for full description and details.

Cite amadeus as:

Manware, M., Song, I., Marques, E. S., Kassien, M. A., Clark, L. P., & Messier, K. P. (2025). Amadeus: Accessing and analyzing large scale environmental data in R. Environmental Modelling & Software, 186, 106352.

Installation

amadeus can be installed from CRAN with install.packages or from GitHub with pak.

r install.packages("amadeus") r pak::pak("NIEHS/amadeus")

Download

download_data accesses and downloads raw geospatial data from a variety of open source data repositories. The function is a wrapper that calls source-specific download functions, each of which account for the source's unique combination of URL, file naming conventions, and data types. Download functions cover the following sources:

| Data Source | File Type | Data Genre | Spatial Extent | Function Suffix | | :---- | :-- | :--- | :--- | :--- | | Climatology Lab TerraClimate | netCDF | Meteorology | Global | _terraclimate | | Climatology Lab GridMet | netCDF | Climate
Water | Contiguous United States | _gridmet | | Kppen-Geiger Climate Classification | GeoTIFF | Climate Classification | Global | _koppen_geiger | | MRLC[^1] Consortium National Land Cover Database (NLCD) | GeoTIFF | Land Use | United States | _nlcd | | NASA[^2] Moderate Resolution Imaging Spectroradiometer (MODIS) | HDF | Atmosphere
Meteorology
Land Use
Satellite | Global | _modis | | NASA Modern-Era Retrospective analysis for Research and Applications, Version 2 (MERRA-2) | netCDF | Atmosphere
Meteorology | Global | _merra2 | | NASA SEDAC[^3] UN WPP-Adjusted Population Density | GeoTIFF
netCDF | Population | Global | _population | | NASA SEDAC Global Roads Open Access Data Set | Shapefile
Geodatabase | Roadways | Global | _groads | | NASA Goddard Earth Observing System Composition Forcasting (GEOS-CF) | netCDF | Atmosphere
Meteorology | Global | _geos | | NOAA Hazard Mapping System Fire and Smoke Product | Shapefile
KML | Wildfire Smoke | North America | _hms | | NOAA NCEP[^4] North American Regional Reanalysis (NARR) | netCDF | Atmosphere
Meteorology | North America | _narr | | US EPA[^5] Air Data Pre-Generated Data Files | CSV | Air Pollution | United States | _aqs | | US EPA Ecoregions | Shapefile | Climate Regions | North America | _ecoregions | | US EPA National Emissions Inventory (NEI) | CSV | Emissions | United States | _nei | | US EPA Toxic Release Inventory (TRI) Program | CSV | Chemicals
Pollution | United States | _tri | | USGS[^6] Global Multi-resolution Terrain Elevation Data (GMTED2010) | ESRI ASCII Grid | Elevation | Global | _gmted |

See the "download_data" vignette for a detailed description of source-specific download functions.

Example use of download_data using NOAA NCEP North American Regional Reanalysis's (NARR) "weasd" (Daily Accumulated Snow at Surface) variable.

r directory <- "/ EXAMPLE / FILE / PATH /" download_data( dataset_name = "narr", year = 2022, variable = "weasd", directory_to_save = directory, acknowledgement = TRUE, download = TRUE, hash = TRUE ) Downloading requested files... Requested files have been downloaded. [1] "5655d4281b76f4d4d5bee234c2938f720cfec879" r list.files(file.path(directory, "weasd")) [1] "weasd.2022.nc"

Process

process_covariates imports and cleans raw geospatial data (downloaded with download_data), and returns a single SpatRaster or SpatVector into the user's R environment. process_covariates "cleans" the data by defining interpretable layer names, ensuring a coordinate reference system is present, and managing `timedata (if applicable).

To avoid errors when using process_covariates, do not edit the raw downloaded data files. Passing user-generated or edited data into process_covariates may result in errors as the underlying functions are adapted to each sources' raw data file type.

Example use of process_covariates using the downloaded "weasd" data.

r weasd_process <- process_covariates( covariate = "narr", date = c("2022-01-01", "2022-01-05"), variable = "weasd", path = file.path(directory, "weasd"), extent = NULL ) Detected monolevel data... Cleaning weasd data for 2022... Returning daily weasd data from 2022-01-01 to 2022-01-05. r weasd_process class : SpatRaster dimensions : 277, 349, 5 (nrow, ncol, nlyr) resolution : 32462.99, 32463 (x, y) extent : -16231.49, 11313351, -16231.5, 8976020 (xmin, xmax, ymin, ymax) coord. ref. : +proj=lcc +lat_0=50 +lon_0=-107 +lat_1=50 +lat_2=50 +x_0=5632642.22547 +y_0=4612545.65137 +datum=WGS84 +units=m +no_defs source : weasd.2022.nc:weasd varname : weasd (Daily Accumulated Snow at Surface) names : weasd_20220101, weasd_20220102, weasd_20220103, weasd_20220104, weasd_20220105 unit : kg/m^2, kg/m^2, kg/m^2, kg/m^2, kg/m^2 time : 2022-01-01 to 2022-01-05 UTC

Calculate Covariates

calculate_covariates stems from the beethoven project's need for various types of data extracted at precise locations. calculate_covariates, therefore, extracts data from the "cleaned" SpatRaster or SpatVector object at user defined locations. Users can choose to buffer the locations. The function returns a data.frame, sf, or SpatVector with data extracted at all locations for each layer or row in the SpatRaster or SpatVector object, respectively.

Example of calculate_covariates using processed "weasd" data.

r locs <- data.frame(id = "001", lon = -78.8277, lat = 35.95013) weasd_covar <- calculate_covariates( covariate = "narr", from = weasd_process, locs = locs, locs_id = "id", radius = 0, geom = "sf" ) Detected `data.frame` extraction locations... Calculating weasd covariates for 2022-01-01... Calculating weasd covariates for 2022-01-02... Calculating weasd covariates for 2022-01-03... Calculating weasd covariates for 2022-01-04... Calculating weasd covariates for 2022-01-05... Returning extracted covariates. r weasd_covar Simple feature collection with 5 features and 3 fields Geometry type: POINT Dimension: XY Bounding box: xmin: 8184606 ymin: 3523283 xmax: 8184606 ymax: 3523283 Projected CRS: unnamed id time weasd_0 geometry 1 001 2022-01-01 0.000000000 POINT (8184606 3523283) 2 001 2022-01-02 0.000000000 POINT (8184606 3523283) 3 001 2022-01-03 0.000000000 POINT (8184606 3523283) 4 001 2022-01-04 0.000000000 POINT (8184606 3523283) 5 001 2022-01-05 0.001953125 POINT (8184606 3523283)

Connecting Health Outcomes Research Data Systems

The amadeus package has been developed as part of the National Institute of Environmental Health Science's (NIEHS) Connecting Health Outcomes Research Data Systems (CHORDS) program. CHORDS aims to "build and strengthen data infrastructure for patient-centered outcomes research on environment and health" by providing curated data, analysis tools, and educational resources.

Additional Resources

The following R packages can also be used to access environmental and weather data in R, but each differs from amadeus in the data sources covered or type of functionality provided.

| Package | Source | | :--- | :----- | | dataRetrieval | USGS Hydrological Data and EPA Water Quality Data | | daymetr | Daymet | | ecmwfr | ECMWF Reanalysis v5 (ERA5) | | rNOMADS | NOAA Operational Model Archive and Distribution System | | sen2r[^8] | Sentinel-2 |

Contribution

To add or edit functionality for new data sources or datasets, open a Pull request into the main branch with a detailed description of the proposed changes. Pull requests must pass all status checks, and then will be approved or rejected by amadeus's authors.

Utilize Issues to notify the authors of bugs, questions, or recommendations. Identify each issue with the appropriate label to help ensure a timely response.

[^1]: Multi-Resolution Land Characteristics [^2]: National Aeronautics and Space Administration [^3]: Socioeconomic Data and Applications Center [^4]: National Centers for Environmental Prediction [^5]: United States Environmental Protection Agency [^6]: United States Geological Survey [^7]: Last updated more than two years ago. [^8]: Archived; no longer maintained.

Owner

  • Name: National Institute of Environmental Health Science
  • Login: NIEHS
  • Kind: organization
  • Location: Durham, NC

The mission of the National Institute of Environmental Health Sciences is to discover how the environment affects people in order to promote healthier lives.

GitHub Events

Total
  • Issues event: 54
  • Watch event: 3
  • Delete event: 17
  • Issue comment event: 55
  • Push event: 156
  • Pull request review comment event: 1
  • Pull request review event: 17
  • Pull request event: 29
  • Fork event: 1
  • Create event: 16
Last Year
  • Issues event: 54
  • Watch event: 3
  • Delete event: 17
  • Issue comment event: 55
  • Push event: 157
  • Pull request review comment event: 1
  • Pull request review event: 17
  • Pull request event: 30
  • Fork event: 1
  • Create event: 16

Committers

Last synced: 6 months ago

All Time
  • Total Commits: 590
  • Total Committers: 11
  • Avg Commits per committer: 53.636
  • Development Distribution Score (DDS): 0.317
Past Year
  • Commits: 135
  • Committers: 5
  • Avg Commits per committer: 27.0
  • Development Distribution Score (DDS): 0.148
Top Committers
Name Email Commits
mitchellmanware m****e@g****m 403
Insang Song i****g@n****v 117
Eva Marques m****l@c****v 19
Insang Song s****x@h****m 12
MAKassien m****a@g****m 8
kyle-messier k****r@n****v 8
Eva Marques m****l@e****v 7
{SET}group 1****r@u****m 7
MAKassien 7****n@u****m 3
Mitchell Manware 1****e@u****m 3
{SET}group 1****y@u****m 3
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 84
  • Total pull requests: 121
  • Average time to close issues: about 1 month
  • Average time to close pull requests: 1 day
  • Total issue authors: 6
  • Total pull request authors: 5
  • Average comments per issue: 2.0
  • Average comments per pull request: 0.91
  • Merged pull requests: 88
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 36
  • Pull requests: 40
  • Average time to close issues: 11 days
  • Average time to close pull requests: 2 days
  • Issue authors: 5
  • Pull request authors: 4
  • Average comments per issue: 0.94
  • Average comments per pull request: 0.38
  • Merged pull requests: 26
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • mitchellmanware (43)
  • kyle-messier (21)
  • sigmafelix (9)
  • eva0marques (8)
  • MAKassien (2)
  • cmj2025 (1)
Pull Request Authors
  • mitchellmanware (65)
  • sigmafelix (24)
  • kyle-messier (19)
  • MAKassien (8)
  • eva0marques (5)
Top Labels
Issue Labels
manuscript (25) enhancement (15) bug (9) documentation (5) new dataset (4) checklist (4) question (3) recommendation (2) help wanted (2)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 270 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 7
  • Total maintainers: 1
cran.r-project.org: amadeus

Accessing and Analyzing Large-Scale Environmental Data

  • Versions: 7
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 270 Last month
Rankings
Dependent packages count: 28.4%
Dependent repos count: 35.0%
Average: 50.1%
Downloads: 86.8%
Maintainers (1)
Last synced: 6 months ago

Dependencies

.github/workflows/check-standard.yaml actions
  • actions/checkout v4 composite
  • r-lib/actions/check-r-package v2 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
.github/workflows/lint.yaml actions
  • actions/checkout v4 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
.github/workflows/pkgdown.yaml actions
  • JamesIves/github-pages-deploy-action v4.4.1 composite
  • actions/checkout v4 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
DESCRIPTION cran
  • R >= 4.1.0 depends
  • data.table * imports
  • doParallel * imports
  • dplyr * imports
  • exactextractr * imports
  • foreach * imports
  • future * imports
  • httr * imports
  • methods * imports
  • parallelly * imports
  • rvest * imports
  • sf * imports
  • stars * imports
  • stats * imports
  • stringr * imports
  • terra * imports
  • testthat >= 3.0.0 imports
  • utils * imports
  • FNN * suggests
  • covr * suggests
  • doRNG * suggests
  • knitr * suggests
  • lwgeom * suggests
  • rmarkdown * suggests
  • sftime * suggests
  • withr * suggests
.github/workflows/test-coverage-local.yaml actions
  • actions/cache v2 composite
  • actions/checkout v4 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
  • stefanzweifel/git-auto-commit-action v4 composite