cosore

Data, metadata, and software tools for the COSORE database of continuous soil respiration measurements

https://github.com/bpbond/cosore

Science Score: 33.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
    Links to: wiley.com, nature.com
  • Committers with academic emails
    2 of 3 committers (66.7%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.8%) to scientific vocabulary

Keywords

carbon-cycle earth-sciences science soil-respiration
Last synced: 6 months ago · JSON representation

Repository

Data, metadata, and software tools for the COSORE database of continuous soil respiration measurements

Basic Info
  • Host: GitHub
  • Owner: bpbond
  • License: cc-by-4.0
  • Language: R
  • Default Branch: master
  • Homepage:
  • Size: 570 MB
Statistics
  • Stars: 33
  • Watchers: 5
  • Forks: 10
  • Open Issues: 43
  • Releases: 0
Topics
carbon-cycle earth-sciences science soil-respiration
Created almost 7 years ago · Last pushed over 4 years ago
Metadata Files
Readme License Code of conduct

README.md

Build Status

codecov

cosore

A first data analysis using COSORE is published!

The Global Change Biology paper is published!

The cosore package consists of data, metadata, and software tools for COSORE, a reproducibility-oriented community database for continuous soil respiration data.

To use the database from within R, install this cosore package by for example devtools::install_github("bpbond/cosore").

To download the COSORE database in a flat-file format, i.e. accessible by any data analysis tool, click on the Releases tab above.

A step-by-step guide to using COSORE is available here.

To contribute to the database, fill out the metadata form.

Principles and general information

Only free use data (CC BY 4) accepted. We request that users cite the database definition paper, and strongly encourage them to (i) cite all dataset primary publications, and (ii) involve data contributors as co-authors when possible.

The package, and the process of contributing and accessing data, should be as focused and simple as possible (but no simpler).

All data contributors will be included on an introductory database paper planned for spring 2020.

COSORE is not designed to be, and should not be treated as, a permanent data repository. It is a community database, but not an institutionally-backed repository like Figshare, DataONE, ESS-DIVE, etc. We recommend (but not require) depositing your data in one of these first, and providing its DOI in your COSORE dataset metadata.

Database design

This database is comprised of a collection of datasets, each converted to a standard format and units. A dataset is one or more files of continuous (automated) soil respiration data, with accompanying metadata, with all measurements taken at a single site and with constant treatment assignments (i.e. they may vary between chambers but not over time).

COSORE is designed to be a relatively lightweight database, and metadata are kept to a minimum. Each dataset has seven tables:

  • description - includes data on the site name; location; timezone name and IGBP cover type; measurement instrument; publication and data links; and acknowledgments and notes.
  • contributors - contributor information, including name, email, ORCID, and CRediT role.
  • ports - continuous systems typically, but not always, are comprised of a single analyzer plumbed to multiple chambers through a multiplexer. This table lists, for each multiplexer port, measurement variable (typically Rs, Rh, or NEE); treatment; species, and chamber/collar details.
  • columns - describes the mapping between the raw dataset fields and standardized COSORE fields; used during the import of raw (contributed) data.
  • ancillary - arbitrary ancillary data: stand structure, carbon cycle, disturbance, etc. [All optional.]
  • data - the actual chamber respiration data, with many possible fields including the required ones: beginning and end timestamps, flux rate, and port number. May also include meteorological and soil data, flux fit diagnostics, error codes, etc.
  • diagnostics - this is generated by the data import process, and summarizes records that were dropped, problems found, etc.

Operation

Four primary functions are available for R users: * csr_database() returns a summary data frame about the entire database (all constituent datasets) * csr_dataset() returns a single dataset, as a list of data frames * csr_table() returns a single table, across one or many datasets * csr_metadata() returns a metadata table describing all fields in dataset tables

Reports can be generated for the overall database (csr_report_database()) and each individual dataset (csr_report_dataset()). There are a number of developer functions as well, i.e. not intended for the average COSORE user. Perhaps most importantly this includes csr_build(), which scans for and parses metadata on all installed datasets, then loads the data, parsing raw data as necessary and available.

Data access

R users will find it easiest to install this package and then use the functions above. Anyone can also download flat (csv) files from the Releases page.

Data priorities

  • Structured/standardized continuous IRGA data
  • Raw LI-8100A data
  • Unusual or long-term survey (i.e. not continuous) measurements

Owner

  • Name: Ben Bond-Lamberty
  • Login: bpbond
  • Kind: user
  • Location: College Park, MD, USA
  • Company: PNNL/UMD

Forest ecologist & carbon cycle scientist at @PNNLab , open data & R proponent, mentor, editor. I know how to do a blind hem stitch. He/him/his.

GitHub Events

Total
  • Issues event: 2
  • Watch event: 2
  • Issue comment event: 1
Last Year
  • Issues event: 2
  • Watch event: 2
  • Issue comment event: 1

Committers

Last synced: 10 months ago

All Time
  • Total Commits: 780
  • Total Committers: 3
  • Avg Commits per committer: 260.0
  • Development Distribution Score (DDS): 0.018
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Ben Bond-Lamberty b****y@p****v 766
Stephanie Pennington s****1@t****u 13
jinshijian j****4@g****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 193
  • Total pull requests: 56
  • Average time to close issues: about 1 month
  • Average time to close pull requests: 1 day
  • Total issue authors: 9
  • Total pull request authors: 3
  • Average comments per issue: 1.05
  • Average comments per pull request: 1.41
  • Merged pull requests: 54
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 3
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 0.33
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • bpbond (182)
  • stephpenn1 (4)
  • AlexisRenchon (1)
  • jinshijian (1)
  • kendalynnm (1)
  • arthur-e (1)
  • hrdawson (1)
  • BernhardAhrens (1)
  • MWKopp (1)
Pull Request Authors
  • bpbond (50)
  • stephpenn1 (5)
  • jinshijian (1)
Top Labels
Issue Labels
data (62) bug (6) two-stage (2) enhancement (1) question (1)
Pull Request Labels

Dependencies

DESCRIPTION cran
  • R >= 3.5.0 depends
  • knitr * imports
  • lubridate * imports
  • rmarkdown * imports
  • tibble >= 2.1 imports
  • ggplot2 * suggests
  • kableExtra * suggests
  • leaflet * suggests
  • rcrossref * suggests
  • sp * suggests
  • testthat * suggests