cosore

Data, metadata, and software tools for the COSORE database of continuous soil respiration measurements

https://github.com/bpbond/cosore

Science Score: 33.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
○
codemeta.json file
○
.zenodo.json file
✓
DOI references
Found 1 DOI reference(s) in README
✓
Academic publication links
Links to: wiley.com, nature.com
✓
Committers with academic emails
2 of 3 committers (66.7%) from academic institutions
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (15.8%) to scientific vocabulary

Keywords

carbon-cycle earth-sciences science soil-respiration

Last synced: 6 months ago · JSON representation

Repository

Data, metadata, and software tools for the COSORE database of continuous soil respiration measurements

Basic Info

Host: GitHub
Owner: bpbond
License: cc-by-4.0
Language: R
Default Branch: master
Homepage:
Size: 570 MB

Statistics

Stars: 33
Watchers: 5
Forks: 10
Open Issues: 43
Releases: 0

Topics

carbon-cycle earth-sciences science soil-respiration

Created almost 7 years ago · Last pushed over 4 years ago

Metadata Files

Readme License Code of conduct

cosore

A first data analysis using COSORE is published!

The Global Change Biology paper is published!

The cosore package consists of data, metadata, and software tools for COSORE, a reproducibility-oriented community database for continuous soil respiration data.

To use the database from within R, install this cosore package by for example devtools::install_github("bpbond/cosore").

To download the COSORE database in a flat-file format, i.e. accessible by any data analysis tool, click on the Releases tab above.

A step-by-step guide to using COSORE is available here.

To contribute to the database, fill out the metadata form.

Principles and general information

Only free use data (CC BY 4) accepted. We request that users cite the database definition paper, and strongly encourage them to (i) cite all dataset primary publications, and (ii) involve data contributors as co-authors when possible.

The package, and the process of contributing and accessing data, should be as focused and simple as possible (but no simpler).

All data contributors will be included on an introductory database paper planned for spring 2020.

COSORE is not designed to be, and should not be treated as, a permanent data repository. It is a community database, but not an institutionally-backed repository like Figshare, DataONE, ESS-DIVE, etc. We recommend (but not require) depositing your data in one of these first, and providing its DOI in your COSORE dataset metadata.

Database design

This database is comprised of a collection of datasets, each converted to a standard format and units. A dataset is one or more files of continuous (automated) soil respiration data, with accompanying metadata, with all measurements taken at a single site and with constant treatment assignments (i.e. they may vary between chambers but not over time).

COSORE is designed to be a relatively lightweight database, and metadata are kept to a minimum. Each dataset has seven tables:

description - includes data on the site name; location; timezone name and IGBP cover type; measurement instrument; publication and data links; and acknowledgments and notes.
contributors - contributor information, including name, email, ORCID, and CRediT role.
ports - continuous systems typically, but not always, are comprised of a single analyzer plumbed to multiple chambers through a multiplexer. This table lists, for each multiplexer port, measurement variable (typically Rs, Rh, or NEE); treatment; species, and chamber/collar details.
columns - describes the mapping between the raw dataset fields and standardized COSORE fields; used during the import of raw (contributed) data.
ancillary - arbitrary ancillary data: stand structure, carbon cycle, disturbance, etc. [All optional.]
data - the actual chamber respiration data, with many possible fields including the required ones: beginning and end timestamps, flux rate, and port number. May also include meteorological and soil data, flux fit diagnostics, error codes, etc.
diagnostics - this is generated by the data import process, and summarizes records that were dropped, problems found, etc.

Operation

Four primary functions are available for R users: * csr_database() returns a summary data frame about the entire database (all constituent datasets) * csr_dataset() returns a single dataset, as a list of data frames * csr_table() returns a single table, across one or many datasets * csr_metadata() returns a metadata table describing all fields in dataset tables

Reports can be generated for the overall database (csr_report_database()) and each individual dataset (csr_report_dataset()). There are a number of developer functions as well, i.e. not intended for the average COSORE user. Perhaps most importantly this includes csr_build(), which scans for and parses metadata on all installed datasets, then loads the data, parsing raw data as necessary and available.

Data access

R users will find it easiest to install this package and then use the functions above. Anyone can also download flat (csv) files from the Releases page.

Data priorities

Structured/standardized continuous IRGA data
Raw LI-8100A data
Unusual or long-term survey (i.e. not continuous) measurements

Owner

Name: Ben Bond-Lamberty
Login: bpbond
Kind: user
Location: College Park, MD, USA
Company: PNNL/UMD

Twitter: benbondlamberty
Repositories: 33
Profile: https://github.com/bpbond

Forest ecologist & carbon cycle scientist at @PNNLab , open data & R proponent, mentor, editor. I know how to do a blind hem stitch. He/him/his.

GitHub Events

Total

Issues event: 2
Watch event: 2
Issue comment event: 1

Last Year

Issues event: 2
Watch event: 2
Issue comment event: 1

Committers

Last synced: 10 months ago

All Time

Total Commits: 780
Total Committers: 3
Avg Commits per committer: 260.0
Development Distribution Score (DDS): 0.018

Past Year

Commits: 0
Committers: 0
Avg Commits per committer: 0.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
Ben Bond-Lamberty	b**y@p**v	766
Stephanie Pennington	s**1@t**u	13
jinshijian	j**4@g**m	1

Committer Domains (Top 20 + Academic)

terpmail.umd.edu: 1 pnnl.gov: 1

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 193
Total pull requests: 56
Average time to close issues: about 1 month
Average time to close pull requests: 1 day
Total issue authors: 9
Total pull request authors: 3
Average comments per issue: 1.05
Average comments per pull request: 1.41
Merged pull requests: 54
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 3
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 1
Pull request authors: 0
Average comments per issue: 0.33
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

bpbond (182)
stephpenn1 (4)
AlexisRenchon (1)
jinshijian (1)
kendalynnm (1)
arthur-e (1)
hrdawson (1)
BernhardAhrens (1)
MWKopp (1)

Pull Request Authors

bpbond (50)
stephpenn1 (5)
jinshijian (1)

Top Labels

Issue Labels

data (62) bug (6) two-stage (2) enhancement (1) question (1)

Pull Request Labels

Dependencies

DESCRIPTION cran

R >= 3.5.0 depends
knitr * imports
lubridate * imports
rmarkdown * imports
tibble >= 2.1 imports
ggplot2 * suggests
kableExtra * suggests
leaflet * suggests
rcrossref * suggests
sp * suggests
testthat * suggests

cosore

Science Score: 33.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

cosore

Principles and general information

Database design

Operation

Data access

Data priorities

Owner

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Committer Domains (Top 20 + Academic)

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Dependencies