dwc-mapping

A document explaining how we will map Neotoma against the DarwinCore Schema

https://github.com/neotomadb/dwc-mapping

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.2%) to scientific vocabulary
Last synced: 8 months ago · JSON representation ·

Repository

A document explaining how we will map Neotoma against the DarwinCore Schema

Basic Info
  • Host: GitHub
  • Owner: NeotomaDB
  • License: mit
  • Language: HTML
  • Default Branch: master
  • Size: 8.91 MB
Statistics
  • Stars: 0
  • Watchers: 5
  • Forks: 0
  • Open Issues: 7
  • Releases: 0
Created almost 10 years ago · Last pushed about 1 year ago
Metadata Files
Readme License Code of conduct Citation

README.md

lifecycle

Mapping Neotoma against the DarwinCore Schema

Cyber4Paleo Development Workshop Logo

This repository tracks the development of efforts to map Neotoma dataset records against the DarwinCore schema to facilitate greater data discovery, reuse and sustainability of records archived within the Neotoma Paleoecological Database. This project is part of the EarthCube Integrative Activities proposal between Neotoma and the Paleobiological Database, and is one step along the path to upload Neotoma records to BISON and GBIF.

Initial work on this project was made possible through collaboration as part of the Cyber4Paleo Community Development Workshop in Boulder, CO, July, 2016. Much of this work is archived as part of the Cyber4Paleo GitHub organization and GitHub pages.

This work is carried out by the Earthlife Consortium, funded by NSF through the EarthCube initiative.

Contributors

We welcome contributions from any individual, whether code, documentation, or issue tracking. All participants are expected to follow the code of conduct for this project.

Description

Mapping the Neotoma Database structure onto DarwinCore standards is relatively complex. While some of the data structure maps easily, the content of the database, and the conceptual structure of the paleoecological records is not consistently equivalent to the semantic structure of the DarwinCore schema. The Rmd has some simple relationships described in the markdown portion of the document, based on a cross-walk started by Michael McClennan, and extended by Jack Williams and Mark Uhen at the Cyber4Paleo Community Development Workshop. Simon Goring developed the Rmd and implemented the actual conversion of the database structure to the csv file output.

How to Use this Repository

The database itself is available as a SQL Server snapshot from the Neotoma Paleoecological Database's website here, or on figshare.org at the Neotoma Database Snapshot project.

With the snapshot loaded into your local server, replace the connection string in functionalized_run.R (around line 27) and the code should "just run", provided you have the required packages. In this case you need libraries RODBC, neotoma, dplyr and tidyr.

Key TODOs

  • If there are missing fields, or poorly coded fields, feel free to provide suggestions.
  • If there are efficiencies in coding, feel free to provide them
  • If you feel documentation is incomplete, feel free to suggest imrpovements
  • I'd (ideally)like to improve the Rmd so that it is, in some sense, publishable as a data/methods paper. We welcome contribution that would assist in this effort. If you feel like you would be able to contribute significantly enough to be considered an author please contact us first.

Support

This work is supported through the National Science Foundation's EarthCube Initiative through NSF Award Numbers 1541002 and 1340301.

Owner

  • Name: The Neotoma Paleoecology Database Collective
  • Login: NeotomaDB
  • Kind: organization
  • Location: Global

Data and code supporting collaboration and outreach around the Neotoma Paleoecology Database

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Goring"
  given-names: "Simon"
  orcid: "https://orcid.org/0000-0002-2700-4605"
- family-names: "Williams"
  given-names: "Jack"
- family-names: "Uhen"
  given-names: "Mark"
- family-names: "McClennan"
  given-names: "Michael"
- family-names: "Wieczorek"
  given-names: "John"
title: "Neotoma-DarwinCore Crosswalk"
version: 0.1.0
date-released: 2023-11-22
url: "https://github.com/NeotomaDB/DwC-Mapping"

GitHub Events

Total
  • Delete event: 1
  • Issue comment event: 1
  • Push event: 1
  • Pull request event: 6
  • Create event: 3
Last Year
  • Delete event: 1
  • Issue comment event: 1
  • Push event: 1
  • Pull request event: 6
  • Create event: 3

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 39
  • Total Committers: 2
  • Avg Commits per committer: 19.5
  • Development Distribution Score (DDS): 0.077
Past Year
  • Commits: 3
  • Committers: 1
  • Avg Commits per committer: 3.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Simon s****g@g****m 36
Erik Zepeda E****9@g****m 3

Issues and Pull Requests

Last synced: 9 months ago