https://github.com/jcmcnch/global-rrna-univeral-metabarcoding-of-plankton

A central repository for the GRUMP (Global rRNA Universal Metabarcoding of Plankton) dataset, containing scripts for bioinformatic processing and retrieval of associated physical / geochemical measurements from publicly available data.

https://github.com/jcmcnch/global-rrna-univeral-metabarcoding-of-plankton

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: nature.com
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.3%) to scientific vocabulary
Last synced: 6 months ago · JSON representation

Repository

A central repository for the GRUMP (Global rRNA Universal Metabarcoding of Plankton) dataset, containing scripts for bioinformatic processing and retrieval of associated physical / geochemical measurements from publicly available data.

Basic Info
  • Host: GitHub
  • Owner: jcmcnch
  • License: gpl-3.0
  • Language: Shell
  • Default Branch: main
  • Homepage:
  • Size: 17.3 MB
Statistics
  • Stars: 6
  • Watchers: 2
  • Forks: 2
  • Open Issues: 0
  • Releases: 0
Created over 3 years ago · Last pushed 11 months ago
Metadata Files
Readme License

README.md

Code Repository for Global rRNA Universal Metabarcoding of Plankton (GRUMP) Project

This repository contains code to reproduce metadata curation and bioinformatic analysis steps used to generate data for the GRUMP project.

Metabarcoding data:

These data will be available in 3 locations and formats:

  • Processed, merged ASV tables (containing all 16S and 18S amplicon sequences, corrected for observed biases against 18S, with the option to use uncorrected data) is now available on the Simons Collaborative Marine Atlas Project (Simons CMAP) website. In this "long" format, each observation of an ASV will be indexed by latitude, longitude, depth, and time. It will also contain some associated metadata where available from collaborators. If you are not sure which data to use, this is probably your best bet, and can be accessed here: https://simonscmap.com/catalog/datasets/GRUMP. It is, however, a highly redundant data format compared to a traditional ASV table. More information on the dataset is available in the Data Descriptor paper: https://www.nature.com/articles/s41597-025-05423-9
  • Intermediate files from qiime2 and other parts of the pipeline will be stored on the Open Science Foundation at https://osf.io/57dpa/ This repository will be useful for people who wish to delve into the data more deeply. For example, if you wanted to look only at a subset of a given dataset (e.g. only 18S/16S/chloroplast/Archaea/Bacteria sequences) those subsetted ASV tables can be found there. Or perhaps you want to plot some part of this dataset in phyloseq. In addition, you can find denoising statistics, biom tables, ASV sequences for BLAST comparisons, etc.
  • Raw demultiplexed data have been uploaded to SRA. These files contain primers so they need to be trimmed prior to reanalysis. These files would be only needed if you wanted to completely redo any aspects of our analysis from scratch. Accessions for these files are available in the above preprint.

Associated measurements:

  • Covariates measured alongside the metabarcoding samples will be available in this repository. Since each cruise section will have different data formats and covariates available, we will organize metadata into the subfolders of this repository. The repository contains a record of the intermediate processing steps needed to generate the metadata, as well as links to other relevant repositories.
  • The easiest way to access metadata is to follow the links provided in the README files in the subfolders.

Owner

  • Name: Jesse McNichol
  • Login: jcmcnch
  • Kind: user
  • Company: University of Southern California

I am currently a postdoctoral scholar in Jed Fuhrman's lab at USC. Please see my website/CV for more information about my interests and experience.

GitHub Events

Total
  • Watch event: 3
  • Push event: 1
Last Year
  • Watch event: 3
  • Push event: 1