https://github.com/bge-barcoding/bge-skimming-analytics
Genome skimming assembly and validation analytics for the BGE project
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file: found codemeta.json file
- ✓ .zenodo.json file: found .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (6.8%) to scientific vocabulary
Last synced: 9 months ago
Repository
Genome skimming assembly and validation analytics for the BGE project
Basic Info
- Host: GitHub
- Owner: bge-barcoding
- License: MIT
- Language: Perl
- Default Branch: main
- Size: 18.9 MB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Created 12 months ago · Last pushed 9 months ago
Metadata Files
Readme
License
README.md
bge-skimming-analytics
Genome skimming assembly and validation analytics for the BGE project. The roadmap for this activity is as follows:
- The Naturalis and NHM teams collect the TSV files produced by the barcode validator in this repo.
- We define the headings of the TSV files in a format compatible with Frictionless Data. This means writing a JSON file that follows the syntax of this example, which is what BOLD uses for their TSV dumps. Our headings are different, but nearly all of their definitions are already in place thanks to this table by Dan Parsons.
- We combine the TSVs into one large table and compute its MD5 checksum, which is recorded in the JSON. The result is a Frictionless Data package that can be imported into R to run statistics on the genome skimming, e.g. for BGE deliverable reporting.
- We then combine JSON and TSV into an RO-Crate. For this we follow the profile that Eli Chadwick has been working on.
- We upload the RO-Crate to Zenodo and mint a DOI for it. We now have a state-of-the-art FAIR data package. Bonus points for linking it to the DOI of a data set on BOLD (a data set is just a container of process IDs with some descriptive text). This way, the analytics data is linked to the published data, including specimen photos, collection localities, etc.
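The combine, checksum, and describe steps of the roadmap can be sketched in a few lines of Python. This is a minimal illustration rather than the project's actual tooling: the function names, the resource name `combined`, the output file names, and the blanket `string` field type are all assumptions, and the real field definitions would come from the agreed headings. The descriptor layout follows the Frictionless Data Package convention in which a resource's `hash` is its MD5 hex digest.

```python
import hashlib
import json
from pathlib import Path


def combine_tsvs(tsv_paths, combined_path):
    """Concatenate TSV files that share one header line; return the field names."""
    if not tsv_paths:
        raise ValueError("no TSV files given")
    header = None
    with open(combined_path, "w", encoding="utf-8", newline="\n") as out:
        for path in tsv_paths:
            with open(path, encoding="utf-8") as src:
                file_header = src.readline().rstrip("\n")
                if header is None:
                    header = file_header
                    out.write(header + "\n")
                elif file_header != header:
                    raise ValueError(f"{path}: header differs from the first file")
                for line in src:
                    row = line.rstrip("\n")
                    if row:  # skip completely empty lines
                        out.write(row + "\n")
    return header.split("\t")


def md5_hex(path):
    """Return the MD5 hex digest of a file, read in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_descriptor(combined_path, field_names):
    """Build a data package descriptor for the combined table."""
    combined_path = Path(combined_path)
    return {
        "name": "bge-skimming-analytics",  # assumed package name
        "resources": [
            {
                "name": "combined",  # hypothetical resource name
                "path": combined_path.name,
                "format": "tsv",
                "mediatype": "text/tab-separated-values",
                "dialect": {"delimiter": "\t"},
                "bytes": combined_path.stat().st_size,
                "hash": md5_hex(combined_path),
                # Placeholder types: every column is declared as a string here.
                "schema": {
                    "fields": [{"name": n, "type": "string"} for n in field_names]
                },
            }
        ],
    }


def write_package(tsv_paths, out_dir):
    """Write combined.tsv and datapackage.json into out_dir; return the descriptor."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    combined = out_dir / "combined.tsv"
    fields = combine_tsvs(tsv_paths, combined)
    descriptor = build_descriptor(combined, fields)
    (out_dir / "datapackage.json").write_text(
        json.dumps(descriptor, indent=2), encoding="utf-8"
    )
    return descriptor
```

Calling `write_package(["naturalis.tsv", "nhm.tsv"], "package")` (both input names are hypothetical) would leave `combined.tsv` and `datapackage.json` side by side in `package/`. The sketch rejects inputs whose header lines differ instead of trying to reconcile them, so that mismatched headings are noticed before the checksum is recorded.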
Owner
- Name: BGE barcoding
- Login: bge-barcoding
- Kind: organization
- Website: https://biodiversitygenomics.eu/
- Twitter: BioGenEurope
- Repositories: 1
- Profile: https://github.com/bge-barcoding
Biodiversity Genomics Europe (BGE) - (meta)barcoding software artifacts
GitHub Events
Total
- Push event: 6
- Create event: 2
Last Year
- Push event: 6
- Create event: 2