htmldate
htmldate: A Python package to extract publication dates from web pages - Published in JOSS (2020)
Generating CodeMeta Metadata for R Packages
Generating CodeMeta Metadata for R Packages - Published in JOSS (2017)
cffr
cffr: Generate Citation File Format Metadata for R Packages - Published in JOSS (2021)
cfdm
cfdm: A Python reference implementation of the CF data model - Published in JOSS (2020)
Metasyn
Metasyn: Transparent Generation of Synthetic Tabular Data with Privacy Guarantees - Published in JOSS (2025)
ck
Collective Knowledge (CK), Collective Mind (CM/CMX) and MLPerf automations: community-driven projects to facilitate collaborative and reproducible research and to learn how to run AI, ML, and other emerging workloads more efficiently and cost-effectively across diverse models, datasets, software, and hardware using MLPerf methodology and benchmarks
widoco
Wizard for documenting ontologies. WIDOCO is a step-by-step generator of HTML templates with the documentation of your ontology. It uses the LODE environment to create part of the template.
micrometaapp-react
This is the React implementation of Micro-Meta App, Microscopy Metadata for the real world!
micrometaapp-electron
Electron wrapped, stand-alone version of Micro-Meta App, microscopy metadata for the real world!
micrometaapp-omero
This is a limited-functionality beta implementation of the integration of Micro-Meta App in the OMERO.web client
oemetadata
The OEMetadata is an energy metadata standard including a metadata schema, templates, and examples.
qa-catalogue
QA Catalogue – a metadata quality assessment tool for library catalogue records (MARC, PICA, UNIMARC)
proteomics-sample-metadata
Proteomics sample metadata: a standard for experimental design annotation in proteomics datasets
odis-arch
Development of the Ocean Data and Information System (ODIS) architecture
core-geonetwork
GeoNetwork is a catalog application to manage spatially referenced resources. It provides powerful metadata editing and search functions as well as an interactive web map viewer. It is currently used in numerous Spatial Data Infrastructure initiatives across the world.
nbomicroscopymetadataspecs
The 4DN + BINA Tiered System of Microscopy Metadata Guidelines was developed by the 4DN Imaging Standards Working Group in collaboration with the BINA Data Management and Quality Control Working Group (https://www.bioimagingna.org/qc-dm-wg) for documenting microscopy experiments and assessing the quality of image data.
zowie
Adds Zotero "select" links to attachment files in a Zotero database on macOS, so that outside of Zotero, you can find the bibliographic entry to which a file belongs. (Only works for local storage, not linked attachments.)
4dnmetadataschemaxsd2jsonconverter
This is a converter written in Java that translates an XSD microscopy metadata schema into JSON
canada-labour-research-assistant
The Canada Labour Research Assistant (CLaRA) is a privacy-first LLM-powered RAG AI assistant proposing Easily Verifiable Direct Quotations (EVDQ) to mitigate hallucinations in answering questions about Canadian labour laws, standards, and regulations. It works entirely offline and locally, guaranteeing the confidentiality of your conversations.
py3data
A flexible and lightweight Python interface to the re3data.org database
brep-part-finder
A Python package to identify the part ID number in Brep format CAD files
BiocPkgTools
Computable build reports, package metadata, and download stats from the Bioconductor project
scif
scientific filesystem: a filesystem organization for scientific software and metadata
csv-metadata-quality
A simple but opinionated metadata quality checker and fixer designed to work with CSVs in the DSpace ecosystem
obofoundry.github.io
Metadata and website for the Open Bio Ontologies Foundry Ontology Registry
guano
Python reference implementation of the GUANO bat acoustics metadata specification
PHES-ODM
The Public Health Environmental Surveillance Open Data Model (PHES-ODM, or ODM). A data model, dictionary and support tools for environmental surveillance.
https://github.com/biopragmatics/biolookup
🔍 The Biolookup Service retrieves metadata and ontological information about biomedical entities.
NPSdataverse
NPSdataverse: a suite of R packages for data processing, authoring Ecological Metadata Language metadata, checking data-metadata congruence, and accessing data - Published in JOSS (2025)
https://github.com/annakrystalli/dataspice-tutorial
Tutorial in practical research data management: creating simple metadata files using dataspice
https://github.com/antarctica/metadata-library
Python library for generating ISO 19115 metadata records.
https://github.com/centrefordigitalhumanities/edpop
A virtual research environment (VRE) that lets you collect, align and annotate bibliographical and biographical records from several online catalogs.
https://github.com/cdcgov/cdh-lava-react
CDC Data Hub Lifecycle, Analysis & Visualization Accelerator (LAVA) REACT Components based on machine readable requirements.
metafinder
Search for documents in a domain through search engines (Google, Bing and Baidu). The objective is to extract metadata.
https://github.com/annakrystalli/emodnet-dataspice-tutorial
An Introduction to using R package Dataspice to collect metadata for EMODnet Biology Data Products
https://github.com/catalyst-cooperative/pudl-zenodo-storage
Tools for creating versioned archives of raw data on Zenodo using Frictionless data packages.
https://github.com/cdli-gh/data
This is a copy of the daily dump of catalogue and ATF data from the Cuneiform Digital Library Initiative (http://cdli.ucla.edu)
datensatzbeschreibung_fdz_gesundheit
This repository contains a preliminary dataset description that documents the structure of the data that will become accessible through the FDZ Gesundheit.
https://github.com/brain-development-and-disorders-lab/mars
Metadatify is an open-source web-based tool to create, manage, and search scientific metadata.
php-codemeta-crosswalk
Codemeta conversions - https://doi.org/10.5281/zenodo.12808884
https://github.com/cityjson/cityjson-qgis-plugin
A QGIS plugin that adds support for CityJSON files
https://github.com/dataesr/works-magnet
Works-magnet: Retrieve and promote the scholarly works of your institution.
mapmetadata
An R package to help researchers use publicly available health metadata to map variables onto research domains
dirschema
Spec and validator for directories, files and metadata based on JSON Schema and regexes.
zoinks
Command-line utility to return Zotero record field values given a Zotero select link, an item key, or even just a file attachment
3dcap-md-gen
Scripts for exporting scanning metadata as described in the publication "Metadata Schema and Ontology for Archaeological Object Documentation including 3D Imaging (AOD-3DI)"
framework
The main project containing the core C++ classes defining framework behaviour and primordial analysis and helper tools. It centralises all other rest-for-physics repositories through submodules.
https://github.com/cidgoh/grdi_amr_one_health
A data specification for harmonizing One Health AMR pathogen genomics contextual data. The specification provides standardized (ontology-based) fields and terms which are implemented via a spreadsheet collection template, supported by field and reference guides as well as different curation and new term request SOPs.
crowdtangle-api-schema
Simple ontological representation of the CrowdTangle API to be used in a Tropy template
imageomics-guide
Imageomics-focused guide to collaborative work, including GitHub and Hugging Face workflows.
https://github.com/bigbio/sdrf-cellline-metadata-db
SDRF Cell Line Metadata Database
https://github.com/dbetchkal/metadig
Use NSNSD site-based metadata to create iyore-style filters for data analysis
same-version
Automatically ensures your software version metadata is consistent across key project files.
https://github.com/colectica/ddiregistry
Registry software for the DDI Alliance Agency Registry
sum-buddy
Generate and save checksums for all (or certain) contents of a given directory.
aiu
A library for interacting with web archive collections at Archive-It, Trove, Pandora, and more.
https://github.com/afuetterer/oaipmh-scythe
A Scythe for harvesting OAI-PMH repositories.
https://github.com/aidotse/multimodal-skin-lesion-classification
Multimodality for skin lesion classification