htmldate
htmldate: A Python package to extract publication dates from web pages - Published in JOSS (2020)
Generating CodeMeta Metadata for R Packages
Generating CodeMeta Metadata for R Packages - Published in JOSS (2017)
cffr
cffr: Generate Citation File Format Metadata for R Packages - Published in JOSS (2021)
cfdm
cfdm: A Python reference implementation of the CF data model - Published in JOSS (2020)
Metasyn
Metasyn: Transparent Generation of Synthetic Tabular Data with Privacy Guarantees - Published in JOSS (2025)
ck
Collective Knowledge (CK), Collective Mind (CM/CMX) and MLPerf automations: community-driven projects to facilitate collaborative and reproducible research and to learn how to run AI, ML, and other emerging workloads more efficiently and cost-effectively across diverse models, datasets, software, and hardware using MLPerf methodology and benchmarks
widoco
Wizard for documenting ontologies. WIDOCO is a step by step generator of HTML templates with the documentation of your ontology. It uses the LODE environment to create part of the template.
micrometaapp-react
This is the React implementation of Micro-Meta App, Microscopy Metadata for the real world!
micrometaapp-electron
Electron wrapped, stand-alone version of Micro-Meta App, microscopy metadata for the real world!
micrometaapp-omero
This is a limited-functionality, beta - implementation of the integration of Micro-Meta App in the OMERO.web client
oemetadata
The OEMetadata is an energy metadata standard including a metadata schema, templates, and examples.
qa-catalogue
QA Catalogue – a metadata quality assessment tool for library catalogue records (MARC, PICA, UNIMARC)
proteomics-sample-metadata
The Proteomics sample metadata: Standard for experimental design annotation in proteomics datasets
odis-arch
Development of the Ocean Data and Information System (ODIS) architecture
core-geonetwork
GeoNetwork is a catalog application to manage spatially referenced resources. It provides powerful metadata editing and search functions as well as an interactive web map viewer. It is currently used in numerous Spatial Data Infrastructure initiatives across the world.
nbomicroscopymetadataspecs
The 4DN + BINA Tiered System of Microscopy Metadata Guidelines was developed by the 4DN Imaging Standards Working Group in collaboration with the BINA Data Management and Quality Control Working Group (https://www.bioimagingna.org/qc-dm-wg) for documenting microscopy experiments and assessing the quality of image data.
zowie
Adds Zotero "select" links to attachment files in a Zotero database on macOS, so that outside of Zotero, you can find the bibliographic entry to which a file belongs. (Only works for local storage, not linked attachments.)
4dnmetadataschemaxsd2jsonconverter
This is a converter written in Java that translates an XSD microscopy metadata schema into JSON
canada-labour-research-assistant
The Canada Labour Research Assistant (CLaRA) is a privacy-first LLM-powered RAG AI assistant proposing Easily Verifiable Direct Quotations (EVDQ) to mitigate hallucinations in answering questions about Canadian labour laws, standards, and regulations. It works entirely offline and locally, guaranteeing the confidentiality of your conversations.
py3data
A flexible and lightweight Python interface to the re3data.org database
brep-part-finder
A Python package to identify the part ID number in Brep format CAD files
BiocPkgTools
Computable build reports, package metadata, and download stats from the Bioconductor project
scif
scientific filesystem: a filesystem organization for scientific software and metadata
csv-metadata-quality
A simple but opinionated metadata quality checker and fixer designed to work with CSVs in the DSpace ecosystem
obofoundry.github.io
Metadata and website for the Open Bio Ontologies Foundry Ontology Registry
guano
Python reference implementation of the GUANO bat acoustics metadata specification
PHES-ODM
The Public Health Environmental Surveillance Open Data Model (PHES-ODM, or ODM). A data model, dictionary and support tools for environmental surveillance.
https://github.com/biopragmatics/biolookup
🔍 The Biolookup Service retrieves metadata and ontological information about biomedical entities.
NPSdataverse
NPSdataverse: a suite of R packages for data processing, authoring Ecological Metadata Language metadata, checking data-metadata congruence, and accessing data - Published in JOSS (2025)
https://github.com/annakrystalli/dataspice-tutorial
Tutorial in practical research data management: creating simple metadata files using dataspice
https://github.com/antarctica/metadata-library
Python library for generating ISO 19115 metadata records.
https://github.com/centrefordigitalhumanities/edpop
A virtual research environment (VRE) that lets you collect, align and annotate bibliographical and biographical records from several online catalogs.
https://github.com/cdcgov/cdh-lava-react
CDC Data Hub Lifecycle, Analysis & Visualization Accelerator (LAVA) REACT Components based on machine readable requirements.
metafinder
Search for documents in a domain through Search Engines (Google, Bing and Baidu). The objective is to extract metadata
https://github.com/annakrystalli/emodnet-dataspice-tutorial
An Introduction to using R package Dataspice to collect metadata for EMODnet Biology Data Products
https://github.com/catalyst-cooperative/pudl-zenodo-storage
Tools for creating versioned archives of raw data on Zenodo using Frictionless data packages.
https://github.com/cdli-gh/data
This is a copy of the daily dump of catalogue and ATF data from the Cuneiform Digital Library Initiative (http://cdli.ucla.edu)
https://github.com/aidotse/multimodal-skin-lesion-classification
Mutlimodality for skin lesions classification
same-version
Automatically ensures your software version metadata is consistent across key project files.
openminds_matlab
A MATLAB toolbox for generating openMINDS-compliant linked metadata.
https://github.com/cidgoh/grdi_amr_one_health
A data specification for harmonizing One Health AMR pathogen genomics contextual data. The specification provides standardized (ontology-based) fields and terms which are implemented via a spreadsheet collection template, supported by field and reference guides as well as different curation and new term request SOPs.
https://github.com/colectica/ddiregistry
Registry software for the DDI Alliance Agency Registry
cd2-survey
Code and processed data from the Connecting Data in Child Development (CD2) Vocabularies survey
copo-documentation
COPO documentation was created using the Sphinx reStructuredText (reST) markup language which is hosted on readthedocs.io. The documentation uses Sphinx.
https://github.com/afuetterer/oaipmh-scythe
A Scythe for harvesting OAI-PMH repositories.
https://github.com/dataesr/works-magnet
Works-magnet: Retrieve and promote the scholarly works of your institution.
https://github.com/bigbio/sdrf-cellline-metadata-db
SDRF Cell Line Metadata Database
https://github.com/cldf/cldf
CLDF: Cross-Linguistic Data Formats - the specification
dataverseur
🫖 A dataverse API R wrapper to enhance the deposit procedure using only R variable declarations
hermes_metadata_lesson
Data and Metadata in the Humanities: Structures, Schemas, and Standards
php-codemeta-crosswalk
Codemeta conversions - https://doi.org/10.5281/zenodo.12808884
https://github.com/cured-plus/csvw-duckdb
Convert a CSVW document (CSV metadata) to a DuckDB query to load a CSV file into a database.
omega
Open Microscopy Environment inteGrated Analysis (OMEGA) is a cross-platform data management, analysis and visualization system, for particle tracking data with particular emphasis on results from viral and vesicular trafficking experiments.
https://github.com/cityjson/cityjson-qgis-plugin
A QGIS plugin that adds support for CityJSON files