htmldate
htmldate: A Python package to extract publication dates from web pages - Published in JOSS (2020)
Generating CodeMeta Metadata for R Packages
Generating CodeMeta Metadata for R Packages - Published in JOSS (2017)
cffr
cffr: Generate Citation File Format Metadata for R Packages - Published in JOSS (2021)
cfdm
cfdm: A Python reference implementation of the CF data model - Published in JOSS (2020)
Metasyn
Metasyn: Transparent Generation of Synthetic Tabular Data with Privacy Guarantees - Published in JOSS (2025)
ck
Collective Knowledge (CK), Collective Mind (CM/CMX) and MLPerf automations: community-driven projects to facilitate collaborative and reproducible research and to learn how to run AI, ML, and other emerging workloads more efficiently and cost-effectively across diverse models, datasets, software, and hardware using MLPerf methodology and benchmarks
widoco
Wizard for documenting ontologies. WIDOCO is a step by step generator of HTML templates with the documentation of your ontology. It uses the LODE environment to create part of the template.
micrometaapp-react
This is the React implementation of Micro-Meta App, Microscopy Metadata for the real world!
micrometaapp-electron
Electron wrapped, stand-alone version of Micro-Meta App, microscopy metadata for the real world!
micrometaapp-omero
This is a limited-functionality, beta - implementation of the integration of Micro-Meta App in the OMERO.web client
oemetadata
The OEMetadata is an energy metadata standard including a metadata schema, templates, and examples.
qa-catalogue
QA Catalogue – a metadata quality assessment tool for library catalogue records (MARC, PICA, UNIMARC)
proteomics-sample-metadata
The Proteomics sample metadata: Standard for experimental design annotation in proteomics datasets
odis-arch
Development of the Ocean Data and Information System (ODIS) architecture
core-geonetwork
GeoNetwork is a catalog application to manage spatially referenced resources. It provides powerful metadata editing and search functions as well as an interactive web map viewer. It is currently used in numerous Spatial Data Infrastructure initiatives across the world.
nbomicroscopymetadataspecs
The 4DN + BINA Tiered System of Microscopy Metadata Guidelines was developed by the 4DN Imaging Standards Working Group in collaboration with the BINA Data Management and Quality Control Working Group (https://www.bioimagingna.org/qc-dm-wg) for documenting microscopy experiments and assessing the quality of image data.
zowie
Adds Zotero "select" links to attachment files in a Zotero database on macOS, so that outside of Zotero, you can find the bibliographic entry to which a file belongs. (Only works for local storage, not linked attachments.)
4dnmetadataschemaxsd2jsonconverter
This is a converter written in Java that translates an XSD microscopy metadata schema into JSON
canada-labour-research-assistant
The Canada Labour Research Assistant (CLaRA) is a privacy-first LLM-powered RAG AI assistant proposing Easily Verifiable Direct Quotations (EVDQ) to mitigate hallucinations in answering questions about Canadian labour laws, standards, and regulations. It works entirely offline and locally, guaranteeing the confidentiality of your conversations.
py3data
A flexible and lightweight Python interface to the re3data.org database
brep-part-finder
A Python package to identify the part ID number in Brep format CAD files
BiocPkgTools
Computable build reports, package metadata, and download stats from the Bioconductor project
scif
scientific filesystem: a filesystem organization for scientific software and metadata
csv-metadata-quality
A simple but opinionated metadata quality checker and fixer designed to work with CSVs in the DSpace ecosystem
obofoundry.github.io
Metadata and website for the Open Bio Ontologies Foundry Ontology Registry
guano
Python reference implementation of the GUANO bat acoustics metadata specification
PHES-ODM
The Public Health Environmental Surveillance Open Data Model (PHES-ODM, or ODM). A data model, dictionary and support tools for environmental surveillance.
https://github.com/biopragmatics/biolookup
🔍 The Biolookup Service retrieves metadata and ontological information about biomedical entities.
https://github.com/cdcgov/tostadas
🧬 💻 TOSTADAS → Toolkit for Open Sequence Triage, Annotation and DAtabase Submission
NPSdataverse
NPSdataverse: a suite of R packages for data processing, authoring Ecological Metadata Language metadata, checking data-metadata congruence, and accessing data - Published in JOSS (2025)
https://github.com/annakrystalli/dataspice-tutorial
Tutorial in practical research data management: creating simple metadata files using dataspice
https://github.com/antarctica/metadata-library
Python library for generating ISO 19115 metadata records.
https://github.com/centrefordigitalhumanities/edpop
A virtual research environment (VRE) that lets you collect, align and annotate bibliographical and biographical records from several online catalogs.
https://github.com/cdcgov/cdh-lava-react
CDC Data Hub Lifecycle, Analysis & Visualization Accelerator (LAVA) REACT Components based on machine readable requirements.
metafinder
Search for documents in a domain through Search Engines (Google, Bing and Baidu). The objective is to extract metadata
https://github.com/annakrystalli/emodnet-dataspice-tutorial
An Introduction to using R package Dataspice to collect metadata for EMODnet Biology Data Products
https://github.com/catalyst-cooperative/pudl-zenodo-storage
Tools for creating versioned archives of raw data on Zenodo using Frictionless data packages.
https://github.com/cdli-gh/data
This is a copy of the daily dump of catalogue and ATF data from the Cuneiform Digital Library Initiative (http://cdli.ucla.edu)
copo-documentation
COPO documentation was created using the Sphinx reStructuredText (reST) markup language which is hosted on readthedocs.io. The documentation uses Sphinx.
https://github.com/actinia-org/actinia-metadata-plugin
Contains actinia communication with a metadata catalog via OGC-CSW, in usage with GeoNetwork opensource
https://github.com/cumbof/opengdc.py
Python implementation of the OpenGDC software https://github.com/cumbof/OpenGDC
https://github.com/cured-plus/csvw-duckdb
Convert a CSVW document (CSV metadata) to a DuckDB query to load a CSV file into a database.
empirical-retraction-lit
Empirical Retraction Lit bibliography content, website, and web maintenance info
hermes_metadata_lesson
Data and Metadata in the Humanities: Structures, Schemas, and Standards
dataverseur
🫖 A dataverse API R wrapper to enhance the deposit procedure using only R variable declarations
https://github.com/cldf/cldf
CLDF: Cross-Linguistic Data Formats - the specification
omega
Open Microscopy Environment inteGrated Analysis (OMEGA) is a cross-platform data management, analysis and visualization system, for particle tracking data with particular emphasis on results from viral and vesicular trafficking experiments.
quarto-lua-env
lua-env is an extension for Quarto to provide access to LUA objects as metadata.
elmo-enhanced-laboratory-metadata-optimizer
Metadata editor for research data management of the GeoForschungsZentrum Potsdam
https://github.com/bigbio/sdrf-cellline-metadata-db
SDRF Cell Line Metadata Database
same-version
Automatically ensures your software version metadata is consistent across key project files.
https://github.com/colectica/ddiregistry
Registry software for the DDI Alliance Agency Registry
https://github.com/cidgoh/cancogen_contextual_data_specification
A data specification for harmonizing One Health Canadian COVID pathogen genomics contextual data. The specification provides standardized (ontology-based) fields and terms which are implemented via a the DataHarmonizer, supported by field and reference guides as well as different curation and new term request SOPs.
pia-data-model
Data model for the Participatory Knowledge Practices in Analogue and Digital Image Archives (PIA) project
bundesweiter_klinischer_krebsregisterdatensatz-datenschema_und_klassifikationen
Das Repository stellt Informationen zu Struktur und Klassifikationen des bundesweiten klinischen Krebsregisterdatensatzes bereit. Die verwendeten Klassifikationen bilden den derzeitigen Arbeitsstand des ZfKD ab. Ziel ist es, diesen Stand transparent bereit zu stellen und beteiligte Akteure zur gemeinsamen Harmonisierung von Standards einzuladen.
aiu
A library for interacting with web archive collections at Archive-It, Trove, Pandora, and more.
oeg-software-graph
Knowledge graph containing the catalog of software from the oeg-upm organization in GitHub
fair_ontologies
Code for FOOPS! the OEG FAIR assessment tool for ontologies and vocabularies
sum-buddy
Generate and save checksums for all (or certain) contents of given directory.
zoinks
Command-line utility to return Zotero record field values given a Zotero select link, an item key, or even just a file attachment
dirschema
Spec and validator for directories, files and metadata based on JSON Schema and regexes.