Pubmed Parser
Pubmed Parser: A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset XML Dataset - Published in JOSS (2020)
fortran-src
fortran-src: Fortran static analysis infrastructure - Published in JOSS (2025)
f90nml - A Python module for Fortran namelists
f90nml - A Python module for Fortran namelists - Published in JOSS (2019)
rdflib
RDFLib is a Python library for working with RDF, a simple yet powerful language for representing information.
yadg
yadg: yet another datagram - Published in JOSS (2022)
ms3
ms3: A parser for MuseScore files, serving as data factory for annotated music corpora - Published in JOSS (2023)
Nominally
Nominally: A Name Parser for Record Linkage - Published in JOSS (2021)
General binary file parser.
General binary file parser. - Published in JOSS (2018)
gnparser
GNparser normalises scientific names and extracts their semantic elements.
fr.inria.gforge.spoon:spoon-core
Spoon is a metaprogramming library to analyze and transform Java source code. :spoon: is made with :heart:, :beers: and :sparkles:. It parses source files to build a well-designed AST with powerful analysis and transformation API.
https://github.com/markedjs/marked
A markdown parser and compiler. Built for speed.
org.webjars.npm:nearley
📜🔜🌲 Simple, fast, powerful parser toolkit for JavaScript.
tree-sitter-ssh-client-config
tree-sitter grammar for SSH client configuration files
rigolwfm
Parsers for .wfm binary files created by a wide range of Rigol oscilloscopes
textx
Domain-Specific Languages and parsers in Python made easy http://textx.github.io/textX/
generic_parser
A parser for arguments and config files that also allows direct Python input and recursive parsing
url-search-params
`url-search-params` provides ability to create search params (query string) from HashMap and vice versa.
PDDL
Julia parser, interpreter and compiler interface for the Planning Domain Definition Language (PDDL). Planners not included.
forensicsim
A forensic open-source parser module for Autopsy that allows extracting the messages, comments, posts, contacts, calendar entries and reactions from a Microsoft Teams IndexedDB LevelDB database.
MonkeyLang
"Writing an Interpreter in GO" and "Writing a Compiler in GO" in Julia.
splitp
Python package that implements split- and rank-based tools for inferring phylogenies, such as flattenings and subflattenings.
fast-matrix-market
Fast and full-featured Matrix Market I/O library for C++, Python, and R
https://github.com/barrust/mediawiki
MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/
https://github.com/ami-iit/rod
The ultimate Python tool for RObot Descriptions processing.
https://github.com/bramvanroy/spacy_conll
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.
https://github.com/fanchengyan/myst-sphinx-gallery
A Sphinx extension that builds galleries from myst-style markdown, notebook or rst files.
https://github.com/andstor/latex-math-parser
:wrench: Parser for parsing LaTeX math expressions
https://github.com/dcavar/treebankparser
Parser for treebanks based on Penn Treebank type of encoding that generates Probabilistic Context Free Grammars
https://github.com/althonos/opticaldisc
Read optical media filesystems with Rust
https://github.com/aggrathon/rustcalculator
A commandline calculator written in Rust
gov.nasa.pds:pds3-product-tools
Library supporting the design/generation, validation and submission of PDS3 archival products.
https://github.com/cedrickchee/hou
Hou :monkey: programming language interpreter and compiler
https://github.com/bytedance/dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
https://github.com/vincentlaucsb/csv-parser
A high-performance, fully-featured CSV parser and serializer for modern C++.
https://github.com/danielgatis/go-vte
A library to parse terminal escape sequences (almost) exactly how the real hardware does.
treeio
:seedling: Base Classes and Functions for Phylogenetic Tree Input and Output
Constructs
[WIP] A declarative deserialization-serialization for binary data. Inspired by Construct.
https://github.com/michaelhatherly/lexbor.jl
Julia wrapper for https://github.com/lexbor/lexbor
https://github.com/cedergrouphub/limesoup
LimeSoup is a package to parse HTML or XML papers from different publishers.
https://github.com/bamresearch/masterdata-parser-example
An example parser for openBIS using the bam-masterdata interface.
url-build-parse
`url-build-parse` provides the ability to parse URL from string as well as construct URL from parts.
https://github.com/xinmengbcr/moltopolparser
A lightweight package to parse and process molecular simulation files