Pubmed Parser
Pubmed Parser: A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset XML Dataset - Published in JOSS (2020)
Building, Importing, and Exporting GEXF Graph Files with rgexf
Building, Importing, and Exporting GEXF Graph Files with rgexf - Published in JOSS (2021)
externdata
:page_facing_up: Modelica library for data I/O of CSV, INI, JSON, MATLAB MAT, SSV, TIR, Excel XLS/XLSX and XML files
vs.xml
Special-purpose standalone XML parser, tree builder, and query engine for modern C++
folia
An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing (NLP). This library was formerly part of PyNLPl.
odin
Data-structure definition/validation/traversal, mapping and serialisation toolkit for Python
https://github.com/cicirello/generate-sitemap
Generate an XML sitemap for a GitHub Pages site using GitHub Actions
https://github.com/aurora-network-global/sdg-queries
This repository contains machine readable (xml) search queries (crafted from a controlled vocabulary), for the Scopus publication database, to find domain specific research output that are related to the 17 Sustainable Development Goals (SDGs). We invite enyone to improve the SDG queries further in a co-creation process.
pubchunks
:warning: ARCHIVED :warning: Get chunks of XML format scholarly articles
https://github.com/dcavar/elan2split
Split ELAN Annotation Files and corresponding speech files into a corpus format for common ASR and Forced Aligners
seansaudiodb
The personal version of my audio collection database. Not intended for public use. See the other version that is intended for public use: https://github.com/seanpm2001/AudiBass_Manager
eno-flex
Repository of custom R codes to wrangle .lift and .flexttext output of FLEx into a new SFM database to be re-imported into a new FLEx dictionary/lexicon project. The spinoff of this project is available at https://github.com/engganolang/eno-learner-lift
grobid-datacat-trainingdata
Training datasets for GROBID sale catalogues models.
dts-typescript
Distributed Text Services (DTS) API for the TEI/XML files available in the Kouigenji Monogatari Text DB
teigarage
EGE RESTful web service. Provides EGE functionality through RESTful web service way.
https://github.com/qax-os/excelize
Go language library for reading and writing Microsoft Excel™ (XLAM / XLSM / XLSX / XLTM / XLTX) spreadsheets
ediarum.prohd.edit
Last public release of ediarum.PROHD.edit for Proyecto Humboldt Digital including localization into Spanish
rus-novel-desktop-app
Десктопное приложение для создания размеченных файлов корпуса русского романа 📖
https://github.com/cedergrouphub/limesoup
LimeSoup is a package to parse HTML or XML papers from different publishers.
https://github.com/arfc/transition-scenarios
A repository to hold transition scenarios with Cyclus.
https://github.com/conal-tuohy/xproc-z
A platform for running XProc pipelines as web applications in a Java servlet container
edh_etl
This repository contains scripts for accessing, extracting and transforming epigraphic datasets from the Epigraphic Database Heidelberg (https://edh.ub.uni-heidelberg.de/) in a reproducible manner.
sixarm_ruby_xml_load
SixArm.com » Ruby » XML#load methods to load documents, elements, attributes
sixarm_ruby_xml_strip
SixArm.com » Ruby » XML#strip methods to clean XML & HTML
wolfsoftware.data-converter
A data converter package to convert data between JSON, YAML and XML.