Scientific Software
Updated 6 months ago

TextDescriptives — Peer-reviewed • Rank 17.6 • Science 98%

TextDescriptives: A Python package for calculating a large variety of metrics from text - Published in JOSS (2023)

Updated 6 months ago

trafilatura • Rank 26.3 • Science 77%

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Updated 6 months ago

textstat • Rank 25.6 • Science 36%

:memo: python package to calculate readability statistics of a text object - paragraphs, sentences, articles.

Updated 5 months ago

https://github.com/alan-turing-institute/readabilipy • Rank 21.8 • Science 23%

A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's Readability.js package or in pure-python mode.

Updated 5 months ago

https://github.com/brucewlee/prompt-learning-readability • Rank 1.4 • Science 26%

[EACL 2023] use text-to-text models (BART, T5) for readability assessment

Updated 6 months ago

dyslexic-readability • Science 44%

A readability scoring library tailored to the specific needs of Turkish dyslexic readers.