Scientific Software
Updated 6 months ago
pyuca
pyuca: a Python implementation of the Unicode Collation Algorithm - Published in JOSS (2016)
Scientific Software · Peer-reviewed
Updated 6 months ago
App-unichar
Perl tool to display Unicode characters by character, name, or code point
Updated 6 months ago
com.bkahlert.kommons:kommons
Kommons is a set of Kotlin Multiplatform Libraries (MPP) to allow the execution of command lines / scripts, to support print debugging and to ease testing.
Updated 6 months ago
@stdlib/regexp-whitespace
Return a regular expression to match a white space character.
Updated 4 months ago
https://github.com/google-research/nisaba
Finite-state script normalization and processing utilities
Updated 6 months ago
pyonmttok
Fast and customizable text tokenization library with BPE and SentencePiece support
Updated 6 months ago
stringi
Fast and portable character string processing in R (with the Unicode ICU)