Scientific Software
Updated 6 months ago

pyuca — Peer-reviewed • Rank 20.2 • Science 93%

pyuca: a Python implementation of the Unicode Collation Algorithm - Published in JOSS (2016)

Scientific Software · Peer-reviewed
Updated 6 months ago

App-unichar • Rank 2.1 • Science 54%

Perl tool to display Unicode characters by character, name, or code point

Updated 6 months ago

@stdlib/string-next-grapheme-cluster-break • Rank 9.2 • Science 44%

Return the next extended grapheme cluster break in a string after a specified position.

Updated 6 months ago

com.bkahlert.kommons:kommons • Rank 9.2 • Science 44%

Kommons is a set of Kotlin Multiplatform Libraries (MPP) to allow the execution of command lines / scripts, to support print debugging and to ease testing.

Updated 6 months ago

@stdlib/string-prev-grapheme-cluster-break • Rank 3.6 • Science 44%

Return the previous extended grapheme cluster break in a string before a specified position.

Updated 6 months ago

sixarm_unicode_emoji • Rank 0.7 • Science 44%

SixArm.com Unicode Emoji Examples

Updated 6 months ago

pyonmttok • Rank 18.5 • Science 26%

Fast and customizable text tokenization library with BPE and SentencePiece support

Updated 6 months ago

stringx • Rank 9.2 • Science 26%

Drop-in replacements for base R string functions powered by stringi

Updated 6 months ago

@stdlib/string-base-remove-first-grapheme-cluster • Science 44%

Remove the first grapheme cluster (i.e., user-perceived character) of a string.

Updated 5 months ago

https://github.com/bkahlert/kommons-debug • Science 18%

Kotlin Multiplatform Library with print debugging, Unicode and other features you did not know you were missing

Updated 6 months ago

@stdlib/string-next-code-point-index • Science 44%

Return the position of the next Unicode code point in a string after a specified position.

Updated 6 months ago

@stdlib/string-base-for-each-code-point-right • Science 44%

Invoke a function for each Unicode code point in a string, iterating from right to left.

Updated 6 months ago

string-base-slice-grapheme-clusters • Science 44%

Slice a string based on grapheme cluster (i.e., user-perceived character) indices.

Updated 6 months ago

@stdlib/string-base-for-each-grapheme-cluster • Science 44%

Invoke a function for each grapheme cluster (i.e., user-perceived character) in a string.

Updated 6 months ago

stringi • Science 49%

Fast and portable character string processing in R (with the Unicode ICU)

Updated 6 months ago

@stdlib/string-base-truncate-middle • Science 44%

Truncate the middle UTF-16 code units of a string to return a string having a specified length.

Updated 6 months ago

@stdlib/string-base-remove-last-grapheme-cluster • Science 44%

Remove the last grapheme cluster (i.e., user-perceived character) of a string.

Updated 6 months ago

uast • Science 67%

Unicode Aware Saṃskṛta Transliteration (https://arxiv.org/abs/2203.14277)

Updated 6 months ago

@stdlib/string-base-first-grapheme-cluster • Science 44%

Return the first grapheme cluster (i.e., user-perceived character) of a string.