Projects with codemeta.json

Updated 11 months ago

pythainlp • Rank 24.6 • Science 67%

Thai natural language processing in Python

computational-linguistics hacktoberfest natural-language-processing nlp-library python soundex text-processing thai thai-language thai-nlp thai-nlp-library thai-soundex word-segmentation

Updated 11 months ago

ucto • Rank 9.3 • Science 26%

Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation and splits sentences. It also offers several basic preprocessing steps, such as changing case, that make your text suitable for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto

computational-linguistics folia language natural-language-processing nlp punctuation tokeniser

Updated 11 months ago

libfolia • Rank 7.4 • Science 26%

FoLiA library for C++

folia library natural-language-processing nlp

Updated 11 months ago

frog • Rank 7.1 • Science 26%

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

computational-linguistics dependency-parser dutch folia lemmatiser morphological-analyser morphology named-entity-recognition natural-language-processing nlp pos-tagger syntax text-processing

Updated 11 months ago

mbt • Rank 6.8 • Science 26%

MBT: Memory-based tagger generation and tagging. MBT is a memory-based tagger-generator and tagger in one.

c-plus-plus machine-learning natural-language-processing nlp tagger timbl

Updated 11 months ago

colloquery • Rank 1.1 • Science 26%

Web application for searching for phrases/collocations/synonyms in phrase translation tables

computational-linguistics machine-translation mt natural-language-processing nlp

Updated 11 months ago

pgrdup • Science 67%

Discover Probable Duplicates in Plant Genetic Resources Collections

double-metaphone double-metaphone-algorithm natural-language-processing pgr plant-genetic-resources r record-linkage

Updated 11 months ago

awesome-ai4lam • Science 49%

A list of awesome AI in libraries, archives, and museum collections from around the world 🕶️

ai ai-in-libraries artificial-intelligence awesome awesome-list awesome-lists awesome-resources libraries library-automation library-systems machine-learning machine-learning-projects machinelearning ml-in-libraries natural-language-processing

Updated 11 months ago

the-corpus-as-a-network • Science 54%

Turning source documents into a graph with NLP

named-entity-recognition natural-language-processing social-network-analysis

Updated 11 months ago

nederlab-pipeline • Science 26%

Linguistic enrichment pipeline for historical Dutch, as used in the Nederlab project

dutch historical-dutch historical-linguistics natural-language-processing nederlab nextflow nlp workflow

Updated 11 months ago

pkgmatch • Science 26%

Find R packages matching either descriptions or other R packages

embeddings llms natural-language-processing r

