Releases | Open Source Science

lexikanon - v0.6.5

Fix

dependencies: Update python and hyfi versions (d55c830)

- Python
Published by github-actions[bot] over 2 years ago

lexikanon - v0.6.4

Fix

tests: Update HyFI method calls and behavior (c55189a)
Makefile: Enable trust flag for copier command (1717647)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.6.3

Fix

dependencies: Upgrade hyfi to 1.29.8 (7747ddb)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.6.2

Fix

book: Update library description, add Zenodo link (b55f063)

Documentation

readme: Add Zenodo DOI badge and link, update library description (5513505)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.6.1

Fix

tokenizers: Add model validator after method (cfd8a06)
normalizer: Change unescape_html type to Union[bool, str]
tokenizer: Add formal_en normalizer to nltk config (b65d01a)
stopwords: Add verbose condition to logging (a7f168b)
lexikanon: Adjust NLTKTagger for tagsets and default tag (54ba18f)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.6.0

Feature

tokenizer: Add additional postags (fe95c3e)
tokenizer: Add additional postags to nltk config (a35f684)
tokenizers/nltk: Add language support, improve tagset flexibility, download universal_tagset (82e2514)
lexikanon: Add new nltk_universal configuration file (2c7880f)
tokenizer: Add nltk_universal configuration (e68c5e1)
tokenizers: Add MecabTagger and NLTKTagger (b766fb3)
tokenizer/tagger: Implement NLTKTagger (4f2d945)

Fix

tokenizer: Add punctuation postags to mecab.yaml (1624dbd)
tokenizers: Adjust tokenizer base configurations (c96bfbf)
MecabTagger: Correct config_group path (9b74064)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.5.2

Fix

dependencies: Upgrade hyfi to 1.20.0 (1b6c402)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.5.1

Fix

stopwords: Simplify loading and accessing stopwords (b4628f9)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.5.0

Feature

lexikanon: Add find_similar_docs_by_clustering configuration (51ddf56)
lexikanon: Add find_similar_docs_by_clustering configuration (e2122cf)
lexikanon: Add similarity.py for document similarity analysis (b1fc21b)
pyproject.toml: Add scikit-learn dependency (7dd0014)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.4.3

Fix

tokenizers: Add Tokenizer to lexikanon tokenizers (66712af)
lexikanon: Change tokenizer_config_name to tokenizer (dc014bc)
lexikanon/pipe/tokenize: Enhance tokenizer function to support string or dict types (58e32f9)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.4.2

Fix

tokenizer: Change target from corprep to lexikanon in nltk.yaml (27e6915)
.envrc: Add new environment configuration file (13eeba2)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.4.1

Fix

lexikanon: To bump version (b929758)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.4.0

Feature

nltk: Add config group and name to NLTKTagger (fd5ba82)
mecab: Add _config_group_ and _config_name_ fields in MecabTagger (75937aa)
lexikanon: Add _config_group_ and _config_name_ to Tokenizer class (5ae39cb)
normalizer: Add config group and config name attributes to classes (f55ea51)
tokenizer/tagger: Add _config_group_ and _config_name_ in mecab.yaml and nltk.yaml (ed1be7c)
tokenizer: Add _config_name_ in tokenizer configuration files (5a3d79b)
lexikanon: Add _config_name_ in normalizer files (989d354)

Fix

dependencies: Upgrade hyfi to 1.12.5 (0113560)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.3.2

Fix

To force bumping version (7c973d7)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.3.1

Fix

lexikanon: Simplify YAML configuration files (55def1f)
dependencies: Upgrade hyfi to ^1.11.0 (1ea8090)
lexikanon: Add 'num_heads' and 'num_tails' options in 'dataset_extract_tokens.yaml', 'dataset_tokenize.yaml', 'tokenize.py' and 'extract_tokens' (07dec64)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.3.0

Feature

lexikanon/pipe: Add new tokenize module (307d2a7)
lexikanon/pipe: Add new __init__.py file (f824369)
lexikanon: Add dataset_extract_nouns configuration (e175605)
lexikanon: Add dataset_extract_tokens.yaml configuration (f024732)
lexikanon: Add new tokenizer configuration in the dataset_tokenize.yaml file (02f4044)

Fix

lexikanon: Update logging and data display in tokenize functions (27b22e7)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.2.4

Fix

stopwords: Add special methods for Stopwords (e1b8871)
stopwords: Separate stopwords function and list, enhance logging control (fe2cf32)
stopwords: Rename configuration variables (084f679)
dependencies: Upgrade hyfi to 1.9.4 (7a0bc54)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.2.3

Fix

lexikanon: Change package_name to package_path in HyFI.initialize_global_hyfi args (0b93f1a)
dependencies: Upgrade hyfi to 1.9.3 (beb09bb)
dependencies: Upgrade ekonlpy to 2.0.2 (4215068)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.2.2

Fix

tokenizer: Change SimpleTokenizer path in config (13d4ea0)
tokenizer: Change MecabTokenizer import path (edc6e8d)
stopwords: Update target path (da93a48)
normalizer: Update target to lexikanon.normalizers.Normalizer (c33e060)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.2.1

Fix

dependencies: Upgrade hyfi to 1.9.0 (e91bf8d)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.2.0

Feature

tests: Add stopwords test in lexikanon module (32be6ae)
tests: Add new test cases in test_tokenizer.py (f4f6eb8)
tests: Add normalizer test case in lexikanon (ef1d8c9)
lexikanon/utils/hanja: Add table loading functionality (e3f14ee)
lexikanon: Add hanja translation support (36df26b)
hangul: Add support for Hangul character operations (130c699)
lexikanon/utils/hanja: Add new translation functionality (d35534b)
lexikanon/utils: Add hangle utilities to handle the Korean language (dba4474)
lexikanon/utils: Add new util file with various text processing functions (6fd05ec)
tokenizers: Add SimpleTokenizer, MecabTokenizer, NLTKTokenizer (6154fa0)
tokenizers: Add NLTKTokenizer and NLTKTagger classes (44aacd0)
tokenizers: Add MecabTokenizer and MecabTagger classes (23dec65)
lexikanon/tokenizers: Add base tokenizer methods (dc826d1)
stopwords: Add Stopwords class (3ae64af)
lexikanon/resources/dictionaries/mecab: Add new ekon_v1 dictionary file (5a0cb8a)
lexikanon/normalizers: Add new file normalizer.py with Normalizer class and associated configurations (94fda7f)
lexikanon/normalizers: Add Normalizer (342b781)
lexikanon: Add new tokenizer configuration (2c3ce80)
tokenizer: Add nltk configuration files for tokenization and tagging (e700d9b)
tokenizer: Add configuration for mecab tokenizer (05caee4)
tokenizer: Add new tokenizer configuration file (ad9c0c2)
stopwords: Add new stopwords configuration file (d403295)
normalizer: Add new files for various character settings (2568cdd)
dependencies: Add ftfy, nltk and ekonlpy (6b68952)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.1.4

Fix

pyproject.toml: Update package version and dependency versions (0f40b84)

- Python
Published by github-actions[bot] almost 3 years ago

lexikanon - v0.1.3

Fix

Apply updated template (37a9567)

- Python
Published by github-actions[bot] about 3 years ago

lexikanon - v0.1.2

Fix

version: Disable scm-version (c929957)

- Python
Published by github-actions[bot] about 3 years ago

lexikanon - v0.1.1

Fix

version: Add pre-commit command to make scm-version (fd61706)

- Python
Published by github-actions[bot] about 3 years ago

lexikanon - v0.1.0

Feature

Initial version (eb58046)

- Python
Published by github-actions[bot] about 3 years ago
