Recent Releases of concise-concepts
concise-concepts - added sense2vec, standalone and fuzzy matching support
added new features: - sense2vec support #26 - standalone support - fuzzy matching support #24 - binarized word2vec model support
minor bug fixes
- original data was over-written by expanded data => prior knowledge was only inferred once
- "keyword" and "wordkey" got mixed up for logging when verbose=True
- Python
Published by davidberenstein1957 over 3 years ago
concise-concepts - resolved bug where original data was overwritten
Overwriting the original data disables re-inferring the original knowledge to the rest of the model.
- Python
Published by davidberenstein1957 over 3 years ago
concise-concepts - added option for Standalone usage
- Python
Published by davidberenstein1957 over 3 years ago
concise-concepts - added wildcard operator * to compund words patterns
- Python
Published by davidberenstein1957 over 3 years ago
concise-concepts - resolve python 3.8 versioning error
- Python
Published by davidberenstein1957 over 3 years ago
concise-concepts - bug fix case sensitive entity scoring
25 entity scoring did not work in combination with case sensitivity
- Python
Published by davidberenstein1957 over 3 years ago
concise-concepts - bug fixes and efficiency updates by Tom Aarsen
- Resolve several breaking bugs.
- Perform a large optimization regarding string normalization and KeyedVector membership.
- Perform simple code optimization each localized to just a few lines.
- Python
Published by davidberenstein1957 over 3 years ago
concise-concepts - small feature updates, bug fixes and documentation update
17 resolved an issue that caused duplicate logging for the same missing keys. Also introduced verbose option.
18 introduced json_path to export matching patterns to a custom path.
19 defaults to key if example words are not present in the embedding model.
- Python
Published by davidberenstein1957 over 3 years ago
concise-concepts - added more extensive logging and better default parameters
- Python
Published by davidberenstein1957 over 3 years ago
concise-concepts - resolved #13
Added correct character escaping Regex lowercase match Added correct n-gram join.
- Python
Published by davidberenstein1957 over 3 years ago
concise-concepts - resolved #10 #11 custom matching patterns and Gensim model issues
11 added support for more custom matching patterns via 4 config variables.
- ´exclude_pos´: A list of POS tags to be excluded from the rule based match.
- ´exclude_dep´: A list of dependencies to be excluded from the rule based match.
- ´includecompoundwords´: If True, it will include compound words in the entity. For example, if the entity is "New York", it will also include "New York City" as an entity.
- ´case_sensitive´: Whether to match the case of the words in the text.
10 resolved an issue where gensim Word2Vec and FastText models were not processed as KeyedVectors. Hence, the model did not load due to mis'interpretting it as an iterable object.
Also unified code regarding checking whether string are present in a model.
Also made sure that n-grams models and word matches are supported.
- Python
Published by davidberenstein1957 over 3 years ago
concise-concepts - #9 resolved typo in key not present in dict logic
- Python
Published by davidberenstein1957 about 4 years ago
concise-concepts - resolved #7 OOV words and keys
- Python
Published by davidberenstein1957 about 4 years ago
concise-concepts - overcome KeyError OOV words
- Python
Published by davidberenstein1957 about 4 years ago
concise-concepts - resolved bug in pytest regarding Gensim model loading
I introduced a small bug within try, excepts during model loading.
- Python
Published by davidberenstein1957 about 4 years ago
concise-concepts - resolved Extension 'ent_score' already exists on Span.
First check whether the Span extension 'ent_score' is present. If so, remove it, upon initializing a new concise-concepts.
- Python
Published by davidberenstein1957 about 4 years ago
concise-concepts - first minor release
- resolved dependencies rule matcher
- added entity scoring
- settle inference around concept
- add original data back to list after inference around concept
- Python
Published by davidberenstein1957 about 4 years ago
concise-concepts - recurrent inference of examples to properly settle data around a concept
- Python
Published by davidberenstein1957 about 4 years ago
concise-concepts - added support for entity confidence scoring and Fasttext models
- Python
Published by davidberenstein1957 about 4 years ago
concise-concepts - support for Gensim embedding
Added support for loading Gensim embeddings (custom model or Gensim pre-trained models) via configkey modelpath.
- Python
Published by davidberenstein1957 about 4 years ago