Recent Releases of sense2vec
sense2vec - v1.0.2: Fix deserialization of components
🔴 Bug fixes
- Add defaults for config if attributes are not included in saved model.
- Fix serialization and deserialization of string store in component.
- Python
Published by ines over 6 years ago
sense2vec - v1.0.1: Fix caching bug
🔴 Bug fixes
- Fix bug that'd cause the scores to not be read correctly from precomputed `most_similar` caches.
- Python
Published by ines over 6 years ago
sense2vec - v1.0.0: Refactor and modernize, spaCy v2.2 support, more features, 2019 vectors model & Prodigy recipes
✨ New features and improvements
- Completely rewrite package from scratch.
- Replace built-in vector storage with spaCy's `Vectors`, making this package a pure Python package and allowing easy out-of-the-box serialization of vectors.
- Add fully serializable spaCy pipeline component and extension attributes.
- Add new methods `get_best_sense` and `get_other_senses` and improve `most_similar`.
- Add script for precomputing index of nearest neighbors for super fast "most similar" queries.
- Add annotation recipes for Prodigy to easily create word lists and match patterns from similar phrases using sense2vec vectors (like the `terms.teach` recipe, just with multi-word expressions).
- New and more efficient training and preprocessing scripts using GloVe and fastText.
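To illustrate why a precomputed nearest-neighbor index makes "most similar" queries fast, here is a minimal pure-Python sketch. It is not the script shipped with the package: the function names (`cosine`, `precompute_neighbors`), the toy vectors, and the dictionary-based cache layout are all assumptions made for this example.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def precompute_neighbors(vectors, n=2):
    """Map every key to its n most similar other keys as (key, score) pairs."""
    cache = {}
    for key, vec in vectors.items():
        scored = [
            (other, cosine(vec, other_vec))
            for other, other_vec in vectors.items()
            if other != key
        ]
        # Highest similarity first, then keep only the top n entries.
        scored.sort(key=lambda pair: pair[1], reverse=True)
        cache[key] = scored[:n]
    return cache


# Hypothetical 2-dimensional toy vectors, keyed by "phrase|SENSE".
toy_vectors = {
    "duck|NOUN": [1.0, 0.0],
    "goose|NOUN": [0.9, 0.1],
    "duck|VERB": [0.0, 1.0],
}

# The expensive all-pairs comparison happens once, ahead of time.
neighbor_cache = precompute_neighbors(toy_vectors, n=1)

# At query time, a lookup replaces the similarity computation.
print(neighbor_cache["duck|NOUN"])
```

The trade-off in this sketch is the usual one for caches: more work and storage up front in exchange for constant-time lookups later.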
⚠️ Backwards incompatibilities
- The `sense2vec.load` method has been removed. Use `Sense2Vec.from_disk` instead.
- The previous `VectorMap` and `VectorStorage` have been removed.
- This package now requires Python 3.6+.
- This update requires a new vectors format (see attached files).
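As a rough migration sketch for the removed `sense2vec.load`, loading now goes through `Sense2Vec.from_disk`. The helper names `load_vectors` and `nearest_phrases` are made up for this example, and the path passed to `load_vectors` is whatever directory you unpacked the new-format vectors into.

```python
def load_vectors(path):
    """Load sense2vec vectors from a directory using the v1.0.0 API."""
    # Imported inside the function so the helper below can be used and
    # tested without the package installed.
    from sense2vec import Sense2Vec

    # Replaces the removed `sense2vec.load` entry point.
    return Sense2Vec().from_disk(path)


def nearest_phrases(s2v, key, n=3):
    """Return up to n (key, score) neighbors, or an empty list for unknown keys."""
    if key not in s2v:
        return []
    return s2v.most_similar(key, n=n)
```

A typical call would look like `s2v = load_vectors("/path/to/vectors")` followed by `nearest_phrases(s2v, "natural_language_processing|NOUN")`, where the path is a placeholder and the key must exist in the vectors you loaded.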
📖 Documentation and examples
- Rewrite README from scratch and include full API docs.
👥 Contributors
Thanks to @kabirkhan for contributing the initial Prodigy recipes!
- Python
Published by ines over 6 years ago
sense2vec - v1.0.0a2: Refactor and modernize, spaCy v2.2 support, more features, Prodigy recipes
⚠️ This is an alpha release and not yet ready for production. You can download sense2vec via pip by specifying the exact version.
```bash
pip install sense2vec==1.0.0a2
```

The converted Reddit vectors (trained on all comments of 2015) are attached to this release as a `.tar.gz` file. For more details and usage instructions, see the README.
✨ New features and improvements
- Completely rewrite package from scratch.
- Replace built-in vector storage with spaCy's `Vectors`, making this package a pure Python package and allowing easy out-of-the-box serialization of vectors.
- Add fully serializable spaCy pipeline component and extension attributes.
- Add new methods `get_best_sense` and `get_other_senses` and improve `most_similar`.
- Add annotation recipes for Prodigy to easily create word lists and match patterns from similar phrases using sense2vec vectors (like the `terms.teach` recipe, just with multi-word expressions).
- New and more efficient training and preprocessing scripts using GloVe.
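The idea behind picking a "best sense" and listing "other senses" can be sketched in plain Python. This is an illustration only, not the library's code: it assumes keys of the form `phrase|SENSE` with spaces replaced by underscores, assumes the best sense is simply the most frequent one, and uses hypothetical helper names (`make_key`, `best_sense`, `other_senses`) and made-up frequencies.

```python
def make_key(word, sense):
    """Build a 'phrase|SENSE' key, replacing spaces with underscores."""
    return word.replace(" ", "_") + "|" + sense


def best_sense(freqs, word, senses):
    """Return the most frequent known key for word, or None if none is known."""
    known = [key for key in (make_key(word, s) for s in senses) if key in freqs]
    if not known:
        return None
    return max(known, key=lambda key: freqs[key])


def other_senses(freqs, key, senses):
    """Return known keys that share the phrase of key but carry a different sense."""
    word, _, sense = key.rpartition("|")
    candidates = (make_key(word, s) for s in senses if s != sense)
    return [candidate for candidate in candidates if candidate in freqs]


# Made-up frequency table for illustration.
toy_freqs = {"duck|NOUN": 50, "duck|VERB": 5, "machine_learning|NOUN": 30}
toy_senses = ["NOUN", "VERB", "ADJ"]

print(best_sense(toy_freqs, "duck", toy_senses))
print(other_senses(toy_freqs, "duck|NOUN", toy_senses))
```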
⚠️ Backwards incompatibilities
- The `sense2vec.load` method has been removed. Use `Sense2Vec.from_disk` instead.
- The previous `VectorMap` and `VectorStorage` have been removed.
- This package now requires Python 3.6+.
- This update requires a new vectors format (see attached `.tar.gz`).
📖 Documentation and examples
- Rewrite README from scratch and include full API docs.
👥 Contributors
Thanks to @kabirkhan for contributing the Prodigy recipes!
- Python
Published by ines over 6 years ago
sense2vec - v1.0.0a1: Update sense2vec for spaCy v2.1.x or standalone use
⚠️ This is an alpha release and not yet ready for production. You can download sense2vec via pip by specifying the exact version.
```bash
pip install sense2vec==1.0.0a1
```

Note that the library doesn't depend on spaCy anymore, so you might have to install spaCy and the English model separately. The Reddit vectors (trained on all comments of 2015) are attached to this release as a `.tar.gz` file. For more details and usage instructions, see the README.
✨ New features and improvements
- NEW: Remove spaCy dependency and allow standalone use of the `sense2vec` library.
- NEW: Include spaCy v2.x pipeline component to add sense2vec-compatible token merging and token attributes and methods.
- Attach `reddit_vectors` model to release and make it easier to download and load in models.
📖 Documentation and examples
- Rewrite README from scratch and include full API docs.
🚧 Todo
- [ ] Replace `VectorMap` implementation with spaCy's `Vectors` class.
- [ ] Don't merge tokens at runtime and adjust extension attributes accordingly.
- [ ] Update training and pre-processing scripts for spaCy v2.x.
- [ ] Retrain vectors on more data.
- Python
Published by ines over 6 years ago
sense2vec - v1.0.0a0: Update sense2vec for spaCy v2.x or standalone use
⚠️ This is an alpha release and not yet ready for production. You can download sense2vec via pip by specifying the exact version.
```bash
pip install sense2vec==1.0.0a0
```

Note that the library doesn't depend on spaCy anymore, so you might have to install spaCy and the English model separately. The Reddit vectors (trained on all comments of 2015) are attached to this release as a `.tar.gz` file. For more details and usage instructions, see the README.
✨ New features and improvements
- NEW: Remove spaCy dependency and allow standalone use of the `sense2vec` library.
- NEW: Include spaCy v2.x pipeline component to add sense2vec-compatible token merging and token attributes and methods.
- Attach `reddit_vectors` model to release and make it easier to download and load in models.
📖 Documentation and examples
- Rewrite README from scratch and include full API docs.
🚧 Todo
- [ ] Update training and pre-processing scripts for spaCy v2.x.
- Python
Published by ines almost 8 years ago