spacy-iwnlp
German lemmatization with IWNLP as extension for spaCy
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.5%) to scientific vocabulary
Keywords
Repository
German lemmatization with IWNLP as extension for spaCy
Basic Info
Statistics
- Stars: 24
- Watchers: 4
- Forks: 2
- Open Issues: 1
- Releases: 0
Topics
Metadata Files
README.md
spacy-iwnlp
This package uses the spaCy 3.0 extensions to add IWNLP-py as German lemmatizer directly into your spaCy pipeline.
Please report bugs with spacy-iwnlp as issue in IWNLP-py.
Usage
python
import spacy
from spacy_iwnlp import spaCyIWNLP
nlp = spacy.load('de_core_news_sm')
nlp.add_pipe('iwnlp', config={'lemmatizer_path': 'data/IWNLP.Lemmatizer_20181001.json'})
doc = nlp('Wir mögen Fußballspiele mit ausgedehnten Verlängerungen.')
for token in doc:
print('POS: {}\tIWNLP:{}'.format(token.pos_, token._.iwnlp_lemmas))
Installation
- Use pip to install spacy-iwnlp
bash pip install spacy-iwnlp - Download the latest processed IWNLP dump from https://dbs.cs.uni-duesseldorf.de/datasets/iwnlp/IWNLP.Lemmatizer_20181001.zip and unzip it.
Local development
Use develop.py to extend the functionality
Update PIP package
python setup.py sdist bdist_wheel
python -m twine upload dist/PACKAGENAME-VERSION.tar.gz
Citation
Please include the following BibTeX if you use IWNLP in your work:
bash
@InProceedings{liebeck-conrad:2015:ACL-IJCNLP,
author = {Liebeck, Matthias and Conrad, Stefan},
title = {{IWNLP: Inverse Wiktionary for Natural Language Processing}},
booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)},
year = {2015},
publisher = {Association for Computational Linguistics},
pages = {414--418},
url = {http://www.aclweb.org/anthology/P15-2068}
}
Owner
- Name: Matthias Liebeck
- Login: Liebeck
- Kind: user
- Location: Düsseldorf, Germany
- Company: Heinrich Heine University Düsseldorf
- Website: https://www.liebeck.io
- Repositories: 18
- Profile: https://github.com/Liebeck
PhD in computer science. My research is currently focused on mining and analyzing text data.
Citation (CITATION.cff)
@InProceedings{liebeck-conrad:2015:ACL-IJCNLP,
author = {Liebeck, Matthias and Conrad, Stefan},
title = {{IWNLP: Inverse Wiktionary for Natural Language Processing}},
booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)},
year = {2015},
publisher = {Association for Computational Linguistics},
pages = {414--418},
url = {http://www.aclweb.org/anthology/P15-2068}
}
GitHub Events
Total
- Watch event: 1
Last Year
- Watch event: 1
Committers
Last synced: over 1 year ago
Top Committers
| Name | Commits | |
|---|---|---|
| Matthias Liebeck | L****k | 34 |
| Johann Petrak | j****k@g****m | 1 |
Issues and Pull Requests
Last synced: 8 months ago
All Time
- Total issues: 5
- Total pull requests: 1
- Average time to close issues: 12 months
- Average time to close pull requests: about 3 years
- Total issue authors: 5
- Total pull request authors: 1
- Average comments per issue: 1.6
- Average comments per pull request: 2.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- Frobless (1)
- sonja-brehm (1)
- kylefoley76 (1)
- alihashaam (1)
- johann-petrak (1)
Pull Request Authors
- johann-petrak (1)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- pypi 92 last-month
- Total dependent packages: 0
- Total dependent repositories: 6
- Total versions: 3
- Total maintainers: 1
pypi.org: spacy-iwnlp
Integration of IWNLP-py as spaCy extension
- Homepage: https://github.com/Liebeck/spacy-iwnlp
- Documentation: https://spacy-iwnlp.readthedocs.io/
- License: mit
-
Latest release: 3.0.0
published over 3 years ago
Rankings
Maintainers (1)
Dependencies
- iwnlp >=0.1.7
- pytest ==3.0.6
- pytest-pep8 ==1.0.6
- iwnlp >=1.0.7
- spacy >=2.0.0
- iwnlp >=0.1.7
- spacy >=2.0.0,<3.0.0