spacy-iwnlp

German lemmatization with IWNLP as extension for spaCy

https://github.com/liebeck/spacy-iwnlp

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.5%) to scientific vocabulary

Keywords

nlp spacy spacy-extension spacy-pipeline
Last synced: 7 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: Liebeck
  • License: mit
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 19.5 KB
Statistics
  • Stars: 24
  • Watchers: 4
  • Forks: 2
  • Open Issues: 1
  • Releases: 0
Topics
nlp spacy spacy-extension spacy-pipeline
Created about 8 years ago · Last pushed over 2 years ago
Metadata Files
Readme License Citation

README.md

spacy-iwnlp

This package uses the spaCy 3.0 extension mechanism to add IWNLP-py as a German lemmatizer directly to your spaCy pipeline.

Please report bugs in spacy-iwnlp as issues in the IWNLP-py repository.

Usage

```python
import spacy
from spacy_iwnlp import spaCyIWNLP

# Load a German spaCy model and append the IWNLP component to the pipeline
nlp = spacy.load('de_core_news_sm')
nlp.add_pipe('iwnlp', config={'lemmatizer_path': 'data/IWNLP.Lemmatizer_20181001.json'})

doc = nlp('Wir mögen Fußballspiele mit ausgedehnten Verlängerungen.')
for token in doc:
    # The IWNLP lemmas are exposed through the custom token attribute
    print('POS: {}\tIWNLP:{}'.format(token.pos_, token._.iwnlp_lemmas))
```
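Downstream code often needs a single lemma per token, while `token._.iwnlp_lemmas` may hold several candidates or none. The sketch below shows one possible way to collapse it. It assumes the attribute is either a list of strings or `None`; `pick_lemma` is a hypothetical helper, not part of spacy-iwnlp, and the sample inputs are made up for illustration.

```python
from typing import Optional, Sequence


def pick_lemma(iwnlp_lemmas: Optional[Sequence[str]], fallback: str) -> str:
    """Return the first IWNLP candidate, or the fallback if there is none."""
    if iwnlp_lemmas:
        return iwnlp_lemmas[0]
    return fallback


# With a spaCy doc this could be applied as (not executed here):
#   lemmas = [pick_lemma(t._.iwnlp_lemmas, t.lemma_) for t in doc]

# Illustrative inputs only; these are not real IWNLP output.
print(pick_lemma(['Fußballspiel'], 'Fußballspiele'))
print(pick_lemma(None, 'mit'))
```

Taking the first candidate is an arbitrary choice; keeping the full list may be preferable when ambiguity matters.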

Installation

  1. Use pip to install spacy-iwnlp:

     ```bash
     pip install spacy-iwnlp
     ```

  2. Download the latest processed IWNLP dump from https://dbs.cs.uni-duesseldorf.de/datasets/iwnlp/IWNLP.Lemmatizer_20181001.zip and unzip it.
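The download and unzip step can also be scripted. The following is a minimal sketch using only the standard library. The helper names `download_dump` and `extract_dump` and the `data` target directory are assumptions for illustration, and the sketch assumes the archive contains the JSON dump that `lemmatizer_path` points to.

```python
import zipfile
from pathlib import Path
from urllib.request import urlretrieve

# URL taken from the installation instructions above
DUMP_URL = 'https://dbs.cs.uni-duesseldorf.de/datasets/iwnlp/IWNLP.Lemmatizer_20181001.zip'


def download_dump(target_dir: str = 'data') -> Path:
    """Download the zipped dump into target_dir and return the archive path."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    archive = target / DUMP_URL.rsplit('/', 1)[-1]
    urlretrieve(DUMP_URL, str(archive))
    return archive


def extract_dump(archive: Path, target_dir: str = 'data') -> list:
    """Unzip the archive into target_dir and return the extracted JSON paths."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(target)
        names = zf.namelist()
    return [target / name for name in names if name.endswith('.json')]
```

Calling `extract_dump(download_dump())` would then yield the path to pass as `lemmatizer_path` in the pipeline config.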

Local development

Use develop.py to extend the functionality.

Update PIP package

```bash
python setup.py sdist bdist_wheel
python -m twine upload dist/PACKAGENAME-VERSION.tar.gz
```

Citation

Please include the following BibTeX if you use IWNLP in your work:

```bibtex
@InProceedings{liebeck-conrad:2015:ACL-IJCNLP,
  author    = {Liebeck, Matthias and Conrad, Stefan},
  title     = {{IWNLP: Inverse Wiktionary for Natural Language Processing}},
  booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)},
  year      = {2015},
  publisher = {Association for Computational Linguistics},
  pages     = {414--418},
  url       = {http://www.aclweb.org/anthology/P15-2068}
}
```

Owner

  • Name: Matthias Liebeck
  • Login: Liebeck
  • Kind: user
  • Location: Düsseldorf, Germany
  • Company: Heinrich Heine University Düsseldorf

PhD in computer science. My research is currently focused on mining and analyzing text data.

Citation (CITATION.cff)

@InProceedings{liebeck-conrad:2015:ACL-IJCNLP,
  author    = {Liebeck, Matthias  and  Conrad, Stefan},
  title     = {{IWNLP: Inverse Wiktionary for Natural Language Processing}},
  booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)},
  year      = {2015},
  publisher = {Association for Computational Linguistics},
  pages     = {414--418},
  url       = {http://www.aclweb.org/anthology/P15-2068}
}

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1

Committers

Last synced: over 1 year ago

All Time
  • Total Commits: 35
  • Total Committers: 2
  • Avg Commits per committer: 17.5
  • Development Distribution Score (DDS): 0.029
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
| Name | Email | Commits |
| --- | --- | --- |
| Matthias Liebeck | L****k | 34 |
| Johann Petrak | j****k@g****m | 1 |
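The committer statistics above can be reproduced from the per-committer commit counts. The page does not state how the Development Distribution Score is computed; the sketch below assumes the common definition of one minus the top committer's share of all commits, which agrees with the figures shown. The function name is hypothetical.

```python
def development_distribution_score(commits_per_committer):
    """Assumed definition: 1 - (commits by top committer / total commits)."""
    total = sum(commits_per_committer)
    if total == 0:
        # No commits in the period: report 0.0, as in the "Past Year" block
        return 0.0
    return 1 - max(commits_per_committer) / total


commits = [34, 1]  # commit counts from the Top Committers list
print(round(development_distribution_score(commits), 3))  # 0.029
print(sum(commits) / len(commits))  # 17.5 average commits per committer
```

A score close to 0 indicates that nearly all commits come from a single committer.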

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 5
  • Total pull requests: 1
  • Average time to close issues: 12 months
  • Average time to close pull requests: about 3 years
  • Total issue authors: 5
  • Total pull request authors: 1
  • Average comments per issue: 1.6
  • Average comments per pull request: 2.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • Frobless (1)
  • sonja-brehm (1)
  • kylefoley76 (1)
  • alihashaam (1)
  • johann-petrak (1)
Pull Request Authors
  • johann-petrak (1)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 92 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 6
  • Total versions: 3
  • Total maintainers: 1
pypi.org: spacy-iwnlp

Integration of IWNLP-py as spaCy extension

  • Versions: 3
  • Dependent Packages: 0
  • Dependent Repositories: 6
  • Downloads: 92 Last month
Rankings
Dependent repos count: 6.0%
Dependent packages count: 10.1%
Stargazers count: 12.6%
Average: 14.3%
Forks count: 19.1%
Downloads: 23.6%
Maintainers (1)
Last synced: 8 months ago

Dependencies

.travis/requirements.txt pypi
  • iwnlp >=0.1.7
  • pytest ==3.0.6
  • pytest-pep8 ==1.0.6
requirements.txt pypi
  • iwnlp >=1.0.7
  • spacy >=2.0.0
setup.py pypi
  • iwnlp >=0.1.7
  • spacy >=2.0.0,<3.0.0