iwnlp

IWNLP: A parser for the German edition of Wiktionary

https://github.com/liebeck/iwnlp

Science Score: 31.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.0%) to scientific vocabulary
Last synced: 10 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: Liebeck
  • License: other
  • Language: C#
  • Default Branch: master
  • Homepage: http://www.iwnlp.com/
  • Size: 767 KB
Statistics
  • Stars: 13
  • Watchers: 3
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created about 11 years ago · Last pushed almost 3 years ago
Metadata Files
  • Readme
  • License
  • Citation

README.md

IWNLP

IWNLP is a dictionary-based lemmatizer for the German language. It is based on the German edition of Wiktionary. IWNLP consists of two parts:

  • IWNLP: A parser for the German edition of Wiktionary
  • IWNLP.Lemmatizer: A German lemmatizer that uses the output from IWNLP to produce a mapping from an inflected form to a lemma

More details can be found at www.iwnlp.com
We also provide a Python implementation for the lemmatizer: IWNLP-py
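
To illustrate what "a mapping from an inflected form to a lemma" means, here is a minimal Python sketch of a dictionary-based lookup. It is not the IWNLP.Lemmatizer or IWNLP-py code: the function names `build_mapping` and `lemmatize`, the toy entries, and the case-insensitive lookup are all assumptions made for illustration.

```python
from collections import defaultdict


def build_mapping(entries):
    """Invert (lemma, inflected forms) pairs into a form -> lemmas lookup table."""
    mapping = defaultdict(set)
    for lemma, forms in entries:
        # The lemma itself is also a valid surface form.
        mapping[lemma.lower()].add(lemma)
        for form in forms:
            # Assumption: keys are lowercased so lookups ignore case.
            mapping[form.lower()].add(lemma)
    # Sort the candidates so the result is deterministic.
    return {form: sorted(lemmas) for form, lemmas in mapping.items()}


def lemmatize(mapping, token):
    """Return all candidate lemmas for a token, or an empty list if unknown."""
    return mapping.get(token.lower(), [])


# Hypothetical toy data; a real mapping would be derived from the parser output.
TOY_ENTRIES = [
    ("Haus", ["Hauses", "Häuser", "Häusern"]),
    ("gehen", ["gehe", "gehst", "geht", "ging", "gegangen"]),
]
```

With this toy data, `lemmatize(build_mapping(TOY_ENTRIES), "Häuser")` returns `["Haus"]`. The lookup returns a list because one inflected form can belong to more than one lemma, and a form that is missing from the dictionary simply yields no candidates.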

How to run IWNLP

  • Clone the project and build it
  • Download https://dumps.wikimedia.org/dewiktionary/latest/dewiktionary-latest-pages-articles.xml.bz2
  • Unpack dewiktionary-latest-pages-articles.xml.bz2
  • Start IWNLP.Parser.exe with two parameters: the path to the unpacked file and the path to the export file. For instance:
      IWNLP.Parser.exe "c:\\dewiktionary-latest-pages-articles.xml" "c:\\parsedIWNLP_latest.xml"
  • Follow the steps from IWNLP.Lemmatizer to create the lemmatization mapping
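
The download and unpack steps above can also be scripted. The following Python sketch is one possible way to do it using only the standard library. The helper names `download` and `unpack_bz2` and the chunk size are assumptions for illustration and are not part of IWNLP. Only the dump URL is taken from the steps above.

```python
import bz2
import shutil
import urllib.request

# URL of the latest German Wiktionary dump, as given in the steps above.
DUMP_URL = (
    "https://dumps.wikimedia.org/dewiktionary/latest/"
    "dewiktionary-latest-pages-articles.xml.bz2"
)


def download(url, target_path):
    """Fetch the compressed dump and store it at target_path."""
    urllib.request.urlretrieve(url, target_path)
    return target_path


def unpack_bz2(archive_path, output_path, chunk_size=1024 * 1024):
    """Decompress a .bz2 archive in chunks so the dump never sits fully in memory."""
    with bz2.open(archive_path, "rb") as src, open(output_path, "wb") as dst:
        shutil.copyfileobj(src, dst, chunk_size)
    return output_path
```

Calling `download(DUMP_URL, "dewiktionary-latest-pages-articles.xml.bz2")` followed by `unpack_bz2("dewiktionary-latest-pages-articles.xml.bz2", "dewiktionary-latest-pages-articles.xml")` would produce the XML file whose path is passed as the first parameter to IWNLP.Parser.exe.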

Citation

Please include the following BibTeX if you use IWNLP in your work:

@InProceedings{liebeck-conrad:2015:ACL-IJCNLP,
  author    = {Liebeck, Matthias and Conrad, Stefan},
  title     = {{IWNLP: Inverse Wiktionary for Natural Language Processing}},
  booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)},
  year      = {2015},
  publisher = {Association for Computational Linguistics},
  pages     = {414--418},
  url       = {http://www.aclweb.org/anthology/P15-2068}
}

Owner

  • Name: Matthias Liebeck
  • Login: Liebeck
  • Kind: user
  • Location: Düsseldorf, Germany
  • Company: Heinrich Heine University Düsseldorf

PhD in computer science. My research is currently focused on mining and analyzing text data.

Citation (CITATION.cff)

@InProceedings{liebeck-conrad:2015:ACL-IJCNLP,
  author    = {Liebeck, Matthias  and  Conrad, Stefan},
  title     = {{IWNLP: Inverse Wiktionary for Natural Language Processing}},
  booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)},
  year      = {2015},
  publisher = {Association for Computational Linguistics},
  pages     = {414--418},
  url       = {http://www.aclweb.org/anthology/P15-2068}
}

Issues and Pull Requests

All Time
  • Total issues: 28
  • Total pull requests: 3
  • Average time to close issues: 25 days
  • Average time to close pull requests: less than a minute
  • Total issue authors: 2
  • Total pull request authors: 1
  • Average comments per issue: 0.11
  • Average comments per pull request: 0.0
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • Liebeck (23)
  • p-sodmann (1)
Pull Request Authors
  • Liebeck (2)
Top Labels
Issue Labels
  • wontfix (1)
Pull Request Labels