Science Score: 31.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ○ .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (9.0%) to scientific vocabulary
Repository
IWNLP: A parser for the German edition of Wiktionary
Basic Info
- Host: GitHub
- Owner: Liebeck
- License: other
- Language: C#
- Default Branch: master
- Homepage: http://www.iwnlp.com/
- Size: 767 KB
Statistics
- Stars: 13
- Watchers: 3
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
IWNLP
IWNLP is a dictionary-based lemmatizer for the German language. It is based on the German edition of Wiktionary. IWNLP consists of two parts:
- IWNLP: A parser for the German edition of Wiktionary
- IWNLP.Lemmatizer: A German lemmatizer that uses the output from IWNLP to produce a mapping from an inflected form to a lemma.
More details can be found at www.iwnlp.com
We also provide a Python implementation for the lemmatizer: IWNLP-py
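To illustrate what a mapping from an inflected form to a lemma means in practice, here is a minimal Python sketch of dictionary-based lemmatization. It is not the API of IWNLP.Lemmatizer or IWNLP-py. The function names `build_mapping` and `lemmatize`, the sample entries, and the choice to lowercase lookup keys are all assumptions made for illustration.

```python
from collections import defaultdict


def build_mapping(entries):
    """Build an inflected-form -> set-of-lemmas mapping from (lemma, forms) pairs."""
    mapping = defaultdict(set)
    for lemma, forms in entries:
        # The lemma itself is also a valid surface form.
        for form in [lemma, *forms]:
            # Assumption: keys are lowercased so lookups ignore case.
            mapping[form.lower()].add(lemma)
    return mapping


def lemmatize(mapping, token):
    """Return the sorted candidate lemmas for a token, or [] if it is unknown."""
    return sorted(mapping.get(token.lower(), ()))


# Hypothetical sample data; a real mapping would come from the parsed Wiktionary dump.
entries = [
    ("Haus", ["Hauses", "Häuser", "Häusern"]),
    ("gehen", ["gehe", "gehst", "geht", "ging", "gegangen"]),
]
mapping = build_mapping(entries)

print(lemmatize(mapping, "Häusern"))
print(lemmatize(mapping, "ging"))
print(lemmatize(mapping, "Baum"))
```

A set of lemmas per form is used because one inflected form can belong to more than one lemma. A form that is missing from the dictionary returns an empty list rather than a guess.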
How to run IWNLP
- Clone the project and build it
- Download https://dumps.wikimedia.org/dewiktionary/latest/dewiktionary-latest-pages-articles.xml.bz2
- Unpack dewiktionary-latest-pages-articles.xml.bz2
- Run IWNLP.Parser.exe with two parameters: the path to the unpacked file and the path to the export file. For instance:
    IWNLP.Parser.exe "c:\\dewiktionary-latest-pages-articles.xml" "c:\\parsedIWNLP_latest.xml"
- Follow the steps from IWNLP.Lemmatizer to create the lemmatization mapping
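The download, unpack, and parse steps above could be scripted roughly as follows. This is a sketch under assumptions, not part of IWNLP. The helper names `unpack_bz2`, `parser_command`, and `run_pipeline` are hypothetical, and the location of the built `IWNLP.Parser.exe` depends on your build output. Only the dump URL and the two-argument call come from the steps above.

```python
import bz2
import shutil
import subprocess
import urllib.request
from pathlib import Path

# Dump URL taken from the steps above.
DUMP_URL = (
    "https://dumps.wikimedia.org/dewiktionary/latest/"
    "dewiktionary-latest-pages-articles.xml.bz2"
)


def unpack_bz2(archive: Path, target: Path) -> Path:
    """Stream-decompress a .bz2 archive so the dump is never held fully in memory."""
    with bz2.open(archive, "rb") as src, open(target, "wb") as dst:
        shutil.copyfileobj(src, dst)
    return target


def parser_command(parser_exe, dump_xml, export_xml):
    """Build the argument list: executable, unpacked dump, export file."""
    return [str(parser_exe), str(dump_xml), str(export_xml)]


def run_pipeline(work_dir: Path, parser_exe: Path) -> None:
    """Download the dump, unpack it, and run the parser (hypothetical helper)."""
    archive = work_dir / "dewiktionary-latest-pages-articles.xml.bz2"
    dump_xml = work_dir / "dewiktionary-latest-pages-articles.xml"
    export_xml = work_dir / "parsedIWNLP_latest.xml"
    urllib.request.urlretrieve(DUMP_URL, archive)
    unpack_bz2(archive, dump_xml)
    # check=True raises CalledProcessError if the parser exits with an error.
    subprocess.run(parser_command(parser_exe, dump_xml, export_xml), check=True)
```

Calling `run_pipeline(Path("c:/iwnlp"), Path("IWNLP.Parser.exe"))` would run all three steps, assuming the directory exists and the executable has already been built. Passing the arguments as a list avoids shell quoting issues with paths that contain spaces.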
Citation
Please include the following BibTeX if you use IWNLP in your work:
@InProceedings{liebeck-conrad:2015:ACL-IJCNLP,
author = {Liebeck, Matthias and Conrad, Stefan},
title = {{IWNLP: Inverse Wiktionary for Natural Language Processing}},
booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)},
year = {2015},
publisher = {Association for Computational Linguistics},
pages = {414--418},
url = {http://www.aclweb.org/anthology/P15-2068}
}
Owner
- Name: Matthias Liebeck
- Login: Liebeck
- Kind: user
- Location: Düsseldorf, Germany
- Company: Heinrich Heine University Düsseldorf
- Website: https://www.liebeck.io
- Repositories: 18
- Profile: https://github.com/Liebeck
PhD in computer science. My research is currently focused on mining and analyzing text data.
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 28
- Total pull requests: 3
- Average time to close issues: 25 days
- Average time to close pull requests: less than a minute
- Total issue authors: 2
- Total pull request authors: 1
- Average comments per issue: 0.11
- Average comments per pull request: 0.0
- Merged pull requests: 3
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- Liebeck (23)
- p-sodmann (1)
Pull Request Authors
- Liebeck (2)