https://github.com/dcavar/thec_rus
The Hoosier Ellipsis Corpus (THEC) - Russian Sub-corpus (thec_rus)
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
✓Committers with academic emails
1 of 2 committers (50.0%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (7.7%) to scientific vocabulary
Repository
The Hoosier Ellipsis Corpus (THEC) - Russian Sub-corpus (thec_rus)
Basic Info
- Host: GitHub
- Owner: dcavar
- License: apache-2.0
- Language: Jupyter Notebook
- Default Branch: main
- Size: 275 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
The Hoosier Ellipsis Corpus (THEC) - Russian Sub-corpus (thec_rus)
(C) 2024 NLP-Lab
Created: 2024-04-10 by Damir Cavar
Last change: 2024-05-14 by Damir Cavar
More details about the Hoosier Ellipsis Corpus can be found on the NLP-Lab pages. The GitHub repo contains links to other languages and useful code and scripts for data processing.
This repo contains the Russian Ellipsis Sub-corpus of THEC.
Consult the data format specification for details about the structure of the files and the annotation standard used.
Maintainer
- Van Holthenrichs
- Damir Cavar
Citation
Please use the following snippet to cite our work.
Cavar, Damir and V. Holthenrichs (2024) NLP Corpus of Ellipsis: Modeling Ellipsis in Slavic. Paper presented at the Formal Approaches to Slavic Linguistics (FASL) 33. Halifax, Canada.
and
```bibtex @misc{cavar-holthenrichs-2024, author = {Cavar, Damir and Van Holthenrichs}. year = {2024}, title = {NLP Corpus of Ellipsis: Modeling Ellipsis in Slavic}, note = {Paper presented at the Formal Approaches to Slavic Linguistics (FASL) 33}, address = {Halifax, Canada} }
@inproceedings{cavar-etal-2024-typology, title = "The Typology of Ellipsis: A Corpus for Linguistic Analysis and Machine Learning Applications", author = "Cavar, Damir and Mompelat, Ludovic and Abdo, Muhammad", editor = "Hahn, Michael and Sorokin, Alexey and Kumar, Ritesh and Shcherbakov, Andreas and Otmakhova, Yulia and Yang, Jinrui and Serikov, Oleg and Rani, Priya and Ponti, Edoardo M. and Murado{\u{g}}lu, Saliha and Gao, Rena and Cotterell, Ryan and Vylomova, Ekaterina", booktitle = "Proceedings of the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP", month = mar, year = "2024", address = "St. Julian's, Malta", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2024.sigtyp-1.6", pages = "46--54" }
@inproceedings{cavar-atal-2004-computing, author = "Cavar, Damir and Zoran Tiganj and Ludovic Mompelat and Billy Dickson", title = {Computing Ellipsis Constructions: Comparing Classical {NLP} and {LLM} Approaches}, booktitle = {2024 Meeting of the Society for Computation in Linguistics (SCiL)}, month = may, year = {2024}, address = {}, publisher = {}, url = {}, pages = "--" } ```
Owner
- Name: Damir Cavar
- Login: dcavar
- Kind: user
- Location: Bloomington, IN
- Company: Indiana University
- Website: http://damir.cavar.me/
- Repositories: 29
- Profile: https://github.com/dcavar
GitHub Events
Total
- Member event: 1
Last Year
- Member event: 1
Committers
Last synced: 9 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| Van Holt | v****h@i****u | 45 |
| Damir Cavar | d****r@m****m | 17 |
Issues and Pull Requests
Last synced: 9 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0