Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (7.1%) to scientific vocabulary
Repository
The Hoosier Ellipsis Corpus (THEC) - Arabic Sub-corpus
Basic Info
- Host: GitHub
- Owner: dcavar
- License: apache-2.0
- Language: Jupyter Notebook
- Default Branch: main
- Size: 313 KB
Statistics
- Stars: 0
- Watchers: 3
- Forks: 1
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
The Hoosier Ellipsis Corpus (THEC) - Arabic Sub-corpus (thec_ara)
(C) 2024 NLP-Lab
More details about the Hoosier Ellipsis Corpus can be found on the NLP-Lab pages. The GitHub repo contains links to other languages and useful code and scripts for data processing.
This repo contains the Arabic Ellipsis Sub-corpus of THEC.
Consult the data format specification for details about the structure of the files and the annotation standard used.
Maintainer
- Muhammed S. Abdo
- Damir Cavar
Citation
Please use the following snippet to cite our work.
```bibtex @inproceedings{cavar-etal-2024-typology, title = "The Typology of Ellipsis: A Corpus for Linguistic Analysis and Machine Learning Applications", author = "Cavar, Damir and Mompelat, Ludovic and Abdo, Muhammad", editor = "Hahn, Michael and Sorokin, Alexey and Kumar, Ritesh and Shcherbakov, Andreas and Otmakhova, Yulia and Yang, Jinrui and Serikov, Oleg and Rani, Priya and Ponti, Edoardo M. and Murado{\u{g}}lu, Saliha and Gao, Rena and Cotterell, Ryan and Vylomova, Ekaterina", booktitle = "Proceedings of the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP", month = mar, year = "2024", address = "St. Julian's, Malta", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2024.sigtyp-1.6", pages = "46--54" }
@inproceedings{cavar-atal-2004-computing, author = "Cavar, Damir and Zoran Tiganj and Ludovic Mompelat and Billy Dickson", title = {Computing Ellipsis Constructions: Comparing Classical {NLP} and {LLM} Approaches}, booktitle = {2024 Meeting of the Society for Computation in Linguistics (SCiL)}, month = may, year = {2024}, address = {}, publisher = {}, url = {}, pages = "--" } ```
Owner
- Name: Damir Cavar
- Login: dcavar
- Kind: user
- Location: Bloomington, IN
- Company: Indiana University
- Website: http://damir.cavar.me/
- Repositories: 29
- Profile: https://github.com/dcavar
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Cavar" given-names: "Damir" orcid: "https://orcid.org/0000-0002-1262-5927" - family-names: "Mompelat" given-names: "Ludovic Veta" - family-names: "Abdo" given-names: "Muhammed S" title: "The Typology of Ellipsis: A Corpus for Linguistic Analysis and Machine Learning Applications" version: 2.0.4 date-released: 2024-03-22 url: "https://github.com/dcavar/hoosierellipsiscorpus"
GitHub Events
Total
- Push event: 16
- Fork event: 1
Last Year
- Push event: 16
- Fork event: 1
Committers
Last synced: 9 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| Muhsabrys | M****s@o****m | 41 |
| Damir Cavar | d****r@m****m | 13 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0