northperulex
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file (found)
- ✓ .zenodo.json file (found)
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (7.7%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: lexibank
- License: cc-by-4.0
- Language: Python
- Default Branch: main
- Size: 18.6 MB
Statistics
- Stars: 0
- Watchers: 4
- Forks: 0
- Open Issues: 3
- Releases: 2
Metadata Files
README.md
CLDF dataset derived from Ugarte et al.'s "NorthPeruLex - A Lexical Dataset of Small Language Families and Isolates from Northern Peru" (forthcoming).
How to cite
If you use these data, please cite
- the original source
  > Ugarte, Carlos and Blum, Frederic and Ingunza, Adriano and Gonzales, Rosa and Peña, Jaime. Forthcoming. NorthPeruLex - A Lexical Dataset of Small Language Families and Isolates from Northern Peru.
- the derived dataset using the DOI of the particular released version you were using
Description
This dataset brings together lexical data from isolates and small language families of northern Peru in order to investigate their historical relations.
This dataset is licensed under a CC-BY-4.0 license.
Conceptlists in Concepticon:
- Swadesh-1952-200
Notes
work in progress
Statistics
- Varieties: 35 (linked to 34 different Glottocodes)
- Concepts: 200 (linked to 200 different Concepticon concept sets)
- Lexemes: 4,804
- Sources: 16
- Synonymy: 1.11
- Cognacy: 4,804 cognates in 3,282 cognate sets (2,554 singletons)
- Cognate Diversity: 0.67
- Invalid lexemes: 0
- Tokens: 27,168
- Segments: 130 (1 BIPA error, 1 CLTS sound class error, 129 CLTS modified)
- Inventory size (avg): 26.31
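The cognate diversity figure can be related to the other counts in this list. The sketch below assumes the diversity measure is defined as (cognate sets - concepts) / (cognates - concepts), which is the definition used in LingPy-style wordlist statistics; the README itself does not state the formula, so treat it as an assumption that happens to reproduce the reported value. The function names are illustrative only.

```python
def cognate_diversity(cognates: int, cognate_sets: int, concepts: int) -> float:
    """Assumed definition: 0 when every concept has a single cognate set,
    1 when every cognate forms its own set."""
    return (cognate_sets - concepts) / (cognates - concepts)


def singleton_share(singletons: int, cognate_sets: int) -> float:
    """Fraction of cognate sets that contain exactly one form."""
    return singletons / cognate_sets


# Counts taken from the statistics list above.
diversity = cognate_diversity(cognates=4804, cognate_sets=3282, concepts=200)
share = singleton_share(singletons=2554, cognate_sets=3282)

print(round(diversity, 2))  # 0.67, matching the reported Cognate Diversity
print(round(share, 3))      # roughly 0.778 of all cognate sets are singletons
```

Under this reading, a value of 0.67 together with a high singleton share reflects that most forms are not yet grouped with forms from other varieties, which is expected for a dataset made up largely of isolates and small families.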
Contributors
Name | GitHub user | Description | Role
--- | --- | --- | ---
Carlos Ugarte | @MuffinLinwist | Data collector, CLDF conversion and annotation | Author, Editor
Frederic Blum | @FredericBlum | CLDF conversion and annotation | Author, Editor
Adriano Ingunza | @BadBatched | Data collector and annotation | Author
Rosa Gonzales | @rosalgm | Data collector and annotation | Author
Jaime Peña | @JaimePenat | Data collector and annotation | Author
CLDF Datasets
The following CLDF datasets are available in cldf:
- CLDF Wordlist at cldf/cldf-metadata.json
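As a rough illustration of how the wordlist could be inspected, the sketch below reads the metadata file, locates the table declared as the CLDF `FormTable`, and counts forms per variety. It uses only the standard library and assumes the default CLDF layout (a comma-separated, UTF-8 form table with a `Language_ID` column); the helper names are hypothetical. For real work, `pycldf` (listed in the dependencies) handles CSV dialects and foreign keys properly, for example via `pycldf.Dataset.from_metadata`.

```python
import csv
import json
from collections import Counter
from pathlib import Path

# Term URI that the CLDF specification uses to mark the form table.
FORM_TABLE = "http://cldf.clld.org/v1.0/terms.rdf#FormTable"


def form_table_url(metadata: dict) -> str:
    """Return the relative URL of the table declared as the CLDF FormTable."""
    for table in metadata.get("tables", []):
        if table.get("dc:conformsTo") == FORM_TABLE:
            return table["url"]
    raise ValueError("metadata does not declare a FormTable")


def forms_per_language(rows) -> Counter:
    """Count rows per variety, assuming the default 'Language_ID' column."""
    return Counter(row["Language_ID"] for row in rows)


def summarize(metadata_path="cldf/cldf-metadata.json") -> Counter:
    """Load the metadata, resolve the form table next to it, and count forms."""
    metadata_path = Path(metadata_path)
    metadata = json.loads(metadata_path.read_text(encoding="utf-8"))
    forms_path = metadata_path.parent / form_table_url(metadata)
    with forms_path.open(encoding="utf-8", newline="") as handle:
        return forms_per_language(csv.DictReader(handle))
```

Run from a clone of the repository, `summarize().most_common(5)` would list the five varieties with the most forms, provided the layout assumptions above hold.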
Owner
- Name: lexibank
- Login: lexibank
- Kind: organization
- Website: http://glottobank.org
- Repositories: 76
- Profile: https://github.com/lexibank
GitHub Events
Total
- Create event: 6
- Release event: 1
- Issues event: 15
- Watch event: 1
- Delete event: 7
- Issue comment event: 46
- Push event: 55
- Pull request review event: 2
- Pull request review comment event: 2
- Pull request event: 10
Last Year
- Create event: 6
- Release event: 1
- Issues event: 15
- Watch event: 1
- Delete event: 7
- Issue comment event: 46
- Push event: 55
- Pull request review event: 2
- Pull request review comment event: 2
- Pull request event: 10
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 7
- Total pull requests: 6
- Average time to close issues: about 1 month
- Average time to close pull requests: 28 days
- Total issue authors: 3
- Total pull request authors: 2
- Average comments per issue: 2.71
- Average comments per pull request: 1.5
- Merged pull requests: 4
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 6
- Pull requests: 6
- Average time to close issues: 16 days
- Average time to close pull requests: 28 days
- Issue authors: 3
- Pull request authors: 2
- Average comments per issue: 3.0
- Average comments per pull request: 1.5
- Merged pull requests: 4
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- MuffinLinwist (4)
- LinguList (2)
- FredericBlum (2)
Pull Request Authors
- MuffinLinwist (4)
- FredericBlum (2)
Dependencies
- actions/checkout v2 composite
- actions/setup-python v2 composite
- Babel ==2.14.0
- Cartopy ==0.22.0
- Jinja2 ==3.1.3
- Markdown ==3.5.2
- MarkupSafe ==2.1.5
- SQLAlchemy ==1.4.51
- appdirs ==1.4.4
- arrow ==1.3.0
- attrs ==23.2.0
- bibtexparser ==2.0.0b7
- bs4 ==0.0.2
- certifi ==2024.2.2
- chardet ==5.2.0
- cldfbench ==1.13.0
- cldfcatalog ==1.5.1
- cldfviz ==1.0.2
- cldfzenodo ==2.1.0
- clldutils ==3.22.1
- colorama ==0.4.6
- colorlog ==6.8.2
- csvw ==3.1.3
- cycler ==0.12.1
- exceptiongroup ==1.2.0
- gitdb ==4.0.11
- idna ==3.6
- igraph ==0.11.5
- importlib_resources ==6.1.3
- iniconfig ==2.0.0
- isodate ==0.6.1
- jmespath ==1.0.1
- jsonschema ==4.21.1
- kiwisolver ==1.4.5
- lingpy ==2.6.13
- lxml ==5.1.0
- matplotlib ==3.8.3
- multipledispatch ==1.0.0
- nameparser ==1.1.3
- networkx ==3.2.1
- newick ==1.9.0
- numpy ==1.26.4
- openpyxl ==3.1.2
- packaging ==23.2
- pluggy ==1.4.0
- purl ==1.6
- pybtex ==0.24.0
- pycldf ==1.34.0
- pyclts ==3.2.0
- pyconcepticon ==3.0.0
- pycountry ==23.12.11
- pyedictor ==0.4
- pyglottolog ==3.12.0
- pylatexenc ==2.10
- pylexibank ==3.4.0
- pyparsing ==3.1.1
- pyproj ==3.6.1
- pytest ==8.0.2
- python-dateutil ==2.8.2
- python-frontmatter ==1.1.0
- python-nexus ==2.9.0
- rdflib ==7.0.0
- referencing ==0.33.0
- regex ==2023.12.25
- reportlab ==4.1.0
- requests ==2.31.0
- rfc3986 ==1.5.0
- scipy ==1.13.1
- segments ==2.2.1
- shapely ==2.0.3
- six ==1.16.0
- smmap ==5.0.1
- soupsieve ==2.5
- tabulate ==0.9.0
- termcolor ==2.4.0
- texttable ==1.7.0
- toyplot ==1.0.3
- toytree ==2.0.1
- tqdm ==4.66.2
- uritemplate ==4.1.1
- urllib3 ==2.2.1
- xlrd ==2.0.1
- zenodoclient ==0.5.1
- zipp ==3.17.0
- cldfbench ==1.13.0
- csvw ==3.1.3
- pycldf ==1.34.0
- pyconcepticon ==3.0.0
- pylexibank >=3.0.0