Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.7%) to scientific vocabulary
Last synced: 10 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: lexibank
  • License: cc-by-4.0
  • Language: Python
  • Default Branch: main
  • Size: 18.6 MB
Statistics
  • Stars: 0
  • Watchers: 4
  • Forks: 0
  • Open Issues: 3
  • Releases: 2
Created over 2 years ago · Last pushed 11 months ago
Metadata Files
Readme License Zenodo

README.md

CLDF dataset derived from Ugarte et al.'s "NorthPeruLex - A Lexical Dataset of Small Language Families and Isolates from Northern Peru" (forthcoming).

How to cite

If you use these data, please cite

  • the original source:
    Ugarte, Carlos and Blum, Frederic and Ingunza, Adriano and Gonzales, Rosa and Peña, Jaime. Forthcoming. NorthPeruLex - A Lexical Dataset of Small Language Families and Isolates from Northern Peru.
  • the derived dataset, using the DOI of the particular released version you were using

Description

This dataset brings together lexical data from isolates and small language families of northern Peru to investigate their historical relations.

This dataset is licensed under a CC-BY-4.0 license.

Conceptlists in Concepticon:

  • Swadesh-1952-200

Notes

Work in progress.

Statistics

Glottolog: 97% · Concepticon: 100% · Source: 100% · BIPA: 99% · CLTS SoundClass: 99%

  • Varieties: 35 (linked to 34 different Glottocodes)
  • Concepts: 200 (linked to 200 different Concepticon concept sets)
  • Lexemes: 4,804
  • Sources: 16
  • Synonymy: 1.11
  • Cognacy: 4,804 cognates in 3,282 cognate sets (2,554 singletons)
  • Cognate Diversity: 0.67
  • Invalid lexemes: 0
  • Tokens: 27,168
  • Segments: 130 (1 BIPA error, 1 CLTS sound class error, 129 CLTS modified)
  • Inventory size (avg): 26.31
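
Several of the figures above can be related to one another with simple arithmetic. The sketch below is illustrative only: it assumes that cognate diversity is computed as (cognate sets - concepts) / (lexemes - concepts), a formula that is not stated in this README but does reproduce the reported value of 0.67. The other ratios (singleton share, tokens per lexeme, lexemes per variety) are derived here for convenience and are not part of the original statistics.

```python
# Figures copied from the Statistics list above.
varieties = 35
concepts = 200
lexemes = 4804
cognate_sets = 3282
singletons = 2554
tokens = 27168

# ASSUMPTION: this formula is not given in the README; it is one common
# definition that happens to reproduce the reported 0.67.
cognate_diversity = (cognate_sets - concepts) / (lexemes - concepts)

# Derived ratios (not reported in the README).
singleton_share = singletons / cognate_sets    # cognate sets with one member
tokens_per_lexeme = tokens / lexemes           # average segmented length
lexemes_per_variety = lexemes / varieties      # average coverage per variety

print(f"Cognate diversity:   {cognate_diversity:.2f}")
print(f"Singleton share:     {singleton_share:.1%}")
print(f"Tokens per lexeme:   {tokens_per_lexeme:.2f}")
print(f"Lexemes per variety: {lexemes_per_variety:.1f}")
```

Under this assumed definition, a value near 1 would mean almost every word forms its own cognate set, and a value near 0 would mean each concept is covered by a single cognate set across all varieties.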

Contributors

Name | GitHub user | Description | Role
--- | --- | --- | ---
Carlos Ugarte | @MuffinLinwist | Data collector, CLDF conversion and annotation | Author, Editor
Frederic Blum | @FredericBlum | CLDF conversion and annotation | Author, Editor
Adriano Ingunza | @BadBatched | Data collector and annotation | Author
Rosa Gonzales | @rosalgm | Data collector and annotation | Author
Jaime Peña | @JaimePenat | Data collector and annotation | Author

CLDF Datasets

The following CLDF datasets are available in cldf:
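
CLDF wordlists store their forms in plain CSV tables, so the contents of the cldf directory can be inspected with the Python standard library alone. The following is a minimal sketch, assuming the forms table is located at `cldf/forms.csv` and carries the standard CLDF `Language_ID` column; neither detail is confirmed by this README, and `count_forms_per_language` is a hypothetical helper name introduced for illustration.

```python
import csv
from collections import Counter
from pathlib import Path


def count_forms_per_language(forms_csv):
    """Count rows per Language_ID in a CLDF FormTable CSV file."""
    with Path(forms_csv).open(newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        return Counter(row["Language_ID"] for row in reader)


if __name__ == "__main__":
    # ASSUMPTION: conventional location of the forms table in this repository.
    forms_path = Path("cldf") / "forms.csv"
    if forms_path.exists():
        # Show the five varieties with the most forms.
        for language_id, n in count_forms_per_language(forms_path).most_common(5):
            print(language_id, n)
    else:
        print(f"{forms_path} not found; run this from the repository root.")
```

For anything beyond quick counts, the `pycldf` package pinned in the dependencies below is the more robust route, since it reads the dataset's metadata rather than relying on file and column names.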

Owner

  • Name: lexibank
  • Login: lexibank
  • Kind: organization

GitHub Events

Total
  • Create event: 6
  • Release event: 1
  • Issues event: 15
  • Watch event: 1
  • Delete event: 7
  • Issue comment event: 46
  • Push event: 55
  • Pull request review event: 2
  • Pull request review comment event: 2
  • Pull request event: 10
Last Year
  • Create event: 6
  • Release event: 1
  • Issues event: 15
  • Watch event: 1
  • Delete event: 7
  • Issue comment event: 46
  • Push event: 55
  • Pull request review event: 2
  • Pull request review comment event: 2
  • Pull request event: 10

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 7
  • Total pull requests: 6
  • Average time to close issues: about 1 month
  • Average time to close pull requests: 28 days
  • Total issue authors: 3
  • Total pull request authors: 2
  • Average comments per issue: 2.71
  • Average comments per pull request: 1.5
  • Merged pull requests: 4
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 6
  • Pull requests: 6
  • Average time to close issues: 16 days
  • Average time to close pull requests: 28 days
  • Issue authors: 3
  • Pull request authors: 2
  • Average comments per issue: 3.0
  • Average comments per pull request: 1.5
  • Merged pull requests: 4
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • MuffinLinwist (4)
  • LinguList (2)
  • FredericBlum (2)
Pull Request Authors
  • MuffinLinwist (4)
  • FredericBlum (2)

Dependencies

.github/workflows/cldf-validation.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
cldf/requirements.txt pypi
  • Babel ==2.14.0
  • Cartopy ==0.22.0
  • Jinja2 ==3.1.3
  • Markdown ==3.5.2
  • MarkupSafe ==2.1.5
  • SQLAlchemy ==1.4.51
  • appdirs ==1.4.4
  • arrow ==1.3.0
  • attrs ==23.2.0
  • bibtexparser ==2.0.0b7
  • bs4 ==0.0.2
  • certifi ==2024.2.2
  • chardet ==5.2.0
  • cldfbench ==1.13.0
  • cldfcatalog ==1.5.1
  • cldfviz ==1.0.2
  • cldfzenodo ==2.1.0
  • clldutils ==3.22.1
  • colorama ==0.4.6
  • colorlog ==6.8.2
  • csvw ==3.1.3
  • cycler ==0.12.1
  • exceptiongroup ==1.2.0
  • gitdb ==4.0.11
  • idna ==3.6
  • igraph ==0.11.5
  • importlib_resources ==6.1.3
  • iniconfig ==2.0.0
  • isodate ==0.6.1
  • jmespath ==1.0.1
  • jsonschema ==4.21.1
  • kiwisolver ==1.4.5
  • lingpy ==2.6.13
  • lxml ==5.1.0
  • matplotlib ==3.8.3
  • multipledispatch ==1.0.0
  • nameparser ==1.1.3
  • networkx ==3.2.1
  • newick ==1.9.0
  • numpy ==1.26.4
  • openpyxl ==3.1.2
  • packaging ==23.2
  • pluggy ==1.4.0
  • purl ==1.6
  • pybtex ==0.24.0
  • pycldf ==1.34.0
  • pyclts ==3.2.0
  • pyconcepticon ==3.0.0
  • pycountry ==23.12.11
  • pyedictor ==0.4
  • pyglottolog ==3.12.0
  • pylatexenc ==2.10
  • pylexibank ==3.4.0
  • pyparsing ==3.1.1
  • pyproj ==3.6.1
  • pytest ==8.0.2
  • python-dateutil ==2.8.2
  • python-frontmatter ==1.1.0
  • python-nexus ==2.9.0
  • rdflib ==7.0.0
  • referencing ==0.33.0
  • regex ==2023.12.25
  • reportlab ==4.1.0
  • requests ==2.31.0
  • rfc3986 ==1.5.0
  • scipy ==1.13.1
  • segments ==2.2.1
  • shapely ==2.0.3
  • six ==1.16.0
  • smmap ==5.0.1
  • soupsieve ==2.5
  • tabulate ==0.9.0
  • termcolor ==2.4.0
  • texttable ==1.7.0
  • toyplot ==1.0.3
  • toytree ==2.0.1
  • tqdm ==4.66.2
  • uritemplate ==4.1.1
  • urllib3 ==2.2.1
  • xlrd ==2.0.1
  • zenodoclient ==0.5.1
  • zipp ==3.17.0
setup.py pypi
  • cldfbench ==1.13.0
  • csvw ==3.1.3
  • pycldf ==1.34.0
  • pyconcepticon ==3.0.0
  • pylexibank >=3.0.0