glottolog

Collaborative data curation for Glottolog

https://github.com/glottolog/glottolog

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 4 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
    4 of 32 committers (12.5%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.2%) to scientific vocabulary

Keywords

glottolog linguistics
Last synced: 7 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: glottolog
  • License: other
  • Language: TeX
  • Default Branch: master
  • Homepage: http://glottolog.org
  • Size: 1.33 GB
Statistics
  • Stars: 169
  • Watchers: 19
  • Forks: 157
  • Open Issues: 28
  • Releases: 22
Topics
glottolog linguistics
Created about 10 years ago · Last pushed 8 months ago
Metadata Files
Readme Changelog Contributing License Zenodo

README.md

The Glottolog Data Repository

The Glottolog data repository is where the data served by the Glottolog web application is curated. The repository also offers an alternative way to access Glottolog's data locally, and forking glottolog/glottolog lets you maintain a locally customized version of the data.

Accessing Glottolog data

  • This repository is where Glottolog data is curated, so it is the right place to open issues about errors you have identified and to propose changes. A clone of this repository is also the right choice if you need access to all of Glottolog's data, possibly including older versions and the history of changes. Because the data format here is tailored towards maintainability rather than accessibility, you might want to use the Python package pyglottolog to access it programmatically.
  • glottolog.org - the Glottolog website - may be the most convenient place to inspect and browse the latest released version of Glottolog data. It also provides access to various download formats, tailored towards various re-use scenarios.
  • Glottolog as a CLDF dataset is probably the best option for accessing all of Glottolog's languoid data. Because the format is CLDF, the data can be used from all kinds of environments, such as spreadsheet programs, programming languages like R or Python, or the UNIX shell. A description of the files in this dataset is available in its README.
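
As a rough illustration of the CLDF route, the sketch below reads a languages table with Python's standard csv module. The column names (ID, Name, Glottocode, Latitude, Longitude) and the inline sample rows are assumptions made up for this example, not taken from the dataset; consult the dataset's README for the actual file layout.

```python
import csv
import io

# Hypothetical sample standing in for a CLDF languages table.
# Column names and rows are illustrative assumptions only.
SAMPLE_CSV = """ID,Name,Glottocode,Latitude,Longitude
abcd1234,Example Language A,abcd1234,10.5,20.25
efgh5678,Example Family B,efgh5678,,
"""


def read_languages(handle):
    """Return one dict per row, converting coordinates to float or None."""
    rows = []
    for row in csv.DictReader(handle):
        for key in ("Latitude", "Longitude"):
            # Empty cells mean "no coordinate recorded" in this sketch.
            row[key] = float(row[key]) if row[key] else None
        rows.append(row)
    return rows


languages = read_languages(io.StringIO(SAMPLE_CSV))
# Keep only the entries that carry a latitude.
with_coordinates = [lang["ID"] for lang in languages if lang["Latitude"] is not None]
print(with_coordinates)
```

To read a real file instead of the sample, pass a handle opened with `open(path, newline="", encoding="utf-8")` to `read_languages`.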

How to cite

Only released versions of the Glottolog data should be cited. These releases are archived on and available from Zenodo at https://doi.org/10.5281/zenodo.596479

Types of data in Glottolog

Languoids

Data about Glottolog languoids (languages, dialects or sub-groups, aka families) is stored in text files (one per languoid) formatted as INI files in the languoids/tree subdirectory. The directory tree mirrors the Glottolog classification of languages.
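
Because each languoid is an INI file and the directory nesting encodes the classification, a local clone can be explored with the standard library alone. The following is a minimal sketch under stated assumptions: the helper `iter_languoids` is hypothetical, and the per-directory file name `md.ini` is an assumption that the text above does not confirm. For real work, pyglottolog is likely the better tool.

```python
import configparser
from pathlib import Path


def iter_languoids(tree_root, ini_name="md.ini"):
    """Yield (lineage, parser) for every INI file found below tree_root.

    lineage is the list of directory names from the top-level ancestor
    down to the languoid itself, so it reflects the classification.
    ini_name is an assumed file name, not confirmed by the README text.
    """
    root = Path(tree_root)
    for ini_path in root.rglob(ini_name):
        # Disable interpolation so literal "%" characters in values are safe.
        parser = configparser.ConfigParser(interpolation=None)
        parser.read(ini_path, encoding="utf-8")
        lineage = list(ini_path.parent.relative_to(root).parts)
        yield lineage, parser
```

With a clone at hand, something like `for lineage, parser in iter_languoids("languoids/tree"): print(lineage, parser.sections())` would list each languoid's position in the tree together with the sections its file contains.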

References

The Glottolog bibliography is curated as a set of BibTeX files in the references/bibtex subdirectory, which are merged into a single reference database for each release/edition.
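
Merging several BibTeX files into one database requires, at a minimum, knowing which citation keys occur in which file. The sketch below illustrates that one step with a simple regular expression; it is not Glottolog's actual merge procedure, and the function names and the file names in the usage note are hypothetical.

```python
import re
from collections import defaultdict

# Matches the start of an entry such as "@book{key," at the beginning of a line.
ENTRY_START = re.compile(r"^@(\w+)\s*\{\s*([^,\s]+)\s*,", re.MULTILINE)


def index_entries(sources):
    """Map each citation key to the names of the sources that define it.

    sources: dict mapping a source name to its BibTeX text.
    """
    index = defaultdict(list)
    for name, text in sources.items():
        for entry_type, key in ENTRY_START.findall(text):
            # Skip BibTeX directives that are not bibliography entries.
            if entry_type.lower() in ("comment", "string", "preamble"):
                continue
            index[key].append(name)
    return dict(index)


def duplicate_keys(index):
    """Return only the keys that are defined in more than one place."""
    return {key: names for key, names in index.items() if len(names) > 1}
```

Calling `index_entries({"a.bib": text_a, "b.bib": text_b})` and passing the result to `duplicate_keys` shows which keys would collide in a naive concatenation. A full merge would also need a real BibTeX parser and a policy for resolving such collisions.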

Metadata

Metadata - e.g. controlled vocabularies for some of the languoid data - are stored as INI files in the config subdirectory.

Owner

  • Name: Glottolog
  • Login: glottolog
  • Kind: organization
  • Email: glottolog@eva.mpg.de

Repositories related to the data curation of Glottolog, the language catalog.

GitHub Events

Total
  • Create event: 31
  • Release event: 1
  • Issues event: 144
  • Watch event: 16
  • Delete event: 25
  • Issue comment event: 156
  • Push event: 70
  • Pull request review comment event: 1
  • Pull request review event: 29
  • Pull request event: 77
  • Fork event: 22
Last Year
  • Create event: 31
  • Release event: 1
  • Issues event: 144
  • Watch event: 16
  • Delete event: 25
  • Issue comment event: 156
  • Push event: 70
  • Pull request review comment event: 1
  • Pull request review event: 29
  • Pull request event: 77
  • Fork event: 22

Committers

Last synced: about 3 years ago

All Time
  • Total Commits: 1,557
  • Total Committers: 32
  • Avg Commits per committer: 48.656
  • Development Distribution Score (DDS): 0.65
Top Committers
Name Email Commits
d97hah h****d@b****e 545
xrotwang x****g@g****m 471
Sebastian Bank s****k@u****e 249
Andreas Hanzl 3****n@u****m 221
Hedvig Skirgård h****d@g****m 12
Martin Haspelmath h****h@u****m 6
Simon J Greenhill S****l@u****m 6
jrnold j****d@g****m 5
Mario J. Lucero m****o@h****m 5
Haikal 8****K@u****m 4
KP van Dam 柯禕藍 y****o@g****m 4
Harald Hammarström h****5@u****e 3
Tavalmayam T****m@u****m 2
Hedvig Skirgård h****c@g****m 2
JoeyKentin 7****n@u****m 2
Viktor Martinović 5****c@u****m 2
Evgeniya Korovina l****n@u****m 2
Arctic Circle System 6****m@u****m 2
Jakob Lesage j****g@u****m 1
Elie Roux e****x@t****u 1
BishnupriyaManipuri 3****i@u****m 1
Hans-Jörg Bibiko h****o@e****e 1
Jackson L. Lee j****e@g****m 1
Gereon Kaiping g****g@h****l 1
Philippe Verdy 1****p@u****m 1
mshd m****4@g****m 1
Tom Dougherty d****m@g****m 1
Subhashish Panigrahi 1****h@u****m 1
Meikal m****l@u****m 1
afehn f****n@s****e 1
and 2 more...

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 77
  • Total pull requests: 26
  • Average time to close issues: about 1 month
  • Average time to close pull requests: 11 days
  • Total issue authors: 26
  • Total pull request authors: 9
  • Average comments per issue: 0.91
  • Average comments per pull request: 0.23
  • Merged pull requests: 15
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 76
  • Pull requests: 26
  • Average time to close issues: 15 days
  • Average time to close pull requests: 11 days
  • Issue authors: 25
  • Pull request authors: 9
  • Average comments per issue: 0.84
  • Average comments per pull request: 0.23
  • Merged pull requests: 15
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • teydrin (15)
  • xrotwang (15)
  • washwall (9)
  • chiarcos (6)
  • yapalic (5)
  • Arctic-Circle-System (5)
  • patkaiist (3)
  • blasterball (3)
  • cristianolongoodhl (2)
  • hmmm816 (2)
  • Glactic-String (2)
  • ReksaTresna (2)
  • gldrk (2)
  • JoeMustafa73 (2)
  • IanAdriano123 (2)
Pull Request Authors
  • d97hah (36)
  • xrotwang (9)
  • xflr6 (3)
  • shuldialect (3)
  • Laitei40 (2)
  • patkaiist (2)
  • Playgirlkaybraz11 (1)
  • CaroleWerner (1)
  • 2004content (1)
  • twagoo (1)
  • erba994 (1)
  • cristianolongoodhl (1)
  • HedvigS (1)
  • rayza (1)
  • kogepara (1)
Top Labels
Issue Labels
duplicate (1) needs_reference (1)
Pull Request Labels