lingrex

Linguistic Reconstruction with LingPy

https://github.com/lingpy/lingrex

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 17 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.0%) to scientific vocabulary
Last synced: 7 months ago

Repository

Statistics
  • Stars: 14
  • Watchers: 7
  • Forks: 2
  • Open Issues: 5
  • Releases: 7
Created almost 9 years ago · Last pushed over 1 year ago
Metadata Files
Readme License Zenodo

README.md

LingRex: Linguistic Reconstruction with LingPy

LingRex offers the code needed for the automatic inference of sound correspondence patterns as described in the following paper:

List, J.-M. (2019): Automatic inference of sound correspondence patterns across multiple languages. Computational Linguistics 45.1. 137-161. DOI: 10.1162/coli_a_00344

To test this workflow, please check the workflow code example in tests/workflows/list-2019.
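The basic idea behind correspondence pattern inference can be illustrated with a toy example. Two alignment sites (columns of an alignment, with one slot per language) are treated as compatible when they never disagree in a language where both are attested, and compatible sites are merged into one pattern. The sketch below is a simplified greedy illustration in plain Python. It is not LingRex's implementation, and the data, the `MISSING` symbol, and all function names are hypothetical.

```python
MISSING = "Ø"  # hypothetical marker: language has no reflex in this cognate set


def compatible(site_a, site_b):
    """Sites are compatible if they never disagree where both are attested
    and share at least one attested sound."""
    shared = 0
    for a, b in zip(site_a, site_b):
        if a == MISSING or b == MISSING:
            continue
        if a != b:
            return False
        shared += 1
    return shared > 0


def merge(site_a, site_b):
    """Fill the missing slots of one site with attested sounds from the other."""
    return tuple(b if a == MISSING else a for a, b in zip(site_a, site_b))


def greedy_patterns(sites):
    """Assign each site to the first compatible pattern, or open a new one."""
    patterns = []  # each entry: [merged pattern, indices of member sites]
    for idx, site in enumerate(sites):
        for entry in patterns:
            if compatible(entry[0], site):
                entry[0] = merge(entry[0], site)
                entry[1].append(idx)
                break
        else:
            patterns.append([tuple(site), [idx]])
    return [(pattern, members) for pattern, members in patterns]


# Invented alignment sites for four languages.
sites = [
    ("p", "p", "f", MISSING),
    ("p", MISSING, "f", "b"),
    ("t", "t", "θ", "d"),
    (MISSING, "t", "θ", MISSING),
]
patterns = greedy_patterns(sites)
for pattern, members in patterns:
    print(pattern, members)
```

A first-fit strategy like this depends on the order of the sites. The paper treats the task as a graph partitioning problem, so the sketch should be read only as an intuition for what "compatible sites" means.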

LingRex also offers the code for a baseline algorithm for supervised automatic word prediction and automatic phonological reconstruction:

List, J.-M., R. Forkel, and N. W. Hill (2022): A New Framework for Fast Automated Phonological Reconstruction Using Trimmed Alignments and Sound Correspondence Patterns. Proceedings of the 3rd International Workshop on Computational Approaches to Historical Language Change (LChange 2022). Dublin, Ireland. https://aclanthology.org/2022.lchange-1.9

This algorithm is also used as a baseline for a Shared Task on the Prediction of Cognate Reflexes (https://sigtyp.github.io/st2022.html), organized as part of the SIGTYP Workshop at NAACL 2022.

List, J.-M., E. Vylomova, R. Forkel, N. Hill, and R. Cotterell (2022): The SIGTYP shared task on the prediction of cognate reflexes. In: Proceedings of the 4th Workshop on Computational Typology and Multilingual NLP. Association for Computational Linguistics. 52-62. https://aclanthology.org/2022.sigtyp-1.7

Methods for the handling of partial cognates were introduced in a study by Wu and List (2023):

Wu, M.-S. and J.-M. List (2023): Annotating cognates in phylogenetic studies of South-East Asian languages. Language Dynamics and Change. https://doi.org/10.1163/22105832-bja10023

Methods for the trimming of phonetic alignments were introduced in a study by Blum and List (2023):

Blum, F. and J.-M. List (2023): Trimming phonetic alignments improves the inference of sound correspondence patterns from multilingual wordlists. In: Proceedings of the 5th Workshop on Computational Typology and Multilingual NLP. Association for Computational Linguistics. 52-64. https://aclanthology.org/2023.sigtyp-1.6.pdf
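As a rough intuition for what trimming an alignment means, the sketch below removes every column in which the proportion of gaps exceeds a threshold. This is a deliberately minimal illustration under stated assumptions: the function name `trim_alignment`, the `threshold` parameter, and the data are hypothetical, and the published procedure is more elaborate than this.

```python
def trim_alignment(alignment, threshold=0.5, gap="-"):
    """Drop columns whose share of gap symbols exceeds `threshold`.

    `alignment` is a list of equally long rows, one row per language.
    """
    columns = list(zip(*alignment))
    keep = [
        i for i, column in enumerate(columns)
        if column.count(gap) / len(column) <= threshold
    ]
    return [[row[i] for i in keep] for row in alignment]


# Invented alignment of three word forms; the last column is mostly gaps.
alignment = [
    ["h", "a", "n", "d", "-"],
    ["h", "a", "n", "t", "-"],
    ["h", "a", "n", "d", "a"],
]
trimmed = trim_alignment(alignment)
for row in trimmed:
    print(" ".join(row))
```

With the default threshold, the final column (two gaps out of three rows) is removed, while the four fully attested columns are kept.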

Methods for the handling and creation of fuzzy / uncertain phonological reconstructions were introduced in a study by List et al. (2023):

List, J.-M., N. W. Hill, F. Blum, and R. Forkel (2023): A New Framework for the Representation and Computation of Uncertainty in Phonological Reconstruction. Proceedings of the 4th Workshop on Computational Approaches to Historical Language Change. 22-32. https://aclanthology.org/2023.lchange-1.3

When using this package in your research, please cite the papers that correspond to the algorithms you use, and cite the software package as follows:

List, J.-M. and R. Forkel (2023): LingRex: Linguistic Reconstruction with LingPy. [Computer software, Version 1.4.0]. With contributions by Frederic Blum and Mei-Shin Wu. Leipzig: Max Planck Institute for Evolutionary Anthropology. https://pypi.org/project/lingrex

Since this software package makes use of LingPy's alignment algorithms, you should also cite the LingPy package:

List, J.-M. and R. Forkel (2023): LingPy. A Python library for quantitative tasks in historical linguistics. Version 2.6.10. Max Planck Institute for Evolutionary Anthropology: Leipzig. https://lingpy.org

Installation

Install the package via pip:

```shell
pip install lingrex
```
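To confirm that the installation worked, one option is to query the installed package metadata from Python. The helper `installed_version` below is a hypothetical convenience function built on the standard library's `importlib.metadata`. It is not part of LingRex. It also checks `lingpy`, which is listed as a dependency.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string of `package`, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


for name in ("lingrex", "lingpy"):
    print(name, installed_version(name) or "not installed")
```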

Further Examples

The borrowing detection algorithm implemented in LingRex is introduced in the paper:

List, J.-M. and R. Forkel (2021): Automated identification of borrowings in multilingual wordlists [version 1; peer review: 3 approved, 1 approved with reservations]. Open Research Europe 1.79. 1-11. DOI: 10.12688/openreseurope.13843.1

If you use this algorithm, please cite LingRex and this paper.

In addition to the paper in which the correspondence pattern inference algorithm was first introduced, LingRex also offers the code to run the workflow described in the following paper:

Wu, M.-S., N. Schweikhard, T. Bodt, N. Hill, and J.-M. List (2020): Computer-Assisted Language Comparison. State of the Art. Journal of Open Humanities Data 6.2. 1-14. DOI: 10.5334/johd.12

To test this workflow, please check the workflow code example in tests/workflows/wu-2020.

If you use this workflow in your work, please cite this paper as well.

In addition, our experiment with T. Bodt on predicting words from sound correspondence patterns also made use of the LingRex package:

Bodt, T. and J.-M. List (2021): Reflex prediction. A case study of Western Kho-Bwa. Diachronica 0.0. 1-38. DOI: 10.1075/dia.20009.bod

To test this workflow, please check the workflow code example in tests/workflows/bodt-2019.
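The general idea of predicting a word from correspondence patterns can be sketched as follows: for a site where the target language is unattested, collect all known patterns that agree with the attested sounds and let them vote on the missing sound, weighted by how often each pattern occurs. This is a toy illustration only. The function `predict_sound`, the `MISSING` marker, and the pattern frequencies are invented for the example and do not reproduce the workflow in the repository.

```python
from collections import Counter

MISSING = "Ø"  # hypothetical marker for an unattested sound


def predict_sound(site, target, patterns):
    """Predict the sound at index `target` of `site`.

    `patterns` is a list of (pattern, frequency) pairs. A pattern votes if it
    agrees with every sound that is attested in both the site and the pattern.
    """
    votes = Counter()
    for pattern, frequency in patterns:
        if pattern[target] == MISSING:
            continue
        attested = [
            (s, p)
            for i, (s, p) in enumerate(zip(site, pattern))
            if i != target and s != MISSING and p != MISSING
        ]
        if attested and all(s == p for s, p in attested):
            votes[pattern[target]] += frequency
    return votes.most_common(1)[0][0] if votes else None


# Invented patterns over four languages with invented frequencies.
patterns = [
    (("p", "p", "f", "b"), 12),
    (("p", "p", "p", "b"), 3),
    (("t", "t", "θ", "d"), 9),
]
print(predict_sound(("p", "p", MISSING, "b"), 2, patterns))
print(predict_sound(("k", "k", MISSING, "g"), 2, patterns))
```

Returning `None` when no pattern matches keeps unpredictable sites visible instead of silently guessing.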

Owner

  • Name: LingPy
  • Login: lingpy
  • Kind: organization
  • Email: info@lingpy.org

Python library for quantitative tasks in historical linguistics

GitHub Events

Total
  • Watch event: 2
Last Year
  • Watch event: 2

Committers

Last synced: about 3 years ago

All Time
  • Total Commits: 83
  • Total Committers: 5
  • Avg Commits per committer: 16.6
  • Development Distribution Score (DDS): 0.518
Top Committers
| Name | Email | Commits |
|---|---|---|
| lingulist | m****t@l****e | 40 |
| Robert Forkel | x****g@g****m | 15 |
| lingulist | m****t@l****g | 14 |
| Johann-Mattis List | L****t@u****m | 10 |
| LinguList | m****t@p****e | 4 |
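The summary figures for committers can be reproduced from the per-committer commit counts. The sketch below assumes that the Development Distribution Score is defined as one minus the top committer's share of all commits. That definition is not stated on this page, but it matches the reported value. Each row of the committer list is treated as a separate committer, as the page does.

```python
# Commit counts per committer entry, as listed under "Top Committers".
commits = [40, 15, 14, 10, 4]

total = sum(commits)
average = total / len(commits)

# Assumed definition: 1 - (commits by top committer / total commits).
dds = 1 - max(commits) / total

print(f"Total commits: {total}")
print(f"Avg commits per committer: {average:.1f}")
print(f"DDS: {dds:.3f}")
```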

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 34
  • Total pull requests: 14
  • Average time to close issues: about 1 year
  • Average time to close pull requests: 1 day
  • Total issue authors: 6
  • Total pull request authors: 3
  • Average comments per issue: 2.94
  • Average comments per pull request: 3.14
  • Merged pull requests: 13
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: less than a minute
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • LinguList (28)
  • Tarotis (2)
  • xrotwang (1)
  • PhyloStar (1)
  • martino-vic (1)
  • Wu-Urbanek (1)
Pull Request Authors
  • LinguList (8)
  • xrotwang (5)
  • Tarotis (1)
Top Labels
Issue Labels
  • bug (4)
  • enhancement (1)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 130 last-month
  • Total dependent packages: 1
  • Total dependent repositories: 7
  • Total versions: 11
  • Total maintainers: 2
pypi.org: lingrex

Web-Based Tool for Computer-Assisted Language Comparison

  • Versions: 11
  • Dependent Packages: 1
  • Dependent Repositories: 7
  • Downloads: 130 Last month
Rankings
  • Dependent packages count: 4.7%
  • Dependent repos count: 5.6%
  • Average: 14.7%
  • Stargazers count: 17.1%
  • Forks count: 22.6%
  • Downloads: 23.7%
Maintainers (2)
Last synced: 7 months ago

Dependencies

pip-requirements.txt pypi
  • lingpy >=2.6.8
setup.py pypi
  • lingpy >=2.6.8
.github/workflows/python-package.yml actions
  • actions/checkout v4 composite
  • actions/setup-python v4 composite