tsinfer

Infer a tree sequence from genetic variation data.

https://github.com/tskit-dev/tsinfer

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 4 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
    4 of 12 committers (33.3%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.8%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Infer a tree sequence from genetic variation data.

Basic Info
  • Host: GitHub
  • Owner: tskit-dev
  • License: gpl-3.0
  • Language: C
  • Default Branch: main
  • Homepage:
  • Size: 3.84 MB
Statistics
  • Stars: 60
  • Watchers: 11
  • Forks: 16
  • Open Issues: 108
  • Releases: 17
Created about 9 years ago · Last pushed 6 months ago
Metadata Files
Readme Changelog License Citation

README.md

tsinfer

CircleCI Build Status Docs Build codecov

Infer a tree sequence from genetic variation data

The documentation contains details of how to use this software, including installation instructions.

The initial algorithm, its rationale, and results from testing on simulated and real data are described in the following Nature Genetics paper:

Jerome Kelleher, Yan Wong, Anthony W Wohns, Chaimaa Fadil, Patrick K Albers and Gil McVean (2019) Inferring whole-genome histories in large population datasets. Nature Genetics 51: 1330-1338

Tsinfer versions 0.2.0 onwards allow missing data and provide a fully parameterised Li & Stephens matching algorithm (i.e. which allows mismatch). These improvements are described in the following Science paper:

Anthony Wilder Wohns, Yan Wong, Ben Jeffery, Ali Akbari, Swapan Mallick, Ron Pinhasi, Nick Patterson, David Reich, Jerome Kelleher, and Gil McVean (2022) A unified genealogy of modern and ancient genomes. Science 375: eabi8264

Please cite either or both of these if you use tsinfer in your work. Code to reproduce the results in the first paper is present in a separate GitHub repository.

Note that tsinfer does not attempt to infer node times (i.e. branch lengths of the inferred trees). If you require a tree sequence where the dates of common ancestors are expressed in calendar or generation times, you should post-process the tsinfer output using software such as tsdate.

Owner

  • Name: Tskit developers
  • Login: tskit-dev
  • Kind: organization
  • Email: admin@tskit.dev

Software for the creation and analysis of tree-sequences.

Citation (CITATION.md)

(sec_citation)=

# Citing tsinfer

If you use `tsinfer` in your work, please cite the
[2019 Nature Genetics paper](<https://doi.org/10.1038/s41588-019-0483-y>):

> Jerome Kelleher, Yan Wong, Anthony W. Wohns, 
> Chaimaa Fadil, Patrick K. Albers & Gil McVean (2019) 
> *Inferring whole-genome histories in large population datasets*,
> Nature Genetics, Volume 51, 1330–1338. https://doi.org/10.1038/s41588-019-0483-y

Bibtex record:

```bibtex

@article{Kelleher2019,
  doi = {10.1038/s41588-019-0483-y},
  url = {https://doi.org/10.1038/s41588-019-0483-y},
  year = {2019},
  month = sep,
  publisher = {Springer Science and Business Media {LLC}},
  volume = {51},
  number = {9},
  pages = {1330--1338},
  author = {Jerome Kelleher and Yan Wong and Anthony W. Wohns and Chaimaa Fadil and Patrick K. Albers and Gil McVean},
  title = {Inferring whole-genome histories in large population datasets},
  journal = {Nature Genetics}
}
```

GitHub Events

Total
  • Create event: 5
  • Release event: 2
  • Issues event: 55
  • Watch event: 4
  • Delete event: 1
  • Issue comment event: 187
  • Push event: 36
  • Pull request review comment event: 56
  • Pull request review event: 79
  • Pull request event: 59
  • Fork event: 3
Last Year
  • Create event: 5
  • Release event: 2
  • Issues event: 55
  • Watch event: 4
  • Delete event: 1
  • Issue comment event: 187
  • Push event: 36
  • Pull request review comment event: 56
  • Pull request review event: 79
  • Pull request event: 59
  • Fork event: 3

Committers

Last synced: almost 3 years ago

All Time
  • Total Commits: 1,135
  • Total Committers: 12
  • Avg Commits per committer: 94.583
  • Development Distribution Score (DDS): 0.381
Past Year
  • Commits: 117
  • Committers: 7
  • Avg Commits per committer: 16.714
  • Development Distribution Score (DDS): 0.573
Top Committers
Name Email Commits
Jerome Kelleher jk@w****k 702
Yan Wong y****n@y****e 151
Yan Wong y****g@b****k 138
Ben Jeffery b****y@b****k 43
dependabot[bot] 4****]@u****m 34
dependabot-preview[bot] 2****]@u****m 27
Ben Jeffery b****y@g****m 16
Ben Jeffery b****y@w****k 11
awohns a****s@g****m 9
Shing Zhan s****n@g****m 2
Inés Rebollo 4****o@u****m 1
dianali d****z@g****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 164
  • Total pull requests: 218
  • Average time to close issues: 11 months
  • Average time to close pull requests: about 1 month
  • Total issue authors: 16
  • Total pull request authors: 7
  • Average comments per issue: 2.99
  • Average comments per pull request: 3.72
  • Merged pull requests: 170
  • Bot issues: 0
  • Bot pull requests: 2
Past Year
  • Issues: 31
  • Pull requests: 77
  • Average time to close issues: 7 days
  • Average time to close pull requests: 5 days
  • Issue authors: 8
  • Pull request authors: 5
  • Average comments per issue: 2.0
  • Average comments per pull request: 1.94
  • Merged pull requests: 60
  • Bot issues: 0
  • Bot pull requests: 2
Top Authors
Issue Authors
  • hyanwong (78)
  • benjeffery (41)
  • jeromekelleher (29)
  • awohns (3)
  • frezaei98 (2)
  • percyfal (1)
  • evolgenomics (1)
  • duncanMR (1)
  • nspope (1)
  • szhan (1)
  • tforest (1)
  • stsmall (1)
  • moshejasper (1)
  • Tatianabellagio (1)
  • bgyuris (1)
Pull Request Authors
  • benjeffery (118)
  • hyanwong (80)
  • jeromekelleher (14)
  • mergify[bot] (2)
  • duncanMR (2)
  • hossam26644 (1)
  • eaton-lab (1)
Top Labels
Issue Labels
enhancement (6) bug (5) documentation (1)
Pull Request Labels
AUTOMERGE-REQUESTED (34)

Packages

  • Total packages: 2
  • Total downloads:
    • pypi 1,272 last-month
  • Total dependent packages: 1
    (may contain duplicates)
  • Total dependent repositories: 9
    (may contain duplicates)
  • Total versions: 19
  • Total maintainers: 2
pypi.org: tsinfer

Infer tree sequences from genetic variation data.

  • Versions: 16
  • Dependent Packages: 1
  • Dependent Repositories: 8
  • Downloads: 1,272 Last month
  • Docker Downloads: 0
Rankings
Docker downloads count: 3.5%
Dependent packages count: 4.7%
Dependent repos count: 5.2%
Average: 6.0%
Downloads: 10.5%
Maintainers (2)
Last synced: 6 months ago
conda-forge.org: tsinfer

tsinfer infers succinct tree sequences from observed genetic variation data

  • Versions: 3
  • Dependent Packages: 0
  • Dependent Repositories: 1
Rankings
Dependent repos count: 24.3%
Average: 40.4%
Stargazers count: 41.9%
Forks count: 43.8%
Dependent packages count: 51.6%
Last synced: 6 months ago

Dependencies

requirements/CI-tests-complete/requirements.txt pypi
  • attrs ==21.4.0
  • codecov ==2.1.12
  • colorama ==0.4.4
  • daiquiri ==3.0.1
  • h5py ==3.6.0
  • humanize ==4.1.0
  • lmdb ==1.3.0
  • matplotlib ==3.4.1
  • meson ==0.62.0
  • msprime ==1.1.1
  • numpy ==1.21.6
  • pandas ==1.2.5
  • pytest ==7.1.2
  • pytest-cov ==3.0.0
  • pytest-xdist ==2.5.0
  • seaborn ==0.11.2
  • sortedcontainers ==2.4.0
  • tqdm ==4.64.0
  • tskit ==0.5.0
  • zarr ==2.11.3
requirements/CI-tests-conda/requirements.txt pypi
  • h5py ==3.6.0
  • lmdb ==0.9.29
  • msprime ==1.1.1
  • numcodecs ==0.9.1
  • python-lmdb ==1.3.0
requirements/development.txt pypi
  • attrs *
  • codecov *
  • colorama *
  • coverage *
  • daiquiri *
  • flake8 *
  • h5py *
  • humanize *
  • lmdb *
  • matplotlib *
  • msprime >=1.0.0
  • numa *
  • numpy *
  • pandas *
  • pytest *
  • pytest-coverage *
  • pytest-xdist *
  • python-prctl *
  • seaborn *
  • setuptools_scm *
  • six *
  • sortedcontainers *
  • sphinx *
  • sphinx-argparse *
  • sphinx_rtd_theme *
  • tqdm *
  • tskit >=0.4.1
  • zarr *
setup.py pypi
  • attrs >=19.2.0
  • daiquiri *
  • humanize *
  • issues *
  • lmdb *
  • numcodecs >=0.6
  • numpy *
  • six *
  • sortedcontainers *
  • tqdm *
  • tskit >=0.5.0
  • zarr >=2.2,
.github/workflows/docs.yml actions
  • actions/cache v3 composite
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
  • styfle/cancel-workflow-action 0.11.0 composite
.github/workflows/tests.yml actions
  • actions/cache v3 composite
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
  • conda-incubator/setup-miniconda v2 composite
  • pre-commit/action v3.0.0 composite
  • styfle/cancel-workflow-action 0.11.0 composite
.github/workflows/wheels.yml actions
  • actions/checkout v3 composite
  • actions/download-artifact v3 composite
  • actions/setup-python v4 composite
  • actions/upload-artifact v3 composite
  • pypa/gh-action-pypi-publish master composite
requirements/CI-docs/requirements.txt pypi
  • Babel ==2.11.0
  • GitPython ==3.1.30
  • Jinja2 ==3.1.2
  • MarkupSafe ==2.1.2
  • PyYAML ==6.0
  • Pygments ==2.14.0
  • SQLAlchemy ==1.4.46
  • Send2Trash ==1.8.0
  • Sphinx ==4.5.0
  • alabaster ==0.7.13
  • anyio ==3.6.2
  • argon2-cffi ==21.3.0
  • argon2-cffi-bindings ==21.2.0
  • arrow ==1.2.3
  • asciitree ==0.3.3
  • asttokens ==2.2.1
  • attrs ==21.4.0
  • backcall ==0.2.0
  • beautifulsoup4 ==4.11.1
  • bleach ==5.0.1
  • certifi ==2022.12.7
  • cffi ==1.15.1
  • charset-normalizer ==3.0.1
  • click ==8.1.3
  • cloudpickle ==2.2.1
  • colorama ==0.4.6
  • coloredlogs ==15.0.1
  • comm ==0.1.2
  • cyvcf2 ==0.30.18
  • daiquiri ==3.2.1
  • dask ==2022.1.0
  • debugpy ==1.6.5
  • decorator ==5.1.1
  • defusedxml ==0.7.1
  • demes ==0.2.2
  • docutils ==0.17.1
  • entrypoints ==0.4
  • executing ==1.2.0
  • fasteners ==0.18
  • fastjsonschema ==2.16.2
  • fqdn ==1.5.1
  • fsspec ==2023.1.0
  • gitdb ==4.0.10
  • greenlet ==2.0.1
  • humanfriendly ==10.0
  • humanize ==4.4.0
  • idna ==3.4
  • imagesize ==1.4.1
  • importlib-metadata ==6.0.0
  • ipykernel ==6.20.2
  • ipython ==8.8.0
  • ipython-genutils ==0.2.0
  • ipywidgets ==7.7.2
  • isoduration ==20.11.0
  • jedi ==0.18.2
  • jsonpointer ==2.3
  • jsonschema ==4.17.3
  • jupyter-book ==0.13.1
  • jupyter-cache ==0.4.3
  • jupyter-events ==0.6.3
  • jupyter-server-mathjax ==0.2.6
  • jupyter-sphinx ==0.3.2
  • jupyter_client ==7.4.9
  • jupyter_core ==5.1.3
  • jupyter_server ==2.1.0
  • jupyter_server_terminals ==0.4.4
  • jupyterlab-pygments ==0.2.2
  • jupyterlab-widgets ==1.1.1
  • latexcodec ==2.0.1
  • linkify-it-py ==1.0.3
  • lmdb ==1.4.0
  • locket ==1.0.0
  • lxml ==4.9.2
  • markdown-it-py ==1.1.0
  • matplotlib-inline ==0.1.6
  • mdit-py-plugins ==0.2.8
  • mistune ==0.8.4
  • msprime ==1.2.0
  • myst-nb ==0.13.2
  • myst-parser ==0.15.2
  • nbclassic ==0.4.8
  • nbclient ==0.5.13
  • nbconvert ==6.5.4
  • nbdime ==3.1.1
  • nbformat ==5.7.3
  • nest-asyncio ==1.5.6
  • newick ==1.6.0
  • notebook ==6.5.2
  • notebook_shim ==0.2.2
  • numcodecs ==0.11.0
  • numpy ==1.24.1
  • packaging ==23.0
  • pandas ==1.5.3
  • pandocfilters ==1.5.0
  • parso ==0.8.3
  • partd ==1.3.0
  • pexpect ==4.8.0
  • pickleshare ==0.7.5
  • platformdirs ==2.6.2
  • prometheus-client ==0.15.0
  • prompt-toolkit ==3.0.36
  • psutil ==5.9.4
  • ptyprocess ==0.7.0
  • pure-eval ==0.2.2
  • pybtex ==0.24.0
  • pybtex-docutils ==1.0.2
  • pycparser ==2.21
  • pydata-sphinx-theme ==0.8.1
  • pyrsistent ==0.19.3
  • python-dateutil ==2.8.2
  • python-json-logger ==2.0.4
  • pytz ==2022.7.1
  • pyzmq ==25.0.0
  • requests ==2.28.2
  • rfc3339-validator ==0.1.4
  • rfc3986-validator ==0.1.1
  • ruamel.yaml ==0.17.21
  • ruamel.yaml.clib ==0.2.7
  • six ==1.16.0
  • smmap ==5.0.0
  • sniffio ==1.3.0
  • snowballstemmer ==2.2.0
  • sortedcontainers ==2.4.0
  • soupsieve ==2.3.2.post1
  • sphinx-argparse ==0.4.0
  • sphinx-book-theme ==0.3.3
  • sphinx-comments ==0.0.3
  • sphinx-copybutton ==0.5.1
  • sphinx-external-toc ==0.2.4
  • sphinx-issues ==3.0.1
  • sphinx-jupyterbook-latex ==0.4.7
  • sphinx-multitoc-numbering ==0.1.3
  • sphinx-thebe ==0.1.2
  • sphinx-togglebutton ==0.3.2
  • sphinx_design ==0.1.0
  • sphinxcontrib-bibtex ==2.5.0
  • sphinxcontrib-devhelp ==1.0.2
  • sphinxcontrib-htmlhelp ==2.0.0
  • sphinxcontrib-jsmath ==1.0.1
  • sphinxcontrib-qthelp ==1.0.3
  • sphinxcontrib-serializinghtml ==1.1.5
  • sphinxcontrib.applehelp ==1.0.3
  • stack-data ==0.6.2
  • svgwrite ==1.4.3
  • terminado ==0.17.1
  • tinycss2 ==1.2.1
  • toolz ==0.12.0
  • tornado ==6.2
  • tqdm ==4.64.1
  • traitlets ==5.8.1
  • tskit ==0.5.4
  • tskit-book-theme ==0.3.2
  • uc-micro-py ==1.0.1
  • uri-template ==1.2.0
  • urllib3 ==1.26.14
  • wcwidth ==0.2.6
  • webcolors ==1.12
  • webencodings ==0.5.1
  • websocket-client ==1.4.2
  • widgetsnbextension ==3.6.1
  • zarr ==2.13.6
  • zipp ==3.11.0