strchive

Short Tandem Repeat disease loci resource

https://github.com/dashnowlab/strchive

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.6%) to scientific vocabulary
Last synced: 6 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: dashnowlab
  • License: MIT
  • Language: Python
  • Default Branch: main
  • Homepage: http://strchive.org/
  • Size: 65.4 MB
Statistics
  • Stars: 21
  • Watchers: 5
  • Forks: 7
  • Open Issues: 22
  • Releases: 21
Created over 2 years ago · Last pushed 6 months ago
Metadata Files
Readme License Citation

README.md

STRchive

Short Tandem Repeats (STRs) are a type of genetic variation associated with many rare diseases. Information about pathogenic STRs is often out of date and scattered across different databases, making it difficult to find and interpret STR variants. STRchive ("ess tee archive") aims to solve this problem by providing a central community resource.

⭐️ View the data at strchive.org ⭐️

If you use STRchive in your research, please cite: Hiatt, L., Weisburd, B., Dolzhenko, E., Rubinetti, V., Avvaru, A.K., VanNoy, G.E., Kurtas, N.E., Rehm, H.L., Quinlan, A. and Dashnow, H.✉, 2025. STRchive: a dynamic resource detailing population-level and locus-specific insights at tandem repeat disease loci. Genome Medicine, 17(1). https://doi.org/10.1186/s13073-025-01454-4

STRchive by Harriet Dashnow is licensed under CC BY 4.0

Contributors

  • Harriet Dashnow
  • Laurel Hiatt
  • Akshay Avvaru
  • Vincent Rubinetti
  • Macayla Weiner

Contributing

If you notice an error, omission, or update, feel free to leave a comment or create a pull request.

To make a change to the STRchive data itself, please edit data/STRchive-loci.json

Then run the "linting" script and fix any errors:
python scripts/check-loci.py data/STRchive-loci.json
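
Before running the linting script, it can help to confirm that the edited file still parses as JSON at all, since a stray comma or quote is a common hand-editing mistake. The sketch below is only a syntax check under that assumption; the function name `quick_json_check` is hypothetical, it knows nothing about the STRchive schema, and it does not replace scripts/check-loci.py.

```python
import json
from pathlib import Path


def quick_json_check(path):
    """Return (ok, message) describing whether `path` parses as JSON.

    Syntax only: this is a hypothetical helper, not part of STRchive,
    and it performs none of the checks done by scripts/check-loci.py.
    """
    try:
        data = json.loads(Path(path).read_text(encoding="utf-8"))
    except FileNotFoundError:
        return False, f"{path}: file not found"
    except json.JSONDecodeError as err:
        # lineno/colno point at the position where parsing failed.
        return False, (
            f"{path}: invalid JSON at line {err.lineno}, "
            f"column {err.colno}: {err.msg}"
        )
    return True, f"{path}: valid JSON ({type(data).__name__} at top level)"


if __name__ == "__main__":
    ok, message = quick_json_check("data/STRchive-loci.json")
    print(message)
```

If this reports a parse error, fix it first; the linting script is then the place to catch content-level problems.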

Development

Run all scripts to update STRchive

From the root directory, run:
snakemake

Or, to skip the retrieve and manubot stages, which speeds things up substantially:
snakemake --config stages="skip-refs"

Update TRGT genotyping catalogs

python scripts/make-catalog.py -g hg38 -f TRGT data/STRchive-loci.json data/STRchive-disease-loci.hg38.TRGT.bed
python scripts/make-catalog.py -g T2T -f TRGT data/STRchive-loci.json data/STRchive-disease-loci.T2T-chm13.TRGT.bed
python scripts/make-catalog.py -g hg19 -f TRGT data/STRchive-loci.json data/STRchive-disease-loci.hg19.TRGT.bed

Update extended BED files

python scripts/make-catalog.py -f bed -g hg38 data/STRchive-loci.json data/STRchive-disease-loci.hg38.bed
python scripts/make-catalog.py -f bed -g T2T data/STRchive-loci.json data/STRchive-disease-loci.T2T-chm13.bed
python scripts/make-catalog.py -f bed -g hg19 data/STRchive-loci.json data/STRchive-disease-loci.hg19.bed
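
The TRGT and extended BED commands differ only in the genome build, the format, and the output file name, so they can be generated rather than typed. The following is a minimal sketch of that idea, not a script from the repository: the names `BUILD_LABELS`, `FORMAT_SUFFIXES`, and `catalog_commands` are hypothetical, and the flag order `-g` then `-f` is assumed to be interchangeable with `-f` then `-g` because the commands above use both.

```python
import shlex

# Genome build as passed to -g -> label used in the output file name.
BUILD_LABELS = {"hg38": "hg38", "T2T": "T2T-chm13", "hg19": "hg19"}

# Format as passed to -f -> output file suffix.
FORMAT_SUFFIXES = {"TRGT": ".TRGT.bed", "bed": ".bed"}


def catalog_commands(
    loci_json="data/STRchive-loci.json",
    out_prefix="data/STRchive-disease-loci",
):
    """Build one make-catalog.py argument list per format and genome build."""
    commands = []
    for fmt, suffix in FORMAT_SUFFIXES.items():
        for build, label in BUILD_LABELS.items():
            out_path = f"{out_prefix}.{label}{suffix}"
            commands.append(
                ["python", "scripts/make-catalog.py",
                 "-g", build, "-f", fmt, loci_json, out_path]
            )
    return commands


if __name__ == "__main__":
    # Print the commands for review; nothing is executed here.
    for cmd in catalog_commands():
        print(shlex.join(cmd))
```

Printing the commands instead of running them keeps the sketch safe to try; each list could be passed to `subprocess.run(cmd, check=True)` from the repository root once the output looks right.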

Install dependencies

New install:
conda env create --file scripts/environment.yml
conda activate strchive

Update existing installation:
conda activate strchive
conda env update --file scripts/environment.yml --prune
conda activate strchive

Note: biomaRt does not install cleanly through conda, so it is installed from within the R script that uses it.

Owner

  • Name: dashnowlab
  • Login: dashnowlab
  • Kind: organization

Citation (CITATION.cff)

title: STRchive
version: 2.9.1
date-released: "2025-08-18"
url: https://github.com/dashnowlab/STRchive
authors:
  - family-names: Dashnow
    given-names: Harriet
    orcid: https://orcid.org/0000-0001-8433-6270
cff-version: 1.2.0
message: "If you use this software, please cite the paper below."
preferred-citation:
  type: article
  authors:
    - family-names: Hiatt
      given-names: Laurel
    - family-names: Weisburd
      given-names: Ben
    - family-names: Dolzhenko
      given-names: Egor
    - family-names: Rubinetti
      given-names: Vincent
    - family-names: Avvaru
      given-names: Akshay K.
    - family-names: VanNoy
      given-names: Grace E.
    - family-names: Kurtas
      given-names: Nehir Edibe
    - family-names: Rehm
      given-names: Heidi L.
    - family-names: Quinlan
      given-names: Aaron R.
    - family-names: Dashnow
      given-names: Harriet
  doi: 10.1186/s13073-025-01454-4
  journal: "Genome Medicine"
  month: 3
  title: "STRchive: a dynamic resource detailing population-level and locus-specific insights at tandem repeat disease loci"
  issue: 1
  volume: 17
  year: 2025

Issues and Pull Requests

All Time
  • Total issues: 44
  • Total pull requests: 71
  • Average time to close issues: 17 days
  • Average time to close pull requests: 9 days
  • Total issue authors: 9
  • Total pull request authors: 5
  • Average comments per issue: 0.64
  • Average comments per pull request: 1.34
  • Merged pull requests: 34
  • Bot issues: 0
  • Bot pull requests: 13
Past Year
  • Issues: 42
  • Pull requests: 66
  • Average time to close issues: 18 days
  • Average time to close pull requests: 9 days
  • Issue authors: 8
  • Pull request authors: 5
  • Average comments per issue: 0.64
  • Average comments per pull request: 1.44
  • Merged pull requests: 31
  • Bot issues: 0
  • Bot pull requests: 13
Top Authors
Issue Authors
  • hdashnow (39)
  • laurelhiatt (7)
  • bw2 (5)
  • Macayla-weiner (4)
  • vincerubinetti (3)
  • strchive-bot (3)
  • uso2015 (1)
  • gabeng (1)
  • avvaruakshay (1)
  • jakobht96 (1)
  • b8hale (1)
  • egor-dolzhenko (1)
Pull Request Authors
  • hdashnow (54)
  • laurelhiatt (43)
  • vincerubinetti (16)
  • github-actions[bot] (12)
  • Macayla-weiner (11)
  • avvaruakshay (3)
  • Checunmily (1)
Top Labels
Issue Labels
  • lower-priority (5)
  • enhancement (5)
  • contact (3)
  • locus update (2)
  • wontfix (2)
  • front-end (1)
  • new locus (1)
  • bug (1)
Pull Request Labels
  • help wanted (1)