Science Score: 57.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ✓ DOI references: Found 3 DOI reference(s) in README
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (9.6%) to scientific vocabulary
Repository
Short Tandem Repeat disease loci resource
Basic Info
- Host: GitHub
- Owner: dashnowlab
- License: mit
- Language: Python
- Default Branch: main
- Homepage: http://strchive.org/
- Size: 65.4 MB
Statistics
- Stars: 21
- Watchers: 5
- Forks: 7
- Open Issues: 22
- Releases: 21
Metadata Files
README.md
STRchive
Short Tandem Repeats (STRs) are a type of genetic variation that are associated with many rare diseases. Information about pathogenic STRs is often out-of-date and scattered across different databases, making it difficult to find and interpret STR variants. STRchive ("ess tee archive") aims to solve this problem by providing a central community resource.
⭐️ View the data at strchive.org ⭐️
If you use STRchive in your research, please cite: Hiatt, L., Weisburd, B., Dolzhenko, E., Rubinetti, V., Avvaru, A.K., VanNoy, G.E., Kurtas, N.E., Rehm, H.L., Quinlan, A. and Dashnow, H.✉, 2025. STRchive: a dynamic resource detailing population-level and locus-specific insights at tandem repeat disease loci. Genome Medicine. doi: https://doi.org/10.1186/s13073-025-01454-4.
STRchive by Harriet Dashnow is licensed under CC BY 4.0
Contributors
- Harriet Dashnow
- Laurel Hiatt
- Akshay Avvaru
- Vincent Rubinetti
- Macayla Weiner
Contributing
If you notice an error, an omission, or information that needs updating, feel free to leave a comment or create a pull request.
To make a change to the STRchive data itself, please edit data/STRchive-loci.json.
Then run the "linting" script and fix any errors it reports:
python scripts/check-loci.py data/STRchive-loci.json
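To illustrate the kind of check a linting step can perform, here is a minimal sketch. It is not the contents of scripts/check-loci.py. It assumes the loci file is a JSON list of objects, and the field names in `REQUIRED_FIELDS` are hypothetical placeholders rather than the real STRchive schema.

```python
import json

# Hypothetical required fields, for illustration only; the real schema
# is defined by the repository and may differ.
REQUIRED_FIELDS = ("id", "gene")


def lint_loci(loci):
    """Return a list of human-readable problems found in locus records."""
    problems = []
    seen_ids = set()
    for index, locus in enumerate(loci):
        if not isinstance(locus, dict):
            problems.append(f"record {index}: expected an object")
            continue
        # Flag fields that are absent or empty.
        for field in REQUIRED_FIELDS:
            if not locus.get(field):
                problems.append(f"record {index}: missing or empty '{field}'")
        # Flag identifiers that appear more than once.
        locus_id = locus.get("id")
        if isinstance(locus_id, str) and locus_id:
            if locus_id in seen_ids:
                problems.append(f"record {index}: duplicate id '{locus_id}'")
            seen_ids.add(locus_id)
    return problems


def lint_file(path):
    """Load a JSON file (assumed to hold a list of records) and lint it."""
    with open(path, encoding="utf-8") as handle:
        return lint_loci(json.load(handle))
```

An empty result from `lint_loci` means no problems were found under these assumed rules; the repository's own script remains the authority on what counts as valid.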
Development
Run all scripts to update STRchive
From the root directory, run:
snakemake
Or, to skip the retrieve and manubot stages, which speeds things up substantially:
snakemake --config stages="skip-refs"
Update TRGT genotyping catalogs
python scripts/make-catalog.py -g hg38 -f TRGT data/STRchive-loci.json data/STRchive-disease-loci.hg38.TRGT.bed
python scripts/make-catalog.py -g T2T -f TRGT data/STRchive-loci.json data/STRchive-disease-loci.T2T-chm13.TRGT.bed
python scripts/make-catalog.py -g hg19 -f TRGT data/STRchive-loci.json data/STRchive-disease-loci.hg19.TRGT.bed
Update extended BED files
python scripts/make-catalog.py -f bed -g hg38 data/STRchive-loci.json data/STRchive-disease-loci.hg38.bed
python scripts/make-catalog.py -f bed -g T2T data/STRchive-loci.json data/STRchive-disease-loci.T2T-chm13.bed
python scripts/make-catalog.py -f bed -g hg19 data/STRchive-loci.json data/STRchive-disease-loci.hg19.bed
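The six catalog commands above differ only in genome build and output format, so they can be generated in a loop. The following is a sketch, not part of the repository: the function name `catalog_commands` is hypothetical, and it assumes the order of the `-g` and `-f` flags does not matter to make-catalog.py.

```python
from itertools import product

LOCI_JSON = "data/STRchive-loci.json"

# Genome label passed to -g, mapped to the label used in the output filename.
GENOMES = {"hg38": "hg38", "T2T": "T2T-chm13", "hg19": "hg19"}

# Format passed to -f, mapped to the output filename suffix.
FORMATS = {"TRGT": "TRGT.bed", "bed": "bed"}


def catalog_commands():
    """Return the six make-catalog.py invocations as argument lists."""
    commands = []
    for (fmt, suffix), (genome, label) in product(FORMATS.items(), GENOMES.items()):
        output = f"data/STRchive-disease-loci.{label}.{suffix}"
        commands.append(
            ["python", "scripts/make-catalog.py",
             "-g", genome, "-f", fmt, LOCI_JSON, output]
        )
    return commands


if __name__ == "__main__":
    # Print the commands for review instead of running them.
    for cmd in catalog_commands():
        print(" ".join(cmd))
```

To execute the commands rather than print them, each argument list could be passed to `subprocess.run(cmd, check=True)` from the repository root.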
Install dependencies
New install:
conda env create --file scripts/environment.yml
conda activate strchive
Update existing installation:
conda activate strchive
conda env update --file scripts/environment.yml --prune
conda activate strchive
Note: biomaRt does not install cleanly through conda, so it is installed from within the R script that uses it.
Owner
- Name: dashnowlab
- Login: dashnowlab
- Kind: organization
- Repositories: 1
- Profile: https://github.com/dashnowlab
Citation (CITATION.cff)
title: STRchive
version: 2.9.1
date-released: "2025-08-18"
url: https://github.com/dashnowlab/STRchive
authors:
  - family-names: Dashnow
    given-names: Harriet
    orcid: https://orcid.org/0000-0001-8433-6270
cff-version: 1.2.0
message: "If you use this software, please cite the paper below."
preferred-citation:
  type: article
  authors:
    - family-names: Hiatt
      given-names: Laurel
    - family-names: Weisburd
      given-names: Ben
    - family-names: Dolzhenko
      given-names: Egor
    - family-names: Rubinetti
      given-names: Vincent
    - family-names: Avvaru
      given-names: Akshay K.
    - family-names: VanNoy
      given-names: Grace E.
    - family-names: Kurtas
      given-names: Nehir Edibe
    - family-names: Rehm
      given-names: Heidi L.
    - family-names: Quinlan
      given-names: Aaron R.
    - family-names: Dashnow
      given-names: Harriet
  doi: 10.1186/s13073-025-01454-4
  journal: "Genome Medicine"
  month: 3
  title: "STRchive: a dynamic resource detailing population-level and locus-specific insights at tandem repeat disease loci"
  issue: 1
  volume: 17
  year: 2025
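As an illustration of how the preferred-citation fields map onto a readable reference, the sketch below formats them into a single line. The values are copied from the CITATION.cff above; the function name `format_citation` and the output style are assumptions made for this example, not a format prescribed by the project or by the CFF standard.

```python
# Fields copied from the preferred-citation entry of CITATION.cff.
PREFERRED_CITATION = {
    "authors": [
        {"family-names": "Hiatt", "given-names": "Laurel"},
        {"family-names": "Weisburd", "given-names": "Ben"},
        {"family-names": "Dolzhenko", "given-names": "Egor"},
        {"family-names": "Rubinetti", "given-names": "Vincent"},
        {"family-names": "Avvaru", "given-names": "Akshay K."},
        {"family-names": "VanNoy", "given-names": "Grace E."},
        {"family-names": "Kurtas", "given-names": "Nehir Edibe"},
        {"family-names": "Rehm", "given-names": "Heidi L."},
        {"family-names": "Quinlan", "given-names": "Aaron R."},
        {"family-names": "Dashnow", "given-names": "Harriet"},
    ],
    "doi": "10.1186/s13073-025-01454-4",
    "journal": "Genome Medicine",
    "title": "STRchive: a dynamic resource detailing population-level "
             "and locus-specific insights at tandem repeat disease loci",
    "issue": 1,
    "volume": 17,
    "year": 2025,
}


def format_citation(ref):
    """Build a one-line citation from a CFF-style preferred-citation mapping."""
    authors = "; ".join(
        f"{a['family-names']}, {a['given-names']}" for a in ref["authors"]
    )
    return (
        f"{authors} ({ref['year']}). {ref['title']}. "
        f"{ref['journal']}, {ref['volume']}({ref['issue']}). "
        f"https://doi.org/{ref['doi']}"
    )


if __name__ == "__main__":
    print(format_citation(PREFERRED_CITATION))
```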
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 44
- Total pull requests: 71
- Average time to close issues: 17 days
- Average time to close pull requests: 9 days
- Total issue authors: 9
- Total pull request authors: 5
- Average comments per issue: 0.64
- Average comments per pull request: 1.34
- Merged pull requests: 34
- Bot issues: 0
- Bot pull requests: 13
Past Year
- Issues: 42
- Pull requests: 66
- Average time to close issues: 18 days
- Average time to close pull requests: 9 days
- Issue authors: 8
- Pull request authors: 5
- Average comments per issue: 0.64
- Average comments per pull request: 1.44
- Merged pull requests: 31
- Bot issues: 0
- Bot pull requests: 13
Top Authors
Issue Authors
- hdashnow (39)
- laurelhiatt (7)
- bw2 (5)
- Macayla-weiner (4)
- vincerubinetti (3)
- strchive-bot (3)
- uso2015 (1)
- gabeng (1)
- avvaruakshay (1)
- jakobht96 (1)
- b8hale (1)
- egor-dolzhenko (1)
Pull Request Authors
- hdashnow (54)
- laurelhiatt (43)
- vincerubinetti (16)
- github-actions[bot] (12)
- Macayla-weiner (11)
- avvaruakshay (3)
- Checunmily (1)