greatapet2t-g4s
Evolutionary Dynamics of Predicted G-Quadruplexes in Human and Other Great Apes
Science Score: 49.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 5 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.5%) to scientific vocabulary
Repository
Evolutionary Dynamics of Predicted G-Quadruplexes in Human and Other Great Apes
Basic Info
Statistics
- Stars: 0
- Watchers: 4
- Forks: 1
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Evolutionary Dynamics of Predicted G-Quadruplexes in Human and Other Great Apes
This repository contains all the code and data generated for the paper.
Saswat K. Mohanty, Francesca Chiaromonte, Kateryna D. Makova*
Department of Biology, Penn State University, University Park, PA 16802
Department of Statistics, Penn State University, University Park, PA 16802
*Correspondence to Kateryna D. Makova (kdm16@psu.edu)
Directory Structure
This repository contains the following directories:
src: Includes all Python and Bash scripts necessary for analyzing data, generating datasets, or creating plots as required for this manuscript.jupyterNotebooks: Contains Python-based Jupyter notebooks for analyzing data and generating plots or datasets relevant to the manuscript.datasets: Hosts all datasets used or generated during the analyses described in this manuscript.plots: Stores all plots generated as part of the manuscripts analyses.
Some key dependencies required for the analyses in this manuscript are:
mapsea: A general-purpose repository containing scripts for mapping short elements stored in.BEDfiles (e.g., G-quadruplex intervals) to.MAF(alignment) files.g4Discovery: A general-purpose repository with scripts for predicting and annotating G-quadruplexes (G4s) in genome sequences, combiningpqsfinderandG4Hunter.
Predicted G4s with Evolutionary Information
For downloading pG4s across the six great ape speciesHomo sapiens, Pan paniscus, Pan troglodytes, Gorilla gorilla, Pongo abelii, and Pongo pygmaeusnavigate to the datasets/pG4s/ directory. Each species subdirectory contains a file named: wholeGenome.pG4s.withSharingInfo.bed. This file uses a custom .BED format that includes additional evolutionary and sequence information. The columns in this file are as follows:
markdown
- CHR: chromosome
- START: pG4 start
- END: pG4 end
- ID: #unique 10-digit ID assigned to this pG4, other species sharing this pG4 will have same ID
- PQSSCORE: pqsfinder score
- STRAND: strand in which pG4 is present
- G4HUNTERSCORE: G4Hunter score
- SHARED: shared pG4(S), aligned (ASS) or unaligned species-specific (USS) pG4
- WITH: with which species it is shared
{B: Bonobo, C: Chimpanzee, H: Human, G: Gorilla,
Bo: Bornean Orangutan, So: Sumatran Orangutan}
- DUPLICATED: If it is shared with the same chromosome in the same species (>1 indicates duplicated)
- SEQ: pG4 sequence in positive-strand
- MOTIF: standard (S) or bulged (B) pG4 motif
Using the file as a standard BED file: To convert the file into a standard 6-column .BED format (removing evolutionary and sequence details), use the following command:
bash
cut -f 1-6 wholeGenome.pG4s.withSharingInfo.bed > wholeGenome.pG4s.bed
Citation
If you use any of this data or tool in your research, please cite the following paper:
Mohanty, S. K., Chiaromonte F., & Makova, K. D. (2025). Evolutionary dynamics of predicted G-quadruplexes in human and other great apes. Genome Biology, Vol. 26(161),
DOI: 10.1186/s13059-025-03635-1
Owner
- Name: makovalab-psu
- Login: makovalab-psu
- Kind: organization
- Repositories: 13
- Profile: https://github.com/makovalab-psu
GitHub Events
Total
- Member event: 1
- Push event: 21
- Fork event: 1
- Create event: 2
Last Year
- Member event: 1
- Push event: 21
- Fork event: 1
- Create event: 2