greatapet2t-g4s

Evolutionary Dynamics of Predicted G-Quadruplexes in Human and Other Great Apes

https://github.com/makovalab-psu/greatapet2t-g4s

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 5 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.5%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Evolutionary Dynamics of Predicted G-Quadruplexes in Human and Other Great Apes

Basic Info
  • Host: GitHub
  • Owner: makovalab-psu
  • License: mit
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 1.9 GB
Statistics
  • Stars: 0
  • Watchers: 4
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed about 1 year ago
Metadata Files
Readme License Citation

README.md

DOI

Evolutionary Dynamics of Predicted G-Quadruplexes in Human and Other Great Apes

This repository contains all the code and data generated for the paper.

Saswat K. Mohanty, Francesca Chiaromonte, Kateryna D. Makova*

Department of Biology, Penn State University, University Park, PA 16802
Department of Statistics, Penn State University, University Park, PA 16802
*Correspondence to Kateryna D. Makova (kdm16@psu.edu)

Directory Structure

This repository contains the following directories:

  • src: Includes all Python and Bash scripts necessary for analyzing data, generating datasets, or creating plots as required for this manuscript.

  • jupyterNotebooks: Contains Python-based Jupyter notebooks for analyzing data and generating plots or datasets relevant to the manuscript.

  • datasets: Hosts all datasets used or generated during the analyses described in this manuscript.

  • plots: Stores all plots generated as part of the manuscripts analyses.

Some key dependencies required for the analyses in this manuscript are:

  • mapsea: A general-purpose repository containing scripts for mapping short elements stored in .BED files (e.g., G-quadruplex intervals) to .MAF (alignment) files.

  • g4Discovery: A general-purpose repository with scripts for predicting and annotating G-quadruplexes (G4s) in genome sequences, combining pqsfinder and G4Hunter.

Predicted G4s with Evolutionary Information

For downloading pG4s across the six great ape speciesHomo sapiens, Pan paniscus, Pan troglodytes, Gorilla gorilla, Pongo abelii, and Pongo pygmaeusnavigate to the datasets/pG4s/ directory. Each species subdirectory contains a file named: wholeGenome.pG4s.withSharingInfo.bed. This file uses a custom .BED format that includes additional evolutionary and sequence information. The columns in this file are as follows:

markdown - CHR: chromosome - START: pG4 start - END: pG4 end - ID: #unique 10-digit ID assigned to this pG4, other species sharing this pG4 will have same ID - PQSSCORE: pqsfinder score - STRAND: strand in which pG4 is present - G4HUNTERSCORE: G4Hunter score - SHARED: shared pG4(S), aligned (ASS) or unaligned species-specific (USS) pG4 - WITH: with which species it is shared {B: Bonobo, C: Chimpanzee, H: Human, G: Gorilla, Bo: Bornean Orangutan, So: Sumatran Orangutan} - DUPLICATED: If it is shared with the same chromosome in the same species (>1 indicates duplicated) - SEQ: pG4 sequence in positive-strand - MOTIF: standard (S) or bulged (B) pG4 motif

Using the file as a standard BED file: To convert the file into a standard 6-column .BED format (removing evolutionary and sequence details), use the following command:

bash cut -f 1-6 wholeGenome.pG4s.withSharingInfo.bed > wholeGenome.pG4s.bed

Citation

If you use any of this data or tool in your research, please cite the following paper:

Mohanty, S. K., Chiaromonte F., & Makova, K. D. (2025). Evolutionary dynamics of predicted G-quadruplexes in human and other great apes. Genome Biology, Vol. 26(161), DOI: 10.1186/s13059-025-03635-1

Owner

  • Name: makovalab-psu
  • Login: makovalab-psu
  • Kind: organization

GitHub Events

Total
  • Member event: 1
  • Push event: 21
  • Fork event: 1
  • Create event: 2
Last Year
  • Member event: 1
  • Push event: 21
  • Fork event: 1
  • Create event: 2