dmethr

DNA Methylation repression of transcriptional activity

https://github.com/fchuffar/dmethr

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.0%) to scientific vocabulary
Last synced: 10 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: fchuffar
  • License: gpl-3.0
  • Language: R
  • Default Branch: main
  • Size: 1010 KB
Statistics
  • Stars: 0
  • Watchers: 2
  • Forks: 1
  • Open Issues: 0
  • Releases: 2
Created about 5 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License Authors Codemeta

README.md

dmethr: a 3-step pipeline to explore DNA methylation controlled gene expression from omics data

DNA methylation, the addition of a methyl group to the DNA molecule, is one of several epigenetic mechanisms that cells use to control gene expression. The most studied DNA methylation mark is that of a cytosine preceding a guanine (CpG dinucleotide). Since DNA methylation levels have been found to be lower in many cancers than in most mammalian somatic tissues [Nordor 2017] [Zhang 2021], researchers have investigated the role of DNA methylation as a key regulator of gene expression contributing to cancer development [Jones 2012] [Shi 2020]. Although methylation of CpG-rich regions (called CpG islands) has been associated with stable gene repression, the rules governing the relationship between DNA methylation and gene expression have not yet been fully established. To address this question, researchers need to explore omics data of various origins, including genome-wide genomic features, methylomes (DNA methylation data) and transcriptomic data.

Previous studies have established that 2/3 of human genes are associated with a low-methylated CpG-rich promoter region (figure 1), regardless of their expression levels. In contrast, a small subset of genes shows high tissue specificity of expression, and their repression in most normal (non-cancer), non-germline (somatic) tissues is associated with a highly methylated CpG-rich promoter region. The bioinformatic pipelines developed here aim to identify these genes and to test how their expression varies upon demethylation in experimental models and in cancer samples.

The first objective of this study is to classify genes according to the following three characteristics: i) CpG density of their promoter region, ii) level of methylation of the promoter region in normal tissues, iii) level of expression in a demethylating context. The aim is to identify a set of genes whose promoter region is CpG-rich and widely methylated in most tissues, and whose expression is up-regulated in a demethylating context. The relationship between the expression of these genes and the methylation of their promoter region is then explored in the context of cancer. For this purpose we are developing dmethr.
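As a rough illustration of the three-criteria classification described above, the sketch below filters a toy gene table. The column names (`cpg_density`, `meth_normal`, `log2fc_demeth`), the gene names, the values and all thresholds are hypothetical and chosen only for illustration; they are not dmethr's actual data structures or parameters.

```r
# Toy gene table; every value here is made up for illustration only.
genes <- data.frame(
  gene          = c("geneA", "geneB", "geneC", "geneD"),
  cpg_density   = c(0.80, 0.75, 0.20, 0.90),  # i) CpG density of the promoter region
  meth_normal   = c(0.85, 0.10, 0.90, 0.80),  # ii) promoter methylation in normal tissues
  log2fc_demeth = c(2.1, 0.1, 1.8, 0.2),      # iii) expression change in a demethylating context
  stringsAsFactors = FALSE
)

# Hypothetical thresholds (assumptions, not taken from dmethr).
cpg_rich_min    <- 0.6
methylated_min  <- 0.7
upregulated_min <- 1

# One logical flag per criterion.
genes$cpg_rich    <- genes$cpg_density   >= cpg_rich_min
genes$methylated  <- genes$meth_normal   >= methylated_min
genes$upregulated <- genes$log2fc_demeth >= upregulated_min

# Genes of interest satisfy all three criteria at once.
candidates <- genes$gene[genes$cpg_rich & genes$methylated & genes$upregulated]
print(candidates)
```

Keeping one flag per criterion, rather than a single combined filter, makes it easy to inspect which criterion excludes a given gene.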

Installation

devtools::install_github("fchuffar/dmethr")

Conda env

```
source ~/conda_config.sh

conda create -n dmethr_env

conda activate dmethr_env

mamba install -c anaconda -c bioconda -c conda-forge -c r r-base libopenblas bioconductor-geoquery bioconductor-affy bioconductor-biobase r-seqinr r-rcpparmadillo r-devtools r-fastmap r-matrix r-kernsmooth r-catools r-gtools r-nortest r-survival r-beanplot r-gplots r-sass jquery r-nlme r-bslib r-sourcetools r-fontawesome r-xtable r-httpuv r-dbi r-igraph

# The following lines, through dim(s$data), are R commands.
devtools::install_github("fchuffar/epimedtools")

gse='GSE41037'; rmarkdown::render('01buildstudygeneric.Rmd', output_file=paste0('01buildstudy_',gse,'.html'));

gse='GSE119617'; rmarkdown::render('01buildstudygeneric.Rmd', output_file=paste0('01buildstudy_',gse,'.html'));

gse='GSE40279'; rmarkdown::render('01buildstudygeneric.Rmd', output_file=paste0('01buildstudy_',gse,'.html'));

gse='GSE42861'; rmarkdown::render('01buildstudygeneric.Rmd', output_file=paste0('01buildstudy_',gse,'.html'));

dim(s$data) # problem with GSE42861 : only 374449 probes. not the case with R3.6.1

snakemake -k --cores 1 -s 00customstudies_wf.py -pn

```
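The four near-identical study-building calls above could be driven by a loop. The following is a minimal sketch, assuming the template reads the variable `gse` from the calling environment (as the one-liners above suggest) and that the template file name is exactly as written above; it skips rendering when the template or rmarkdown is not available.

```r
# GEO series listed in the commands above.
gse_ids <- c("GSE41037", "GSE119617", "GSE40279", "GSE42861")

# Template name copied as it appears above; adjust it if your file is named differently.
template <- "01buildstudygeneric.Rmd"

# One output report per series.
output_files <- paste0("01buildstudy_", gse_ids, ".html")

for (i in seq_along(gse_ids)) {
  gse <- gse_ids[i]  # assumption: the template picks up `gse` from this environment
  if (file.exists(template) && requireNamespace("rmarkdown", quietly = TRUE)) {
    rmarkdown::render(template, output_file = output_files[i])
  } else {
    message("Skipping ", gse, ": template or rmarkdown not available")
  }
}
```

Run it from the directory that contains the template so that the relative path resolves.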

Run pipeline

rmarkdown::render("00_dmethr_pipeline.Rmd")
rmarkdown::render("01_tss_cpg_status.Rmd")
rmarkdown::render("02_mapreduce_mean.Rmd")
rmarkdown::render("03_da_GSE45332.Rmd")
rmarkdown::render("03_da_GSE5816.Rmd")
rmarkdown::render("03_da_GSE14315.Rmd")
rmarkdown::render("04_results.Rmd")
rmarkdown::render("00_dmethr_poster.Rmd")
rmarkdown::render("00_dmethr_poster.Rmd", output_format="pdf_document")
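The pipeline steps listed above can also be run through a small wrapper that checks for missing files before starting. This is only a sketch: `run_steps` is a hypothetical helper, not part of dmethr, and it assumes each vignette can be rendered on its own in the listed order.

```r
# Vignettes in the order given above.
steps <- c(
  "00_dmethr_pipeline.Rmd",
  "01_tss_cpg_status.Rmd",
  "02_mapreduce_mean.Rmd",
  "03_da_GSE45332.Rmd",
  "03_da_GSE5816.Rmd",
  "03_da_GSE14315.Rmd",
  "04_results.Rmd",
  "00_dmethr_poster.Rmd"
)

# Hypothetical helper: fail early if a file is missing, then render in order.
# An error in any step stops the loop, so later steps never run on stale inputs.
run_steps <- function(steps, render = rmarkdown::render) {
  missing <- steps[!file.exists(steps)]
  if (length(missing) > 0) {
    stop("Missing vignette(s): ", paste(missing, collapse = ", "))
  }
  for (step in steps) {
    message("Rendering ", step)
    render(step)
  }
  invisible(steps)
}

# Run from the directory that contains the vignettes:
# run_steps(steps)
```

The PDF version of the poster still needs its own call with `output_format="pdf_document"`, as in the list above.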

Run pipeline on the cluster

Cluster prerequisites

https://github.com/fchuffar/practiclesessions/blob/master/cimentinfrastructure/config_cluster.sh

Package requirements

```

# BSgenome.Hsapiens.UCSC.hg38

BiocManager::install("BSgenome.Hsapiens.UCSC.hg38")

cd ~/projects
ln -s ~/../chuffarf/projects/tcgastudies
ln -s ~/../chuffarf/projects/genes
ln -s ~/../chuffarf/projects/studygpl570
ln -s ~/../chuffarf/projects/allhumantissues
ln -s ~/../chuffarf/projects/hadaca
```

Launch pipeline

```
ssh dahu
chandler
oarsub --project epimed -l /nodes=1,core=32,walltime=00:30:00 -I -t devel
oarsub --project epimed -l /nodes=1,core=4,walltime=6:00:00 -I
oarsub --project epimed -p "network_address='luke41'" -l /core=2,walltime=168:00:00 -I

mkdir -p ~/projects/
cd ~/projects/
git clone https://github.com/fchuffar/dmethr.git
cd ~/projects/dmethr/vignettes
git pull

R

source("00runpipeline.R")

rmarkdown::render("00metaanalysis.Rmd")

```

Knit meta_analysis on laptop

```
cd ~/projects/dmethr/vignettes/
rsync -auvP dahu:~/projects/dmethr/vignettes/*.xlsx ~/projects/dmethr/vignettes/.

rsync -auvP dahu:~/../chuffarf/projects/dmethr/vignettes/*.xlsx ~/projects/dmethr/vignettes/.

R

rmarkdown::render("00metaanalysis.Rmd")

```

Development

devtools::document(); devtools::install(); devtools::check(build_args="--no-build-vignettes")

Owner

  • Name: Florent Chuffart
  • Login: fchuffar
  • Kind: user
  • Location: Grenoble
  • Company: INSERM

CodeMeta (codemeta.json)

{
  "@context": "https://doi.org/10.5063/schema/codemeta-2.0",
  "@type": "SoftwareSourceCode",
  "license": "https://spdx.org/licenses/GPL-3.0",
  "codeRepository": "https://github.com/fchuffar/dmethr",
  "dateCreated": "2021-08-31",
  "name": "dmethr",
  "description": "dmethr: a 3-step pipeline to explore DNA methylation controlled gene expression from omics data",
  "applicationCategory": "Cellular Biology",
  "programmingLanguage": [
    "R"
  ],
  "author": [
    {
      "@type": "Person",
      "givenName": "Florence",
      "familyName": "Pittion"
    }
  ]
}

Dependencies

DESCRIPTION cran
  • methods * depends
  • gplots * imports
  • grDevices * imports
  • igraph * imports
  • memoise * imports
  • survival * imports
  • utils * imports
  • knitr * suggests
  • rmarkdown * suggests
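Before installing or checking the package, it can help to confirm that the dependencies listed above are available. The following is a minimal sketch using base R only; the package vector is copied from the Depends and Imports entries above, and the suggested packages (knitr, rmarkdown) are left out.

```r
# Packages listed under Depends and Imports in DESCRIPTION.
required <- c("methods", "gplots", "grDevices", "igraph",
              "memoise", "survival", "utils")

# requireNamespace() returns TRUE when a package can be loaded, FALSE otherwise.
is_installed <- vapply(required, requireNamespace, logical(1), quietly = TRUE)
missing_pkgs <- required[!is_installed]

if (length(missing_pkgs) > 0) {
  message("Missing packages: ", paste(missing_pkgs, collapse = ", "))
} else {
  message("All required packages are installed.")
}
```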