remind-cancer
Bioinformatics pipeline to identify and prioritize activating Promoter SNVs (pSNVs) using genomic, transcriptomic and annotation data.
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file (found)
- ✓ codemeta.json file (found)
- ✓ .zenodo.json file (found)
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (6.1%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: nicholas-abad
- Language: Jupyter Notebook
- Default Branch: main
- Homepage: https://remind-cancer.readthedocs.io/en/readthedocs/
- Size: 56 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 1
Metadata Files
README.md
Overview of the REMIND-Cancer Filtering Pipeline
Beyond Recurrence: A Novel Workflow to Identify Activating Promoter Mutations in Cancer Genomes
Authors: Nicholas Abad (1,2), Irina Glas (1,3), Chen Hong (1,4), Annika Small (3), Yoann Pageaud (1,5), Ana Maia (3), Dieter Weichenhan (6), Christoph Plass (6), Barbara Hutter (7), Benedikt Brors (1,8,9,10), Cindy Körner (3), Lars Feuerbach (1)
1 Division of Applied Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany
2 Faculty of Engineering Sciences, Heidelberg University, Heidelberg, Germany
3 Division of Molecular Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany
4 Division of Molecular Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
5 Faculty of Biosciences, Heidelberg University, Heidelberg, Germany
6 Division of Cancer Epigenomics, German Cancer Research Center (DKFZ), Heidelberg, Germany
7 Computational Oncology Group, Molecular Diagnostics Program at the NCT and German Cancer Research Center (DKFZ), Heidelberg, Germany
8 German Cancer Consortium (DKTK), Core Center Heidelberg, Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
9 Medical Faculty Heidelberg and Faculty of Biosciences, Heidelberg University, 69120 Heidelberg, Germany
10 National Center for Tumor Diseases (NCT), Im Neuenheimer Feld 410, 69120 Heidelberg, Germany
Abstract
Cancer is a heterogeneous disease caused by genetic alterations. The computational analysis of cancer genomes has expanded the catalog of functional mutations. While individual high-impact mutations have also been discovered in gene promoters, frequency-based approaches have characterized only a few candidates so far. To facilitate the identification of rare activating promoter mutations in cancer, we developed a filtering-based computational workflow and applied it to the Pan-Cancer Analysis of Whole Genomes (PCAWG) dataset. Predicted mutations were investigated using our new visualization framework, pSNV Hunter, and prioritized for functional validation by luciferase assay. Here, we positively validated seven candidate pSNVs in vitro, including mutations within the promoters of ANKRD53 and MYB. Our analysis indicates that co-alterations, such as the overexpression or activation of transcription factors, impact the effectiveness of functional pSNVs. Our analysis more than doubles the number of validated activating promoter mutations in cancer and demonstrates the effectiveness of our filtering pipeline as well as of pSNV Hunter.
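To make the idea of a filtering-based workflow concrete, the following is a minimal sketch of how candidate pSNVs might be passed through a chain of filters. It is not the REMIND-Cancer implementation: the record fields (`in_promoter`, `creates_tf_site`, `expression_zscore`), the helper `apply_filters`, the gene names, and the threshold are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class PromoterSNV:
    """Hypothetical candidate record; every field is an assumption."""
    gene: str
    in_promoter: bool         # annotation: variant lies in a promoter region
    creates_tf_site: bool     # annotation: variant is predicted to create a TF binding site
    expression_zscore: float  # transcriptomics: expression of the gene in the mutated sample


Filter = Callable[[PromoterSNV], bool]


def apply_filters(candidates: Iterable[PromoterSNV],
                  filters: Iterable[Filter]) -> List[PromoterSNV]:
    """Apply each filter in order, keeping only candidates that pass all of them."""
    remaining = list(candidates)
    for keep in filters:
        remaining = [snv for snv in remaining if keep(snv)]
    return remaining


# Illustrative filter chain; the z-score cutoff of 1.0 is arbitrary.
FILTERS: List[Filter] = [
    lambda s: s.in_promoter,
    lambda s: s.creates_tf_site,
    lambda s: s.expression_zscore >= 1.0,
]

candidates = [
    PromoterSNV("GENE_A", True, True, 2.3),
    PromoterSNV("GENE_B", True, True, 0.1),   # fails the expression filter
    PromoterSNV("GENE_C", False, True, 3.0),  # fails the promoter filter
]

selected = apply_filters(candidates, FILTERS)
print([s.gene for s in selected])
```

Keeping each criterion as a separate function makes it easy to reorder filters or to record how many candidates survive each step.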
Additional Repositories
The publication references three additional tools that can be found at the following links:
- pSNV Hunter: Comprehensive visualization tool / dashboard to investigate and select Promoter SNVs (pSNVs) for downstream validation
- Deep Pileup: A quality control approach for evaluating individual genomic loci for potential signal noise
- Genome Tornado Plots Wrapper: Analyzing Copy Number Variation (CNV) events within the PCAWG dataset via GenomeTornadoPlot
Contact:
- Please contact Nicholas Abad (nicholas.a.abad@gmail.com) if you have any questions, comments or concerns.
Owner
- Login: nicholas-abad
- Kind: user
- Location: Heidelberg, Germany
- Website: https://www.linkedin.com/in/nicholasabad/
- Repositories: 7
- Profile: https://github.com/nicholas-abad
Machine Learning / Bioinformatics PhD Student at the DKFZ (German Cancer Research Center)
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you'd like to cite the REMIND-Cancer computational pipeline, please use the following citation. However, if you'd like to cite the results, please cite the paper."
authors:
  - family-names: Abad
    given-names: Nicholas
    orcid: https://orcid.org/0009-0004-8322-564X
title: "REMIND-Cancer: Identifying and Characterizing Functional Promoter SNVs"
version: 1.0
identifiers:
  - type: doi
    value: https://www.biorxiv.org/content/10.1101/2024.06.03.597231v1
date-released: 2024-04-24
GitHub Events
Total
- Release event: 1
- Delete event: 3
- Push event: 39
- Pull request event: 5
- Create event: 3
Last Year
- Release event: 1
- Delete event: 3
- Push event: 39
- Pull request event: 5
- Create event: 3
Dependencies
- Babel ==2.14.0
- Jinja2 ==3.1.3
- MarkupSafe ==2.1.4
- PyYAML ==6.0.1
- Pygments ==2.17.2
- Send2Trash ==1.8.2
- anyio ==4.2.0
- argon2-cffi ==23.1.0
- argon2-cffi-bindings ==21.2.0
- arrow ==1.3.0
- asttokens ==2.4.1
- async-lru ==2.0.4
- attrs ==23.2.0
- beautifulsoup4 ==4.12.3
- bleach ==6.1.0
- bs4 ==0.0.2
- certifi ==2023.11.17
- cffi ==1.16.0
- charset-normalizer ==3.3.2
- comm ==0.2.1
- debugpy ==1.8.0
- decorator ==5.1.1
- defusedxml ==0.7.1
- executing ==2.0.1
- fastjsonschema ==2.19.1
- fqdn ==1.5.1
- idna ==3.6
- ipykernel ==6.29.0
- ipython ==8.20.0
- isoduration ==20.11.0
- jedi ==0.19.1
- json5 ==0.9.14
- jsonpointer ==2.4
- jsonschema ==4.21.1
- jsonschema-specifications ==2023.12.1
- jupyter-events ==0.9.0
- jupyter-lsp ==2.2.2
- jupyter_client ==8.6.0
- jupyter_core ==5.7.1
- jupyter_server ==2.12.5
- jupyter_server_terminals ==0.5.2
- jupyterlab ==4.0.11
- jupyterlab_pygments ==0.3.0
- jupyterlab_server ==2.25.2
- matplotlib-inline ==0.1.6
- mistune ==3.0.2
- nbclient ==0.9.0
- nbconvert ==7.14.2
- nbformat ==5.9.2
- nest-asyncio ==1.6.0
- notebook ==7.0.7
- notebook_shim ==0.2.3
- numpy ==1.26.3
- overrides ==7.7.0
- packaging ==23.2
- pandas ==2.2.0
- pandocfilters ==1.5.1
- parso ==0.8.3
- pexpect ==4.9.0
- platformdirs ==4.1.0
- plotly ==5.18.0
- prometheus-client ==0.19.0
- prompt-toolkit ==3.0.43
- psutil ==5.9.8
- ptyprocess ==0.7.0
- pure-eval ==0.2.2
- pycparser ==2.21
- python-dateutil ==2.8.2
- python-json-logger ==2.0.7
- pytz ==2023.4
- pyzmq ==25.1.2
- referencing ==0.33.0
- requests ==2.31.0
- rfc3339-validator ==0.1.4
- rfc3986-validator ==0.1.1
- rpds-py ==0.17.1
- six ==1.16.0
- sniffio ==1.3.0
- soupsieve ==2.5
- stack-data ==0.6.3
- tenacity ==8.2.3
- terminado ==0.18.0
- tinycss2 ==1.2.1
- tornado ==6.4
- tqdm ==4.66.1
- traitlets ==5.14.1
- types-python-dateutil ==2.8.19.20240106
- tzdata ==2023.4
- uri-template ==1.3.0
- urllib3 ==2.1.0
- wcwidth ==0.2.13
- webcolors ==1.13
- webencodings ==0.5.1
- websocket-client ==1.7.0
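The entries above are exact version pins listed as `name ==version`. As a minimal sketch, assuming one wants to turn such a listing into pip-style `name==version` requirement lines, the pins could be parsed as follows. `parse_pins` is a hypothetical helper written for this example and is not part of the repository.

```python
import re

# Matches lines such as "- numpy ==1.26.3" (the leading dash is optional).
PIN = re.compile(r"^\s*-?\s*([A-Za-z0-9_.\-]+)\s*==\s*(\S+)\s*$")


def parse_pins(lines):
    """Return a {package: version} dict for every line that is an exact pin."""
    pins = {}
    for line in lines:
        match = PIN.match(line)
        if match:
            pins[match.group(1)] = match.group(2)
    return pins


sample = [
    "- numpy ==1.26.3",
    "- pandas ==2.2.0",
    "- argon2-cffi ==23.1.0",
    "Dependencies",  # non-pin lines are skipped
]

pins = parse_pins(sample)
requirements = [f"{name}=={version}" for name, version in pins.items()]
print("\n".join(requirements))
```

The resulting lines use the standard `name==version` requirement specifier form that pip accepts.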