protein-metabolite
Code repository for "Protein-Metabolite Association Studies Identify Novel Proteomic Determinants of Metabolite Levels in Human Plasma"
Science Score: 41.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (2.6%) to scientific vocabulary
Repository
Code repository for "Protein-Metabolite Association Studies Identify Novel Proteomic Determinants of Metabolite Levels in Human Plasma"
Basic Info
Statistics
- Stars: 6
- Watchers: 1
- Forks: 1
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
View Results
View a browsable version of the Protein-Metabolite associations as a Shiny app at: https://mbenson.shinyapps.io/protein-metabolite
Protein-Metabolite Association Studies Identify Novel Proteomic Determinants of Metabolite Levels in Human Plasma
Mark D. Benson1, Aaron S. Eisman1, 2, Usman A. Tahir1, Daniel H. Katz1, Shuliang Deng1, Debby Ngo1, Jeremy M. Robbins1, Alissa Hofmann1, Xu Shi1, Shuning Zheng1, Michelle Keyes1, Zhi Yu3, Yan Gao4, Laurie Farrell1, Dongxiao Shen1, Zsu-Zsu Chen1, Daniel E. Cruz1, Mario Sims4, Adolfo Correa4, Russell P. Tracy5, Peter Durda5, Kent D. Taylor6, Yongmei Liu7, W. Craig Johnson8, Xiuqing Guo6, Jie Yao6, Yii-Der Ida Chen6, Ani W. Manichaikul9,10, Deepti Jain11, Qiong Yang12, NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium, Claude Bouchard13, Mark A. Sarzynski14, Stephen S. Rich9, Jerome I. Rotter6, Thomas J. Wang15, James G. Wilson1, Clary B. Clish3, Indra Neil Sarkar2, Pradeep Natarajan3, 16, 17, and Robert E. Gerszten1, 3†
- Division of Cardiovascular Medicine, Beth Israel Deaconess Medical Center, Boston, MA;
- Center for Biomedical Informatics, Brown University, Providence, RI;
- Broad Institute of Harvard and MIT, Cambridge, MA;
- University of Mississippi Medical Center, Jackson, MS;
- Department of Pathology Laboratory Medicine, Larner College of Medicine, University of Vermont, Burlington, VT;
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA;
- Department of Medicine, Division of Cardiology, Duke Molecular Physiology Institute, Duke University Medical Center, Durham, NC;
- Department of Biostatistics, University of Washington, Seattle, WA;
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA;
- Division of Biostatistics and Epidemiology, Department of Public Health Sciences, University of Virginia, Charlottesville, VA;
- University of Washington, Seattle, WA;
- Department of Biostatistics, Boston University School of Public Health, Boston, MA;
- Human Genomic Laboratory, Pennington Biomedical Research Center, Baton Rouge, LA;
- Department of Exercise Science, University of South Carolina, Columbia, SC;
- Department of Medicine, UT Southwestern Medical Center, Dallas, TX;
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA;
- Department of Medicine Harvard Medical School, Boston, MA
*indicates equal contribution in work and shared first co-authorship †corresponding author
Paper Abstract
While many novel gene-metabolite and gene-protein associations have been identified using high throughput biochemical profiling, systematic studies that leverage human genetics to illuminate novel protein-metabolite associations are lacking. Here, we performed protein-metabolite association studies in 3,626 plasma samples from three population studies. We detected over 171,800 protein-metabolite pairwise correlations between 1265 proteins and 365 metabolites (q-value ≤ 0.05). Our findings include correlations from central metabolic and signaling pathways, such as the protein thyroxine binding globulin and the metabolite thyroxine (r = 0.51, q-value ≤ 1.0 x 10-300), as well as thousands of novel findings. In enrichment analyses, we detected associations between plasma proteins and specific classes of amino acid, lipid and nucleotide metabolites, including associations between lipids and cathepsin proteases, serpin peptidase inhibitors, and secreted glycoproteins. In Mendelian Randomization analyses, we identified putative causal protein-to-metabolite associations, including associations between the scavenger receptor CD36 and circulating levels of dozens of lipid species such as arachidonic acid (beta 0.14, q-value 2.0 x 10-4) and docosahexaenoic acid (beta 0.16, q-value 1.1 x 10-4). We experimentally validated many of the top protein-to-metabolite associations in proof-of-concept plasma metabolomics studies in three murine knockout models. These analyses identified previously unrecognized associations between bioactive proteins and metabolites in human plasma. We provide publicly available data to be leveraged for studies in human metabolism and disease.
About this repository
This repository contains the code used to generate results for the above citation organized into analysis steps by folders.
data_prep
Protein, metabolite, and phenotype data from the Jackson Heart Study (JHS), the Multi-Ethnic Study of Atherosclerosis (MESA), and the Health, Risk Factors, Exercise Training and Genetics (HERITAGE) Family Study were ingested and put into a standardized form using the ``loadtransform'' scripts within the dataprep folder. In addition, a key was generated to translate between metabolite names across the three studies. Finally, a key was generated to translate between protein IDs in HERITAGE and the other two studies (JHS and MESA).
analysis
The analysis of these data was broken up into three parts. The first was a correlation analysis using the above-mentioned standardized and transformed versions of the raw protein and metabolite measurements. Next, a series of scripts were written to identify and extract genetic associations from previously performed protein and metabolite GWAS results that had been conducted on each of the three studies for mendelian randomization. Finally, those mendelian randomization analyses were meta-analyzed across studies.
tables
Scripts to generate all tables, including supplemental tables, are provided.
figures
Scripts to generate core components of figures are provided. Additional stylistic modifications (e.g. arrangement and font sizes) using Adobe Indesign and are not included here.
shinyapp
A Shiny data browser and associated data files are available for download. Fully contained within the shinyapp folder is the R code and data necessary to browse the detailed results on your personal computer.
Owner
- Name: Aaron Eisman
- Login: aeisman
- Kind: user
- Location: Providence, RI
- Company: Brown University
- Website: aaroneisman.com
- Repositories: 1
- Profile: https://github.com/aeisman
MD-PhD Student in Biomedical Informatics with the Center for Biomedical Informatics (@bcbi) at Brown University.
Citation (CITATION.cff)
cff-version: 1.1.0
message: "If you use this software, please cite it as below."
authors:
- family-names: Eisman
given-names: Aaron
orcid: https://orcid.org/0000-0000-0000-0000
title: aeisman/protein-metabolite: Cell Metabolism Submission
version: v1.0.0.0
date-released: 2023-05-12
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: about 1 year ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0