graditude

GRADitude - The GRAD-seq data analysis tool

https://github.com/foerstner-lab/graditude

Science Score: 64.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
    1 of 7 committers (14.3%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.7%) to scientific vocabulary

Keywords from Contributors

mesh interactive
Last synced: 10 months ago · JSON representation ·

Repository

GRADitude - The GRAD-seq data analysis tool

Basic Info
Statistics
  • Stars: 1
  • Watchers: 2
  • Forks: 0
  • Open Issues: 7
  • Releases: 1
Created over 9 years ago · Last pushed 10 months ago
Metadata Files
Readme License Citation

README.md

DOI Latest Version

About GRADitude

GRADitude - The GRAD-seq data analysis tool

Grad-seq is a high-throughput profiling approach for the organism-wide detection of RNA-RNA and RNA-protein interactions in which molecular complexes are separated in a gradient by shape and size (Smirnov et al. 2016 PNAS). It offers new means to study the role of different RNA and protein components in various macromolecular assemblies by analyzing fractions of a glycerol gradient by a high- throughput sequencing approaches combined with mass spectrometry. The Grad-seq approach offers a way to study the distribution of all RNA involvement in various macromolecular assemblies.

GRADitude is a computational tool for the analysis of Grad-seq in-gradient profiling.

This open source tool performs all required steps to translate sequencing data of a Grad-seq experiment into a list of potential molecular complexes.

Documentation

Documentation can be found on here.

Installation

Current there is no proper pip package for GRADitude available - but it's work in progress. :)

Github

All the source code of GRADitude can be retrieve from our Git repository. Using the following commands can clone the source code easily.

$ git clone https://github.com/foerstner-lab/GRADitude.git

or

$ git clone git@github.com:foerstner-lab/GRADitude.git

In order to make GRADitude runnable, we have to create a soft link of graditudelib in bin.

$ cd GRADitude/bin

$ ln -s ../graditudelib .

Arguments

```bash usage: graditude [-h] [--version] {create,minrowsumercc,minrowsum,dropcolumn,robustregression,normalize,scaling,correlationallagainstall,selectingspecificfeatures,heatmap,plotkinetics,clustering,clusteringelbow,silhouetteanalysis,pca,tsne,umap,correlationrnasprotein,correlationdistributiongraph,plotnetworkgraph,clusteringproteins,dimensionreductionproteins,correlationspecificgene,interactiveplots,correlationreplicates,findcomplexes} ...

positional arguments: {create,minrowsumercc,minrowsum,dropcolumn,robustregression,normalize,scaling,correlationallagainstall,selectingspecificfeatures,heatmap,plotkinetics,clustering,clusteringelbow,silhouetteanalysis,pca,tsne,umap,correlationrnasprotein,correlationdistributiongraph,plotnetworkgraph,clusteringproteins,dimensionreductionproteins,correlationspecificgene,interactiveplots,correlationreplicates,findcomplexes} commands minrowsumercc Filter the ERCC table based on the min row sum. It calculates the sum rowwise and discard the rows with a sum below the threshold specified minrowsum Filter the gene quantification table based on the min row sum. It calculates the sum rowwise and discard the rows with a sum below the threshold specified dropcolumn It filters a table dropping a specific column.It is usually used to drop the lysate column that is not required for the downstream analysis robustregression It compares the ERCC concentration in mix with the ERCC reads and take it out the outliers normalize This subcommand calculates the ERCC size factor and normalize the gene quantification table based on that scaling This subcommand scales tables using different scaling methods correlationallagainstall This subcommand calculate the correlation coefficients all against all selectingspecificfeatures This subcommand allows to select specific features in a normalized table (ncRNAs, CDS, etc.) heatmap This subcommand is useful to visualize the in-gradient behavior of a larger group of transcripts or proteins plotkinetics This subcommand plot the kinetics of a specific transcript or protein to better visualize their behavior within the gradient clustering This subcommand performs unsupervised clustering using different algorithm clusteringelbow This subcommands plot the elbow graph in order to choose the ideal number of clusters necessary for the k-means and the hierarchical clustering silhouetteanalysis This subcommand can be used to interpret the distance between clusters pca This subcommand performs the PCA-principal component dimension reduction tsne This subcommand performs the t-sne dimension reduction umap This subcommand performs the umap dimension reduction correlationrnasprotein This subcommand performs the Spearman or Pearson correlation coefficients of two tables. correlationdistributiongraph This subcommand plots the distribution of the correlation coefficients as histogram plotnetworkgraph This subcommand plots the network plot. It can be used to plot for example sequencing data vs protein data or ncRNAs vs proteins etc. clusteringproteins This subcommand performs the unsupervised clustering of protein data dimensionreductionproteins t-sne analysis of Mass spectrometry data correlationspecificgene This subcommand calculate the Spearman or Pearson correlation of a specific gene or protein against all interactiveplots This subcommand is useful to visualize interactive a plot after a dimension reduction algorithm has been applied. correlationreplicates This subcommand allows to see the distribution of the correlation coefficient between two biological replicates findcomplexes With this subcommand we look at how many of the know proteincomplexes are actually present in our specific data sets.It finds if all the subunit of that specific complexes are present and calculate the correlation version Print version

optional arguments: -h, --help show this help message and exit ```

Owner

  • Name: Förstner Lab
  • Login: foerstner-lab
  • Kind: organization
  • Location: Cologne, Germany

Research Lab of Prof. Konrad Förstner at ZB MED - Information Center for Life Sciences

Citation (CITATION.cff)

cff-version: 1.2.0
message: If you use this software, please cite it as below.
authors:
  - family-names: Di Giorgio
    given-names: Silvia
    orcid: https://orcid.org/0000-0002-8565-1421
  - family-names: Förstner
    given-names: Konrad U.
    orcid: https://orcid.org/0000-0002-1481-2996
title: "GRADitude - The GRAD-seq data analysis tool"
version: 0.1.0
doi: 10.5281/zenodo.3911964
date-released: 2020-06-28

GitHub Events

Total
  • Push event: 1
Last Year
  • Push event: 1

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 411
  • Total Committers: 7
  • Avg Commits per committer: 58.714
  • Development Distribution Score (DDS): 0.372
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
silvia s****o@g****m 258
Konrad Förstner k****d@f****g 47
Silvia Di Giorgio d****o@z****e 44
Silvia Di Giorgio s****7 29
Silvia Di Giorgio s****o@u****e 28
Konrad Förstner k****d 3
dependabot[bot] 4****] 2
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 8
  • Total pull requests: 3
  • Average time to close issues: about 2 months
  • Average time to close pull requests: 19 days
  • Total issue authors: 2
  • Total pull request authors: 1
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 3
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • konrad (7)
  • vanlammessa (1)
Pull Request Authors
  • dependabot[bot] (3)
Top Labels
Issue Labels
Pull Request Labels
dependencies (3)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 17 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 1
  • Total versions: 4
  • Total maintainers: 2
pypi.org: graditude

A tool for the analysis of GRAD-seq data

  • Versions: 4
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 17 Last month
Rankings
Dependent packages count: 10.0%
Dependent repos count: 21.7%
Average: 27.0%
Forks count: 29.8%
Stargazers count: 31.9%
Downloads: 41.8%
Maintainers (2)
Last synced: 10 months ago

Dependencies

setup.py pypi
  • Jinja2 *
  • bokeh *
  • holoviews *
  • matplotlib *
  • networkx *
  • numpy *
  • pandas *
  • pytest *
  • scikit-learn *
  • scipy *
  • seaborn *
  • umap-learn *