graditude

GRADitude - The GRAD-seq data analysis tool

https://github.com/foerstner-lab/graditude

Science Score: 64.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
✓
Academic publication links
Links to: zenodo.org
✓
Committers with academic emails
1 of 7 committers (14.3%) from academic institutions
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (10.7%) to scientific vocabulary

Keywords from Contributors

mesh interactive

Last synced: 10 months ago · JSON representation ·

Repository

GRADitude - The GRAD-seq data analysis tool

Basic Info

Host: GitHub
Owner: foerstner-lab
License: other
Language: Python
Default Branch: main
Homepage: https://foerstner-lab.github.io/GRADitude/
Size: 9.06 MB

Statistics

Stars: 1
Watchers: 2
Forks: 0
Open Issues: 7
Releases: 1

Created over 9 years ago · Last pushed 10 months ago

Metadata Files

Readme License Citation

About GRADitude

GRADitude - The GRAD-seq data analysis tool

Grad-seq is a high-throughput profiling approach for the organism-wide detection of RNA-RNA and RNA-protein interactions in which molecular complexes are separated in a gradient by shape and size (Smirnov et al. 2016 PNAS). It offers new means to study the role of different RNA and protein components in various macromolecular assemblies by analyzing fractions of a glycerol gradient by a high- throughput sequencing approaches combined with mass spectrometry. The Grad-seq approach offers a way to study the distribution of all RNA involvement in various macromolecular assemblies.

GRADitude is a computational tool for the analysis of Grad-seq in-gradient profiling.

This open source tool performs all required steps to translate sequencing data of a Grad-seq experiment into a list of potential molecular complexes.

Documentation

Documentation can be found on here.

Installation

Current there is no proper pip package for GRADitude available - but it's work in progress. :)

Github

All the source code of GRADitude can be retrieve from our Git repository. Using the following commands can clone the source code easily.

$ git clone https://github.com/foerstner-lab/GRADitude.git

$ git clone git@github.com:foerstner-lab/GRADitude.git

In order to make GRADitude runnable, we have to create a soft link of graditudelib in bin.

$ cd GRADitude/bin

$ ln -s ../graditudelib .

Arguments

```bash usage: graditude [-h] [--version] {create,minrowsumercc,minrowsum,dropcolumn,robustregression,normalize,scaling,correlationallagainstall,selectingspecificfeatures,heatmap,plotkinetics,clustering,clusteringelbow,silhouetteanalysis,pca,tsne,umap,correlationrnasprotein,correlationdistributiongraph,plotnetworkgraph,clusteringproteins,dimensionreductionproteins,correlationspecificgene,interactiveplots,correlationreplicates,findcomplexes} ...

positional arguments: {create,minrowsumercc,minrowsum,dropcolumn,robustregression,normalize,scaling,correlationallagainstall,selectingspecificfeatures,heatmap,plotkinetics,clustering,clusteringelbow,silhouetteanalysis,pca,tsne,umap,correlationrnasprotein,correlationdistributiongraph,plotnetworkgraph,clusteringproteins,dimensionreductionproteins,correlationspecificgene,interactiveplots,correlationreplicates,findcomplexes} commands minrowsumercc Filter the ERCC table based on the min row sum. It calculates the sum rowwise and discard the rows with a sum below the threshold specified minrowsum Filter the gene quantification table based on the min row sum. It calculates the sum rowwise and discard the rows with a sum below the threshold specified dropcolumn It filters a table dropping a specific column.It is usually used to drop the lysate column that is not required for the downstream analysis robustregression It compares the ERCC concentration in mix with the ERCC reads and take it out the outliers normalize This subcommand calculates the ERCC size factor and normalize the gene quantification table based on that scaling This subcommand scales tables using different scaling methods correlationallagainstall This subcommand calculate the correlation coefficients all against all selectingspecificfeatures This subcommand allows to select specific features in a normalized table (ncRNAs, CDS, etc.) heatmap This subcommand is useful to visualize the in-gradient behavior of a larger group of transcripts or proteins plotkinetics This subcommand plot the kinetics of a specific transcript or protein to better visualize their behavior within the gradient clustering This subcommand performs unsupervised clustering using different algorithm clusteringelbow This subcommands plot the elbow graph in order to choose the ideal number of clusters necessary for the k-means and the hierarchical clustering silhouetteanalysis This subcommand can be used to interpret the distance between clusters pca This subcommand performs the PCA-principal component dimension reduction tsne This subcommand performs the t-sne dimension reduction umap This subcommand performs the umap dimension reduction correlationrnasprotein This subcommand performs the Spearman or Pearson correlation coefficients of two tables. correlationdistributiongraph This subcommand plots the distribution of the correlation coefficients as histogram plotnetworkgraph This subcommand plots the network plot. It can be used to plot for example sequencing data vs protein data or ncRNAs vs proteins etc. clusteringproteins This subcommand performs the unsupervised clustering of protein data dimensionreductionproteins t-sne analysis of Mass spectrometry data correlationspecificgene This subcommand calculate the Spearman or Pearson correlation of a specific gene or protein against all interactiveplots This subcommand is useful to visualize interactive a plot after a dimension reduction algorithm has been applied. correlationreplicates This subcommand allows to see the distribution of the correlation coefficient between two biological replicates findcomplexes With this subcommand we look at how many of the know proteincomplexes are actually present in our specific data sets.It finds if all the subunit of that specific complexes are present and calculate the correlation version Print version

optional arguments: -h, --help show this help message and exit ```

Owner

Name: Förstner Lab
Login: foerstner-lab
Kind: organization
Location: Cologne, Germany

Repositories: 22
Profile: https://github.com/foerstner-lab

Research Lab of Prof. Konrad Förstner at ZB MED - Information Center for Life Sciences

Citation (CITATION.cff)

cff-version: 1.2.0
message: If you use this software, please cite it as below.
authors:
  - family-names: Di Giorgio
    given-names: Silvia
    orcid: https://orcid.org/0000-0002-8565-1421
  - family-names: Förstner
    given-names: Konrad U.
    orcid: https://orcid.org/0000-0002-1481-2996
title: "GRADitude - The GRAD-seq data analysis tool"
version: 0.1.0
doi: 10.5281/zenodo.3911964
date-released: 2020-06-28

GitHub Events

Total

Push event: 1

Last Year

Push event: 1

Committers

Last synced: over 2 years ago

All Time

Total Commits: 411
Total Committers: 7
Avg Commits per committer: 58.714
Development Distribution Score (DDS): 0.372

Past Year

Commits: 0
Committers: 0
Avg Commits per committer: 0.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
silvia	s**o@g**m	258
Konrad Förstner	k**d@f**g	47
Silvia Di Giorgio	d**o@z**e	44
Silvia Di Giorgio	s****7	29
Silvia Di Giorgio	s**o@u**e	28
Konrad Förstner	k****d	3
dependabot[bot]	4****]	2

Committer Domains (Top 20 + Academic)

uni-wuerzburg.de: 1 zbmed.de: 1 foerstner.org: 1

Issues and Pull Requests

Last synced: 10 months ago

All Time

Total issues: 8
Total pull requests: 3
Average time to close issues: about 2 months
Average time to close pull requests: 19 days
Total issue authors: 2
Total pull request authors: 1
Average comments per issue: 0.0
Average comments per pull request: 0.0
Merged pull requests: 2
Bot issues: 0
Bot pull requests: 3

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

konrad (7)
vanlammessa (1)

Pull Request Authors

dependabot[bot] (3)

Top Labels

Issue Labels

Pull Request Labels

dependencies (3)

Packages

Total packages: 1
Total downloads:
- pypi 17 last-month

Total dependent packages: 0
Total dependent repositories: 1
Total versions: 4
Total maintainers: 2

pypi.org: graditude

A tool for the analysis of GRAD-seq data

Homepage: https://github.com/foerstner-lab/GRADitude.git
Documentation: https://graditude.readthedocs.io/
License: ISC License (ISCL)
Latest release: 0.1.3
published about 5 years ago

Versions: 4
Dependent Packages: 0
Dependent Repositories: 1
Downloads: 17 Last month

Rankings

Dependent packages count: 10.0%

Dependent repos count: 21.7%

Average: 27.0%

Forks count: 29.8%

Stargazers count: 31.9%

Downloads: 41.8%

Maintainers (2)

konrad silviadigiorgio

Last synced: 10 months ago

Dependencies

setup.py pypi

Jinja2 *
bokeh *
holoviews *
matplotlib *
networkx *
numpy *
pandas *
pytest *
scikit-learn *
scipy *
seaborn *
umap-learn *

graditude

Science Score: 64.0%

Keywords from Contributors

Repository

Basic Info

Statistics

Metadata Files

README.md

About GRADitude

Documentation

Installation

Github

Arguments

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Committer Domains (Top 20 + Academic)

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Packages

pypi.org: graditude

Rankings

Maintainers (2)

Dependencies