1000-tools-paper

Code and analysis for the 1000 tools paper

https://github.com/scrna-tools/1000-tools-paper

Keywords

analysis database publication reproducibility scrna-seq tools

Repository


Basic Info
  • Host: GitHub
  • Owner: scRNA-tools
  • License: mit
  • Language: R
  • Default Branch: main
  • Homepage:
  • Size: 20.6 MB
Statistics
  • Stars: 10
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 3
Created over 4 years ago · Last pushed over 4 years ago

README.md

1000 tools paper

This repository contains code and analysis for the "Over 1000 tools reveal trends in the single-cell RNA-seq analysis landscape" publication.

Directory structure

  • R/ - Folder containing R code files. See documentation in files for more detail.
  • output/ - Output files created by the workflow. These versions are stored here as a record but will be overwritten by running the workflow.
    • data-tables/ - Data files created by the workflow
    • figures/ - Figures shown in the paper
    • supplementary/ - Supplementary figures shown in the paper
    • tables/ - Tables shown in the paper
  • .gitignore - Git configuration file
  • 1000-tools.Rproj - RStudio project file
  • LICENSE.md - License file
  • README.md - This README file
  • renv/ - Internal {renv} files
  • renv.lock - {renv} lock file specifying R package dependencies

Other files created during setup or by the workflow

  • .Renviron - Local R configuration file, created as part of setting up
  • _cache/ - Cache files created by the workflow to speed up some parts of the analysis
  • _targets/ - Internal {targets} files

Setting up

R Dependencies

R package dependencies are managed using {renv}. They should be installed automatically when you start an R session inside the repository, but to make sure, run:

```r
renv::restore()
```

Crossref

Information about publications and preprints is retrieved from the Crossref API using the {rcrossref} package. As explained in ?rcrossref::`rcrossref-package`, Crossref provides faster access to people who give an email address. To do this, add the following line to your .Renviron:

crossref_email=your@email.com
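If you prefer to script this step, the sketch below adds or updates a single entry in an .Renviron-style file without creating duplicate lines. The function name set_renviron_var is a hypothetical helper, not something provided by this repository or by R, and the email address is a placeholder.

```bash
# Hypothetical helper (not part of this repository): set KEY=VALUE in an
# .Renviron-style file, replacing any existing KEY line so that re-running
# it does not create duplicates.
set_renviron_var() {
  local key="$1" value="$2" file="${3:-.Renviron}"
  local tmp
  tmp="$(mktemp)"
  # Copy every line that does not already define KEY. grep exits with 1
  # when it copies nothing, which is fine here.
  if [ -f "$file" ]; then
    grep -v "^${key}=" "$file" > "$tmp" || true
  fi
  printf '%s=%s\n' "$key" "$value" >> "$tmp"
  # Write back in place so an existing file keeps its permissions.
  cat "$tmp" > "$file"
  rm -f "$tmp"
}

# Example (placeholder address, replace with your own):
#   set_renviron_var crossref_email "your@email.com"
```

R reads .Renviron at startup, so restart the R session after changing the file.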

johnnydep

Package dependencies for PyPI tools are retrieved using the johnnydep tool (https://pypi.org/project/johnnydep/). For these stages to work you must have johnnydep installed. The easiest way to do that is using pip or conda:

```bash
pip install johnnydep

# OR

conda install johnnydep
```

Once johnnydep is installed, find its path using which and set a JOHNNYDEP_PATH variable in your .Renviron:

```bash
which johnnydep
```

JOHNNYDEP_PATH=/path/to/your/johnnydep
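The lookup and the .Renviron line can be combined in one step. The sketch below uses the shell built-in command -v in place of which (for an executable on the PATH both report its location) and prints a line in the form shown above. The function name renviron_path_line is a hypothetical helper introduced here for illustration.

```bash
# Hypothetical helper (not part of this repository): print VAR=/path/to/tool
# for a tool found on the PATH, or fail with a message if it is missing.
renviron_path_line() {
  local var_name="$1" tool="$2"
  local tool_path
  if ! tool_path="$(command -v "$tool")"; then
    echo "$tool not found on PATH; install it first" >&2
    return 1
  fi
  printf '%s=%s\n' "$var_name" "$tool_path"
}

# Example (appends to .Renviron in the current directory):
#   renviron_path_line JOHNNYDEP_PATH johnnydep >> .Renviron
```

Failing early when the tool is missing avoids writing an empty JOHNNYDEP_PATH entry.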

Fonts

To make sure fonts used in plots are available follow these steps:

  1. Download and install the Noto Sans and Noto Sans Math fonts

    • For macOS users the easiest way to do this is using Homebrew:

    ```bash
    brew install font-noto-sans font-noto-sans-math
    ```

    • Noto Sans is also available from Google Fonts

  2. Import fonts into R by running extrafont::font_import()

If these fonts are not available, the plots will still be produced; they will just use the default font.

Analytics

The workflow can also analyse usage of the scRNA-tools website, but this requires access to the Google Analytics group, so it will need to be switched off for most people.

No analytics access (most people)

Edit the _targets.R file and make sure the include_analytics variable is set to FALSE.

```r
include_analytics <- FALSE
```

Analytics access

Data for plots showing usage statistics of the scRNA-tools website are collected using the {googleAnalyticsR} package. For this to work, you must set up authentication with the googleAnalyticsR::ga_auth_setup() function, following the instructions at https://code.markedmondson.me/googleAnalyticsR/articles/setup.html.

At the end of the process your .Renviron file should contain lines similar to these:

GAR_CLIENT_JSON=/path/to/oauth.json
GARGLE_EMAIL=your@email.com

Running analysis

The analysis workflow is managed using {targets}. Once setup is complete, you can run the workflow using:

```r
targets::tar_make()
```
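Before starting a long run it can help to confirm that the .Renviron entries described under "Setting up" are in place. The sketch below is one way to do that from the shell. The function name missing_renviron_keys is a hypothetical helper, and which keys you check is up to you (the Google Analytics entries only matter if analytics are enabled).

```bash
# Hypothetical pre-flight check (not part of this repository): print each
# expected key that has no KEY= line in an .Renviron-style file.
# Returns non-zero if any key is missing or the file does not exist.
missing_renviron_keys() {
  local file="$1"
  shift
  local missing=0 key
  for key in "$@"; do
    if ! grep -q "^${key}=" "$file" 2>/dev/null; then
      echo "$key"
      missing=1
    fi
  done
  return "$missing"
}

# Example (not run here): only start the workflow if nothing is missing.
#   missing_renviron_keys .Renviron crossref_email JOHNNYDEP_PATH \
#     && Rscript -e 'targets::tar_make()'
```

Running the workflow through Rscript -e is equivalent to calling targets::tar_make() in an R session started in the repository.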

Some of the steps (collecting reference and GitHub repository information) take a while to run. Once the workflow is complete, the output files can be found in the output/ directory. If you want to view any of the intermediate parts of the workflow, you can load the output of any target using:

```r
targets::tar_load(target_name)
```

Updating analysis

The analysis is pinned to a particular date and version of the scRNA-tools database. If you want to repeat the analysis for a more recent version, edit _targets.R and modify the date target:

```r
tar_target(
    date,
    "YYYY-MM-DD"
)
```

License

The code is available under the MIT license.

Citation

If you use any of the code in this repository or the analysis in the publication, please cite:

Zappia L, Theis FJ. "Over 1000 tools reveal trends in the single-cell RNA-seq analysis landscape", Genome Biology (2021), DOI: 10.1186/s13059-021-02519-4

```bibtex
@ARTICLE{Zappia2021-bc,
  title    = "Over 1000 tools reveal trends in the single-cell {RNA-seq} analysis landscape",
  author   = "Zappia, Luke and Theis, Fabian J",
  journal  = "Genome Biol.",
  volume   = 22,
  number   = 1,
  pages    = "301",
  month    = oct,
  year     = 2021,
  language = "en",
  doi      = "10.1186/s13059-021-02519-4",
  url      = "https://doi.org/10.1186/s13059-021-02519-4"
}
```

Owner

  • Name: scRNA-tools
  • Login: scRNA-tools
  • Kind: organization

scRNA-tools is a catalogue of software tools for analysing scRNA-seq data

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use any of the code in this repository or analysis in the publication please cite as below."
authors:
- family-names: "Zappia"
  given-names: "Luke"
  orcid: "https://orcid.org/0000-0001-7744-8565"
- family-names: "Theis"
  given-names: "Fabian J"
  orcid: "https://orcid.org/0000-0002-2419-1943"
title: "1000 tools paper"
url: "https://github.com/scRNA-tools/1000-tools-paper"
preferred-citation:
  type: article
  authors:
  - family-names: "Zappia"
    given-names: "Luke"
    orcid: "https://orcid.org/0000-0001-7744-8565"
  - family-names: "Theis"
    given-names: "Fabian J"
    orcid: "https://orcid.org/0000-0002-2419-1943"
  doi: "10.1186/s13059-021-02519-4"
  journal: "Genome Biology"
  month: 10
  title: "Over 1000 tools reveal trends in the single-cell RNA-seq analysis landscape"
  year: 2021

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1