bioinfo-targets

Workshop on modular, reproducible bioinformatics workflows with the {targets} R package

https://github.com/iscb-academy/bioinfo-targets

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.5%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Workshop on modular, reproducible bioinformatics workflows with the {targets} R package

Basic Info
  • Host: GitHub
  • Owner: ISCB-Academy
  • License: cc-by-4.0
  • Default Branch: main
  • Size: 18.6 KB
Statistics
  • Stars: 3
  • Watchers: 4
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Created over 3 years ago · Last pushed over 3 years ago
Metadata Files
Readme Contributing License Code of conduct Citation

README.md

Modular, reproducible bioinformatics workflows with the {targets} R package

CC BY 4.0

Modern bioinformatics pipelines can be incredibly complex, but all tend to follow a common pattern: they start with raw data, then pass the data through various programs until arriving at a final result. If this is done in an ad-hoc, unorganized fashion, the results may never be reproducible or even worse, unreliable and/or wrong. Pipeline management software is therefore essential to obtain results that are robust and reproducible. The targets R package is a recently developed workflow manager that comes with many excellent features for bioinformatics, including data caching, pipeline-level parallelization, and HPC support. In this hands-on workshop, I will demonstrate how targets can be used in concert with other tools like docker and conda to orchestrate modular, reproducible bioinformatics pipelines. The workshop will feature variant-calling as an example, but the concepts and tools can be applied to nearly any analysis.

Pre-requisites: Basic familiarity with R. Installations of recent versions of R, RStudio, conda, and docker.

Duration: 2 hours

Time/Date: June 30, 2022 at 8:30 PM BST

Who is this for?

Anybody interested in maintaining bioinformatics pipelines with R. There are many pipeline tools available, but I can't tell you which is best for your project. For example, if you mostly do things in python, snakemake may be a better alternative, etc.

This workshop is primarily intended for people who already use R and want to use it to construct and maintain bioinformatics pipelines.

Format

The workshop is split into two parts:

  • Part I will introduce the {targets} R package and demonstrate its basic usage.
  • Part II will show how to use {targets} in combination with docker and conda to orchestrate bioinformatics workflows.

Resources

Objectives

By the end of the workshop, participants will be able to: - Use basic functions of the {targets} R package - Write wrapper functions calling programs to run in docker from R - Use the {targets} R package to construct bioinformatics pipelines

Not covered

This workshop will not include: - Details about docker - Details about R - Thorough explanation of variant calling

Contributing :hearts:

  • If you like it, leave your star in this project :star2:
  • If you would like to complain/suggest/contribute to this project, feel free to open a issue :heart_decoration:
  • Please follow our contributing guidelines.

Citation

If you use any of the materials of this lesson, please cite as:

Nitta, J.H. 2022. "Modular, reproducible bioinformatics workflows with the targets R package" https://github.com/ISCB-Academy/bioinfo-targets

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

CC BY 4.0

Owner

  • Name: ISCB Academy
  • Login: ISCB-Academy
  • Kind: organization

ISCB Academy is part of the ISCB, it aims to provide a hands-on experience accessing and using newly developed bioinformatics tools while ensuring best practice

Citation (CITATION.cff)

cff-version: 1.1.0
message: "If you use these materials, please cite as below."
authors:
  - family-names: Nitta
    given-names: Joel
    orcid: https://orcid.org/0000-0003-4719-7472
title: "Modular, reproducible bioinformatics workflows with the targets R package"
version: v1.0.0
date-released: 2022-06-27

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1