patRoon 2.0

patRoon 2.0: Improved non-target analysis workflows including automated transformation product screening - Published in JOSS (2022)

https://github.com/rickhelmus/patroon

Science Score: 100.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 14 DOI reference(s) in README and JOSS metadata
  • Academic publication links
    Links to: joss.theoj.org, zenodo.org
  • Committers with academic emails
    1 of 8 committers (12.5%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
    Published in Journal of Open Source Software

Keywords

mass-spectrometry non-target r

Scientific Fields

Sociology Social Sciences - 87% confidence
Last synced: 4 months ago · JSON representation ·

Repository

Workflow solutions for mass-spectrometry based non-target analysis.

Basic Info
Statistics
  • Stars: 66
  • Watchers: 8
  • Forks: 22
  • Open Issues: 19
  • Releases: 14
Topics
mass-spectrometry non-target r
Created almost 8 years ago · Last pushed 4 months ago
Metadata Files
Readme Changelog License Citation

README.md

patRoon

CircleCI btd codecov patRoon r-universe DOI:10.1186/s13321-020-00477-w DOI REUSE status

patRoon aims to provide comprehensive mass spectrometry based non-target analysis (NTA) workflows for environmental analysis. The name is derived from a Dutch word that means pattern and may also be an acronym for hyPhenated mAss specTROmetry nOn-target aNalysis.

Project news

October 2024 patRoon 2.3.3 is now released. This is a small maintenance release which correct the SIRIUS login procedure and comes with several other bug fixes. Please see the Project NEWS for more details.

July 2024 patRoon 2.3.2 is now released. This is a small maintenance release with several important bug fixes and new functions to inspect raw MS data (thanks to Ricardo Cunha). Please see the Project NEWS for more details.

December 2023 patRoon 2.3.1 is now released. This is a small maintenance release with several bug fixes. The most important fix is a correction for the prediction of concentrations from SIRIUS fingerprints. Please see the Project NEWS for more details and important notes on updating patRoon.

November 2023 patRoon 2.3 is now released. The most significant changes include improved installation strategies and integration of MS2Tox and MS2Quant to predict feature toxicities and concentrations. Please see the Project NEWS for more details and important notes on updating patRoon.

May 2023 patRoon 2.2 is now released. The most significant change is the addition of a new reporting interface, which brings a much improved HTML interface, many optimizations and other important new functionality. Furthermore, patRoon 2.2 introduces improved SIRIUS 5 support, a new TP screening algorithm using formula libraries and many other improvements, of which many thanks to the great user feedback. Please see the Project NEWS for details.

March 2023 The Docker images moved to a new host. Please see the see installation details in the handbook to obtain the latest images.

May 2022 patRoon 2.1 is now available. This new release integrates prediction of transformation products with CTS, adds several feature intensity normalization methods, adds new functionality and improvements for reporting TP data and supports loading, processing and annotation with MS libraries such as MassBank. Please see the Project NEWS for details.

Introduction

Mass spectrometry based non-target analysis is used to screen large numbers of chemicals simultaneously. For this purpose, high resolution mass spectrometry instruments are used which are typically coupled (or hyphenated) with chromatography (e.g. LC or GC). The size and complexity of resulting data makes manual processing impractical. Many software tools were/are developed to facilitate a more automated approach. However, these tools are generally not optimized for environmental workflows and/or only implement parts of the functionality required.

patRoon combines established software tools with novel functionality in order to provide comprehensive NTA workflows. The different algorithms are provided through a consistent interface, which removes the need to know all the details of each individual software tool and performing tedious data conversions during the workflow. The table below outlines the major functionality of patRoon.

Functionality | Description | Algorithms ---------------------- | ------------------------------------------------------------------------ | ----------------------- Raw data pre-treatment | MS format conversion (e.g. vendor to mzML) and calibration. | ProteoWizard, OpenMS, DataAnalysis Feature extraction | Finding features and grouping them across analyses. | XCMS, OpenMS, enviPick, DataAnalysis, KPIC2, SIRIUS, SAFD Suspect screening | Finding features with suspected presence by MS and chromatographic data. Estimation of identification confidence levels. | Native MS data extraction | Automatic extraction and averaging of feature MS(/MS) peak lists. | Native, mzR, DataAnalysis Formula annotation | Automatic calculation of formula candidates for features. | GenForm, SIRIUS, DataAnalysis Compound annotation | Automatic (in silico) compound annotation of features. | MetFrag, SIRIUS, Native Componentization & adduct annotation | Grouping of related features based on chemistry (e.g. isotopes, adducts and homologs), hierarchical clustering or MS/MS similarity into components. Using adduct and isotope annotations for prioritizing features and improving formula/compound annotations. | RAMClustR, CAMERA, nontarget R package, OpenMS, cliqueMS, Native Combining algorithms | Combine data from different algorithms (e.g. features, annotations) and generate a consensus. | Native Sets workflows | Simultaneous processing and combining +/- MS ionization data | Native Transformation product (TP) screening | Automatic screening of TPs using library/in-silico data, MS similarities and classifications. Tools to improve compound TP annotation. | BioTransformer, PubChemLite, Native Reporting | Automatic reporting of all important workflow data. An example HTML report can be viewed here. | Native Data clean-up & prioritization | Filters for blanks, replicates, intensity thresholds, neutral losses, annotation scores, identification levels and many more. | Native Data curation | Several graphical interactive tools and functions to inspect and remove unwanted data. | Native

The workflow of non-target analysis typically depends on the aims and requirements of the study and the instrumentation and methodology used for sample analysis. For this reason, patRoon does not enforce a certain workflow. Instead, most workflow steps are optional, fully configurable and algorithms can easily be mixed or even combined.

Implementation details

  • patRoon is implemented as an R package, which allows easy interfacing with the many other R based MS tools and other data processing functionality from R.
  • Fully open-source (GPLv3).
  • Developed on Windows, Linux and macOS
  • S4 classes and generics are used to implement a consistent interface to all supported algorithms.
  • Continuous integration is used to automatically perform unit tests, update the Website and documentation, and maintaining installation resources such as the patRoonDeps repository, Docker image and patRoon bundle (see the handbook for more details).
  • Supports all major instrument vendor input formats (through usage of ProteoWizard and DataAnalysis).
  • Optimizations
    • data.table is used internally as a generally much more efficient alternative to data.frame.
    • The processx and future R packages are used for parallelization.
    • Results from workflow steps are cached within a SQLite database to avoid repeated computations.
    • Code for loading MS and EIC data, MS similarity calculations and others were implemented in C++ to reduce computational times.
  • The RDCOMClient R package is used to interface with Bruker DataAnalysis algorithms.
  • The Shiny R package was used to implement several GUI tools.
  • The reporting functionality relies on the excellent R markdown and related packages such as flexdashboard, bslib and reactable.

Installation

patRoon and its dependencies can be installed in various ways. Please see the installation section in the handbook for more information.

Getting started

For a very quick start:

r library(patRoon) newProject()

The newProject() function will pop-up a dialog screen (requires R Studio), which will allow you to quickly select the analyses and common workflow options to subsequently generate a template R processing script.

However, for a better guide to get started it is recommended to read the tutorial. Afterwards the handbook is a recommended read if you want to know more about advanced usage of patRoon. Finally, the reference outlines all the details of the patRoon package.

Citing

When you use patRoon please cite its publications:

Rick Helmus, Thomas L. ter Laak, Annemarie P. van Wezel, Pim de Voogt and Emma L. Schymanski. patRoon: open source software platform for environmental mass spectrometry based non-target screening. Journal of Cheminformatics 13, 1 (2021)

Rick Helmus, Bas van de Velde, Andrea M. Brunner, Thomas L. ter Laak, Annemarie P. van Wezel and Emma L. Schymanski. patRoon 2.0: Improved non-target analysis workflows including automated transformation product screening. Journal of Open Source Software, 7(71), 4029

patRoon builds on many open-source software tools and open data sources. Therefore, it is important to also cite their work when using these algorithms via patRoon.

Contributing

For bug reports, code contributions (pull requests), questions, suggestions and general feedback please use the GitHub page.

Owner

  • Login: rickhelmus
  • Kind: user

JOSS Publication

patRoon 2.0: Improved non-target analysis workflows including automated transformation product screening
Published
March 12, 2022
Volume 7, Issue 71, Page 4029
Authors
Rick Helmus ORCID
Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, P.O. Box 94240, 1090 GE Amsterdam, The Netherlands.
Bas van de Velde ORCID
Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, P.O. Box 94240, 1090 GE Amsterdam, The Netherlands., KWR Water Research Institute, Chemical Water Quality and Health, P.O. Box 1072, 3430 BB Nieuwegein, The Netherlands., Division of BioAnalytical Chemistry, Amsterdam Institute of Molecular and Life Sciences (AIMMS), Vrije Universiteit Amsterdam, 1081 HV Amsterdam, The Netherlands.
Andrea M. Brunner ORCID
KWR Water Research Institute, Chemical Water Quality and Health, P.O. Box 1072, 3430 BB Nieuwegein, The Netherlands., TNO, Environmental Modelling, Sensing and Analysis (EMSA), Princetonlaan 8, 3584 CB, Utrecht, The Netherlands.
Thomas L. ter Laak ORCID
Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, P.O. Box 94240, 1090 GE Amsterdam, The Netherlands., KWR Water Research Institute, Chemical Water Quality and Health, P.O. Box 1072, 3430 BB Nieuwegein, The Netherlands.
Annemarie P. van Wezel ORCID
Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, P.O. Box 94240, 1090 GE Amsterdam, The Netherlands.
Emma L. Schymanski ORCID
Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, L-4367 Belvaux, Luxembourg.
Editor
Kristina Riemer ORCID
Tags
R non-target screening transformation products high resolution mass spectrometry

Citation (CITATION.cff)

cff-version: 1.0.3
authors:
  -
    family-names: Helmus
    given-names: Rick
  -
    family-names: 'ter Laak'
    given-names: 'Thomas L.'
  -
    family-names: 'van Wezel'
    given-names: 'Annemarie P.'
  -
    family-names: 'de Voogt'
    given-names: Pim
  -
    family-names: Schymanski
    given-names: 'Emma L.'
date-released: 2021-01-06T00:00:00.000Z
doi: 10.1186/s13321-020-00477-w
title: 'patRoon: open source software platform for environmental mass spectrometry based non-target screening'
url: 'http://dx.doi.org/10.1186/s13321-020-00477-w'

GitHub Events

Total
  • Create event: 1
  • Issues event: 22
  • Release event: 1
  • Watch event: 4
  • Issue comment event: 49
  • Push event: 197
  • Pull request review comment event: 4
  • Pull request review event: 5
  • Pull request event: 6
  • Fork event: 5
Last Year
  • Create event: 1
  • Issues event: 23
  • Release event: 1
  • Watch event: 4
  • Issue comment event: 49
  • Push event: 197
  • Pull request review comment event: 4
  • Pull request review event: 5
  • Pull request event: 6
  • Fork event: 5

Committers

Last synced: 8 months ago

All Time
  • Total Commits: 4,524
  • Total Committers: 8
  • Avg Commits per committer: 565.5
  • Development Distribution Score (DDS): 0.072
Past Year
  • Commits: 131
  • Committers: 3
  • Avg Commits per committer: 43.667
  • Development Distribution Score (DDS): 0.397
Top Committers
Name Email Commits
Rick Helmus r****s@u****l 4,200
CircleCI updater s****g@e****m 297
LeonSaal l****l@p****e 11
Ricardo Cunha c****a@i****e 8
Rick Helmus r****s@g****m 5
Ricardo Cunha 6****a 1
Egon Willighagen e****n@g****m 1
Colton Lloyd c****6@g****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 4 months ago

All Time
  • Total issues: 89
  • Total pull requests: 17
  • Average time to close issues: about 2 months
  • Average time to close pull requests: 22 days
  • Total issue authors: 32
  • Total pull request authors: 4
  • Average comments per issue: 2.46
  • Average comments per pull request: 1.24
  • Merged pull requests: 9
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 16
  • Pull requests: 6
  • Average time to close issues: 6 days
  • Average time to close pull requests: 16 days
  • Issue authors: 6
  • Pull request authors: 3
  • Average comments per issue: 1.44
  • Average comments per pull request: 1.5
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • LeonSaal (26)
  • hechth (15)
  • Boris-Droz (7)
  • drewszabo (5)
  • akogler (4)
  • Theodoralee03 (2)
  • bhc928 (2)
  • rachsmo (2)
  • MK3491 (2)
  • tsufz (2)
  • Theodoralee27 (1)
  • mingw86 (1)
  • Begotalavera (1)
  • dgcovell (1)
  • VR-13 (1)
Pull Request Authors
  • ricardo-cunha (8)
  • LeonSaal (7)
  • Theodora-lhy (1)
  • coltonlloyd (1)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

DESCRIPTION cran
  • R >= 3.5.0 depends
  • Biobase * imports
  • BiocParallel * imports
  • CAMERA * imports
  • DBI * imports
  • DT * imports
  • MSnbase * imports
  • R.utils * imports
  • RColorBrewer * imports
  • RSQLite >= 2.2.4 imports
  • Rcpp * imports
  • Rdpack * imports
  • UpSetR * imports
  • VennDiagram * imports
  • backports * imports
  • checkmate >= 1.8.5 imports
  • circlize * imports
  • cluster * imports
  • data.table * imports
  • dendextend * imports
  • digest * imports
  • dynamicTreeCut * imports
  • enviPat * imports
  • fastcluster * imports
  • fingerprint * imports
  • flexdashboard * imports
  • fs * imports
  • fst * imports
  • future * imports
  • future.apply * imports
  • glue * imports
  • gplots * imports
  • graphics * imports
  • grid * imports
  • heatmaply * imports
  • htmlwidgets * imports
  • httr * imports
  • igraph * imports
  • jsonlite * imports
  • kableExtra * imports
  • keys * imports
  • knitr * imports
  • magick * imports
  • magrittr * imports
  • methods * imports
  • miniUI * imports
  • mzR * imports
  • parallel * imports
  • processx * imports
  • rJava * imports
  • rcdk * imports
  • rhandsontable * imports
  • rmarkdown * imports
  • rsm * imports
  • rstudioapi * imports
  • shiny * imports
  • shinyjs * imports
  • stats * imports
  • tools * imports
  • utils * imports
  • visNetwork * imports
  • withr * imports
  • xcms * imports
  • yaml * imports
  • DiagrammeR * suggests
  • DiagrammeRsvg * suggests
  • KPIC * suggests
  • MetaClean * suggests
  • MetaCleanData * suggests
  • RAMClustR * suggests
  • RDCOMClient * suggests
  • covr * suggests
  • devtools * suggests
  • enviPick * suggests
  • metfRag * suggests
  • nontarget * suggests
  • patRoonData * suggests
  • pkgload * suggests
  • rlang * suggests
  • rsvg * suggests
  • splashR * suggests
  • testthat * suggests
  • vdiffr * suggests
docker/deps/Dockerfile docker
  • rocker/rstudio 4.2 build
docker/rstudio/Dockerfile docker
  • patroonorg/patroondeps latest build