allinone

Make Your Couch a Data Pre-Processing Center

https://github.com/mohsenyn/allinone

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: sciencedirect.com
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.0%) to scientific vocabulary

Keywords

agriculture plantbreeding preprocessing university
Last synced: 6 months ago · JSON representation ·

Repository

Make Your Couch a Data Pre-Processing Center

Basic Info
  • Host: GitHub
  • Owner: MohsenYN
  • License: mit
  • Language: R
  • Default Branch: main
  • Homepage:
  • Size: 49.6 MB
Statistics
  • Stars: 14
  • Watchers: 3
  • Forks: 4
  • Open Issues: 0
  • Releases: 0
Topics
agriculture plantbreeding preprocessing university
Created over 3 years ago · Last pushed over 2 years ago
Metadata Files
Readme Contributing License Code of conduct Citation Security

README.md

AllInOne



The AllInOne - Pre-Processing is an open-Source, R-Shiny user interface package designed to provide a broad range of pre-processing analysis features for phenotypic datasets. This app uses different R packages, such as mice, VIM, lme4, bestNormalize, etc., to enable plant scientists to use all the mentioned packages simultaneously in an interactive environment. Furthermore, it allows plant scientists to edit, organize, subset, and sort datasets in a live mode.

If you enjoy working with AllInOne - Pre-Processing, give us a star on GitHub and Cite the package, please :)

Here is the AllInOne - Pre-Processing research paper.

Demo

Just click HERE.


Installation

Required: R version 4.0.0 or later
Required: Rstudio
Required: golem R package version 0.3.4 or later:
Required: shinydashboard R package version 0.7.2 or later
Required: shinydisconnect R package version 0.1.0 or later
Required: shinyjs R package version 2.1.0 or later
Required: SpATS R package version 1.0-16 or later
Required: remotes R package

r install.packages(c("remotes","golem","shinydashboard","shinydisconnect","shinyjs"))

Install using source

You can install the AllInOne - Pre-Processing using its source on your system like so:

r remotes::install_local('path/to/AllInOne-Pre-Processing/', force = TRUE)

Install from GitHub

You can install the AllInOne - Pre-Processing from GitHub like so:

r remotes::install_github('MohsenYN/AllInOne')

How to run

Run without installing

You can run the application by just running app.R without installing the package.

Run after installation

r AllInOne::run_app()

What do you think about AllInOne?

Let us know :)

Mohsen Yoosefzadeh Najafabadi

Soybean Breeding & Computational Biology
Department of Plant Agriculture
University of Guelph

ENJOY!



We are proud of our partners:













Owner

  • Name: Mohsen
  • Login: MohsenYN
  • Kind: user
  • Location: Guelph,Canada
  • Company: University of Guelph

PhD., Research Associate, Soybean Breeding & Computational Biology Department of Plant Agriculture University of Guelph

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: AllInOne
message: Make Your Couch a Data Pre-Processing Center
type: software
authors:
  - given-names: Mohsen
    family-names: Yoosefzadeh Najafabadi
    email: myoosefz@uoguelph.ca
    affiliation: Department of Plant Agriculture, University of Guelph, Guelph, ON N1G 2W1, Canada
    orcid: 'https://orcid.org/0000-0003-3631-4493'
  - orcid: 'https://orcid.org/0000-0003-2087-4306'
    email: Alihdr@stu.usc.ac.ir
    family-names: Heidari
    given-names: Ali
  - given-names: Istvan
    family-names: Rajcan
    email: Irajcan@uoguelph.ca
    orcid: 'https://orcid.org/0000-0001-5156-2482'
    affiliation: Department of Plant Agriculture, University of Guelph, Guelph, ON N1G 2W1, Canada
identifiers:
  - type: url
    value: 'https://github.com/MohsenYN/AllInOne'
repository-code: 'https://github.com/MohsenYN/AllInOne'
url: 'https://allinone.shinyapps.io/allinone/'
abstract: >-
  Plant breeding is a great mixture of art and science that
  has been around for centuries and plays an important role
  in solving many of the world’s agricultural problems. The
  ultimate goal for most plant breeding programs is to
  develop new varieties that are better adapted for certain
  environments, more resistant to biotic/abiotic stresses,
  and/or have increased yields. The process usually starts
  with selecting and crossing two varieties as parental
  lines with specific features to create a population with a
  high recombination rate. Afterward, the created population
  is evaluated for several generations to select the
  superior varieties based on the plant breeding goal. The
  first evaluation of the breeding population mainly relies
  on the phenotypic data derived from different plant
  varieties growing in multi-locations for several years.
  The collected phenotypic data can be further used in
  combination with the environment and omics data to
  increase the accuracy of the breeding decisions.
  Therefore, the accuracy, quality, and nature of phenotypic
  data are the three most important factors in capturing
  true signals and avoiding false interpretations. In
  addition, the proper selection of statistical analysis
  methods is of high paramount in making the right decision
  to select superior genotypes based on a trait of
  interest. 

  The accuracy of phenotypic data highly depends on how
  plant breeders collect data. Conventional data collection
  methods can bring more noise to the phenotypic data than
  high throughput methods. In addition, the contribution of
  more data collectors in measuring phenotypic data may
  decrease the level of accuracy. Moreover, the existence
  and abundance of genotypes that exhibit a particular
  phenotypic characteristic is an important determinant of
  the extent to which analysis methods can be used to select
  superior genotypes. Therefore, as plant breeders have been
  dealing with a large number of genotypes in different
  locations over several years, it is necessary to
  pre-process the phenotypic datasets before selecting any
  methods or making any decision. Pre-processing is one of
  the important procedures for increasing the quality and
  accuracy of the field phenotypic data, which is usually
  done in several steps, such as 1) detecting missing
  patterns in the dataset, 2) imputing missing data using
  different statistical methods, 3) data visualization in
  order to check data patterns and distribution, 4)
  detecting and refining outliers, 5) estimating
  correlations between dependant variables and also with and
  within independent variables, 6) normalizing data based on
  the optimum normalization methods for a given dataset, 6)
  estimating heritability and conducting spatial analysis,
  and finally, 7) calculating best linear unbiased
  prediction (BLUP) or/and best linear unbiased estimator
  (BLUE) based on the goal of the plant breeding program. 

  Several packages in different languages (mainly in R) have
  been created to handle different pre-processing steps,
  such as 1) MICE to deal with missing data, 2)
  Bestormalizer to normalize the datasets using different
  methods, 3) lme4 and ASReml R to handle variance analysis
  using different experimental designs, 4) ggplot for data
  visualization, etc. However, none of them are able to
  handle at least most of the pre-processing steps in plant
  breeding phenotypic datasets. In addition, they all
  require medium to advanced coding knowledge to run and
  adjust the functions based on the breeding preference.
  Furthermore, more sophisticated packages such as ASReml R
  are not free and require an annual renewal fee. Moreover,
  none of them provided a dynamical graphical interference
  for creating plots, detecting and refining outliers in a
  live mode, and using different datasets without directly
  importing from a file. 

  Here, we introduce the AllInOne package as an open-Source,
  breeder-friendly, analytical R package for pre-processing
  phenotypic data. The basis of AllInOne is to utilize
  different R packages and develop the pipeline for
  pre-processing the phenotypic datasets in an accurate,
  easy, and timely manner without any coding skills
  required. In addition, several new features and abilities
  were added in AllInOne that can complement previously
  developed packages in this area.
keywords:
  - Pre-processing
  - plant breeding
  - data collection
  - curation
  - R shiny package
  - agriculture
license: MIT
version: 1.8
date-released: '2023-1-18'

GitHub Events

Total
Last Year

Committers

Last synced: about 2 years ago

All Time
  • Total Commits: 387
  • Total Committers: 6
  • Avg Commits per committer: 64.5
  • Development Distribution Score (DDS): 0.506
Past Year
  • Commits: 387
  • Committers: 6
  • Avg Commits per committer: 64.5
  • Development Distribution Score (DDS): 0.506
Top Committers
Name Email Commits
Alihdr a****9@g****m 191
mohsen1080 m****z@u****a 110
Mohsen 5****N 54
Alijvhr e****r@g****m 14
Ali 3****r 11
Mohsen 5****0 7
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: about 2 years ago

All Time
  • Total issues: 0
  • Total pull requests: 10
  • Average time to close issues: N/A
  • Average time to close pull requests: less than a minute
  • Total issue authors: 0
  • Total pull request authors: 2
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 10
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: less than a minute
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • MohsenYN (6)
  • alihdr (4)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

DESCRIPTION cran
  • R >= 2.10 depends
  • DT * imports
  • MASS * imports
  • RColorBrewer * imports
  • VIM * imports
  • bestNormalize * imports
  • car * imports
  • config * imports
  • corrplot * imports
  • data.table * imports
  • dplyr * imports
  • finalfit * imports
  • forcats * imports
  • ggplot2 * imports
  • ggpubr * imports
  • glmnet * imports
  • glue * imports
  • golem >= 0.3.4 imports
  • gridExtra * imports
  • lme4 * imports
  • mice * imports
  • naniar * imports
  • pkgload * imports
  • purrr * imports
  • readxl * imports
  • shiny * imports
  • shinydashboard >= 0.7.2 imports
  • shinydisconnect >= 0.1.0 imports
  • shinyjs >= 2.1.0 imports
  • stats * imports
  • stringr * imports
  • testthat * imports
  • tibble * imports
  • tidyr * imports
  • tidyverse * imports
  • usethis * imports
  • waiter * imports