Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: sciencedirect.com -
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.0%) to scientific vocabulary
Keywords
Repository
Make Your Couch a Data Pre-Processing Center
Basic Info
Statistics
- Stars: 14
- Watchers: 3
- Forks: 4
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
AllInOne 
The AllInOne - Pre-Processing is an open-Source, R-Shiny user interface package designed to provide a broad range of pre-processing analysis features for phenotypic datasets. This app uses different R packages, such as mice, VIM, lme4, bestNormalize, etc., to enable plant scientists to use all the mentioned packages simultaneously in an interactive environment. Furthermore, it allows plant scientists to edit, organize, subset, and sort datasets in a live mode.
If you enjoy working with AllInOne - Pre-Processing, give us a star on GitHub and Cite the package, please :)
Here is the AllInOne - Pre-Processing research paper.
Demo
Just click HERE.
Installation
Required: R version 4.0.0 or later
Required: Rstudio
Required: golem R package version 0.3.4 or later:
Required: shinydashboard R package version 0.7.2 or later
Required: shinydisconnect R package version 0.1.0 or later
Required: shinyjs R package version 2.1.0 or later
Required: SpATS R package version 1.0-16 or later
Required: remotes R package
r
install.packages(c("remotes","golem","shinydashboard","shinydisconnect","shinyjs"))
Install using source
You can install the AllInOne - Pre-Processing using its source on your system like so:
r
remotes::install_local('path/to/AllInOne-Pre-Processing/', force = TRUE)
Install from GitHub
You can install the AllInOne - Pre-Processing from GitHub like so:
r
remotes::install_github('MohsenYN/AllInOne')
How to run
Run without installing
You can run the application by just running app.R without installing the package.
Run after installation
r
AllInOne::run_app()
What do you think about AllInOne?
Let us know :)
Soybean Breeding & Computational Biology
Department of Plant Agriculture
University of Guelph
ENJOY!
We are proud of our partners:
Owner
- Name: Mohsen
- Login: MohsenYN
- Kind: user
- Location: Guelph,Canada
- Company: University of Guelph
- Website: https://www.oaccbc.uoguelph.ca/
- Twitter: Mohsen1080
- Repositories: 2
- Profile: https://github.com/MohsenYN
PhD., Research Associate, Soybean Breeding & Computational Biology Department of Plant Agriculture University of Guelph
Citation (CITATION.cff)
# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!
cff-version: 1.2.0
title: AllInOne
message: Make Your Couch a Data Pre-Processing Center
type: software
authors:
- given-names: Mohsen
family-names: Yoosefzadeh Najafabadi
email: myoosefz@uoguelph.ca
affiliation: Department of Plant Agriculture, University of Guelph, Guelph, ON N1G 2W1, Canada
orcid: 'https://orcid.org/0000-0003-3631-4493'
- orcid: 'https://orcid.org/0000-0003-2087-4306'
email: Alihdr@stu.usc.ac.ir
family-names: Heidari
given-names: Ali
- given-names: Istvan
family-names: Rajcan
email: Irajcan@uoguelph.ca
orcid: 'https://orcid.org/0000-0001-5156-2482'
affiliation: Department of Plant Agriculture, University of Guelph, Guelph, ON N1G 2W1, Canada
identifiers:
- type: url
value: 'https://github.com/MohsenYN/AllInOne'
repository-code: 'https://github.com/MohsenYN/AllInOne'
url: 'https://allinone.shinyapps.io/allinone/'
abstract: >-
Plant breeding is a great mixture of art and science that
has been around for centuries and plays an important role
in solving many of the world’s agricultural problems. The
ultimate goal for most plant breeding programs is to
develop new varieties that are better adapted for certain
environments, more resistant to biotic/abiotic stresses,
and/or have increased yields. The process usually starts
with selecting and crossing two varieties as parental
lines with specific features to create a population with a
high recombination rate. Afterward, the created population
is evaluated for several generations to select the
superior varieties based on the plant breeding goal. The
first evaluation of the breeding population mainly relies
on the phenotypic data derived from different plant
varieties growing in multi-locations for several years.
The collected phenotypic data can be further used in
combination with the environment and omics data to
increase the accuracy of the breeding decisions.
Therefore, the accuracy, quality, and nature of phenotypic
data are the three most important factors in capturing
true signals and avoiding false interpretations. In
addition, the proper selection of statistical analysis
methods is of high paramount in making the right decision
to select superior genotypes based on a trait of
interest.
The accuracy of phenotypic data highly depends on how
plant breeders collect data. Conventional data collection
methods can bring more noise to the phenotypic data than
high throughput methods. In addition, the contribution of
more data collectors in measuring phenotypic data may
decrease the level of accuracy. Moreover, the existence
and abundance of genotypes that exhibit a particular
phenotypic characteristic is an important determinant of
the extent to which analysis methods can be used to select
superior genotypes. Therefore, as plant breeders have been
dealing with a large number of genotypes in different
locations over several years, it is necessary to
pre-process the phenotypic datasets before selecting any
methods or making any decision. Pre-processing is one of
the important procedures for increasing the quality and
accuracy of the field phenotypic data, which is usually
done in several steps, such as 1) detecting missing
patterns in the dataset, 2) imputing missing data using
different statistical methods, 3) data visualization in
order to check data patterns and distribution, 4)
detecting and refining outliers, 5) estimating
correlations between dependant variables and also with and
within independent variables, 6) normalizing data based on
the optimum normalization methods for a given dataset, 6)
estimating heritability and conducting spatial analysis,
and finally, 7) calculating best linear unbiased
prediction (BLUP) or/and best linear unbiased estimator
(BLUE) based on the goal of the plant breeding program.
Several packages in different languages (mainly in R) have
been created to handle different pre-processing steps,
such as 1) MICE to deal with missing data, 2)
Bestormalizer to normalize the datasets using different
methods, 3) lme4 and ASReml R to handle variance analysis
using different experimental designs, 4) ggplot for data
visualization, etc. However, none of them are able to
handle at least most of the pre-processing steps in plant
breeding phenotypic datasets. In addition, they all
require medium to advanced coding knowledge to run and
adjust the functions based on the breeding preference.
Furthermore, more sophisticated packages such as ASReml R
are not free and require an annual renewal fee. Moreover,
none of them provided a dynamical graphical interference
for creating plots, detecting and refining outliers in a
live mode, and using different datasets without directly
importing from a file.
Here, we introduce the AllInOne package as an open-Source,
breeder-friendly, analytical R package for pre-processing
phenotypic data. The basis of AllInOne is to utilize
different R packages and develop the pipeline for
pre-processing the phenotypic datasets in an accurate,
easy, and timely manner without any coding skills
required. In addition, several new features and abilities
were added in AllInOne that can complement previously
developed packages in this area.
keywords:
- Pre-processing
- plant breeding
- data collection
- curation
- R shiny package
- agriculture
license: MIT
version: 1.8
date-released: '2023-1-18'
GitHub Events
Total
Last Year
Committers
Last synced: about 2 years ago
Top Committers
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: about 2 years ago
All Time
- Total issues: 0
- Total pull requests: 10
- Average time to close issues: N/A
- Average time to close pull requests: less than a minute
- Total issue authors: 0
- Total pull request authors: 2
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 10
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 1
- Average time to close issues: N/A
- Average time to close pull requests: less than a minute
- Issue authors: 0
- Pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- MohsenYN (6)
- alihdr (4)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- R >= 2.10 depends
- DT * imports
- MASS * imports
- RColorBrewer * imports
- VIM * imports
- bestNormalize * imports
- car * imports
- config * imports
- corrplot * imports
- data.table * imports
- dplyr * imports
- finalfit * imports
- forcats * imports
- ggplot2 * imports
- ggpubr * imports
- glmnet * imports
- glue * imports
- golem >= 0.3.4 imports
- gridExtra * imports
- lme4 * imports
- mice * imports
- naniar * imports
- pkgload * imports
- purrr * imports
- readxl * imports
- shiny * imports
- shinydashboard >= 0.7.2 imports
- shinydisconnect >= 0.1.0 imports
- shinyjs >= 2.1.0 imports
- stats * imports
- stringr * imports
- testthat * imports
- tibble * imports
- tidyr * imports
- tidyverse * imports
- usethis * imports
- waiter * imports












