plinkqc

R package for quality control of plink genetic datasets

https://github.com/meyer-lab-cshl/plinkqc

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (16.9%) to scientific vocabulary
Last synced: 7 months ago · JSON representation

Repository

R package for quality control of plink genetic datasets

Basic Info
  • Host: GitHub
  • Owner: meyer-lab-cshl
  • License: other
  • Language: R
  • Default Branch: master
  • Size: 21.5 MB
Statistics
  • Stars: 59
  • Watchers: 4
  • Forks: 27
  • Open Issues: 11
  • Releases: 9
Created over 7 years ago · Last pushed 7 months ago
Metadata Files
Readme Changelog License Code of conduct Codemeta

README.Rmd

---
output: github_document
---



```{r, echo = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "README-"
)
```

[![CRAN_Status_Badge](http://www.r-pkg.org/badges/version/plinkQC)](https://cran.r-project.org/package=plinkQC)
[![Build Status](https://travis-ci.org/meyer-lab-cshl/plinkQC.svg?branch=master)](https://travis-ci.org/meyer-lab-cshl/plinkQC)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
[![Downloads](http://cranlogs.r-pkg.org/badges/grand-total/plinkQC?color=blue)](https://cran.r-project.org/package=plinkQC)

##  plinkQC

**plinkQC** is a R/CRAN package for **genotype quality control** in genetic
association studies. It makes PLINK basic statistics (e.g.missing genotyping
rates per individual, allele frequencies per genetic marker) and relationship
functions easily accessible from within R and allows for automatic evaluation
of the results.


Full documentation is available at https://meyer-lab-cshl.github.io/plinkQC/.


**plinkQC** generates a per-individual and per-marker quality control report.
A step-by-step guide on how to run these analyses can be found
[here](https://meyer-lab-cshl.github.io/plinkQC/articles/plinkQC.html).

Individuals and markers that fail the quality control can subsequently be
removed with **plinkQC** to generate a new, clean dataset.

**plinkQC** facilitates an ancestry check for study individuals based on
comparison to reference datasets. The processing of the reference datasets is
documented in detail
[here](https://meyer-lab-cshl.github.io/plinkQC/articles/AncestryCheck.html).

Removal of individuals based on relationship status via **plinkQC** is optimised
to retain as many individuals as possible in the study.

##  Installation

The current github version of **plinkQC**  is: 0.3.4 and can be
installed via
```{r github install, eval=FALSE}
library(devtools)
install_github("meyer-lab-cshl/plinkQC")
```
The current CRAN version of **plinkQC**  is: 0.3.4 and can be
installed via
```{r  install, eval=FALSE}
install.packages("plinkQC")
```

A log of version changes can be found [here](https://meyer-lab-cshl.github.io/plinkQC/news/index.html).

##  Citation
Meyer HV (2020) plinkQC: Genotype quality control in genetic association
studies. [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.3934294.svg)](https://doi.org/10.5281/zenodo.3934294)

Owner

  • Name: Meyer Laboratory
  • Login: meyer-lab-cshl
  • Kind: organization
  • Email: hmeyer@cshl.edu

CodeMeta (codemeta.json)

{
  "@context": [
    "https://raw.githubusercontent.com/codemeta/codemeta/master/codemeta.jsonld",
    "http://schema.org"
  ],
  "@type": "Code",
  "author": [
    {
      "@type": "Person",
      "givenName": "Hannah",
      "familyName": "Meyer",
      "email": "hannah.v.meyer@gmail.com",
      "@id": "https://orcid.org/0000-0003-4564-0899"
    }
  ],
  "identifier": "plinkQC",
  "codeRepository": "https://meyer-lab-cshl.github.io/plinkQC",
  "dateModified": "2019-10-09",
  "dateCreated": "2019-10-09",
  "description": "Genotyping arrays enable the direct measurement of an individuals\n    genotype at thousands of markers. 'plinkQC' facilitates genotype quality\n    control for genetic association studies as described by Anderson and\n    colleagues (2010) <doi:10.1038/nprot.2010.116>. It makes 'PLINK' basic\n    statistics (e.g. missing genotyping rates per individual, allele frequencies\n    per genetic marker) and relationship functions accessible from 'R' and\n    generates a per-individual and per-marker quality control report.\n    Individuals and markers that fail the quality control can subsequently be\n    removed to generate a new, clean dataset. Removal of individuals based on\n    relationship status is optimised to retain as many individuals as possible\n    in the study.",
  "keywords": "genotype quality control, genetic association studies, plink",
  "license": "https://spdx.org/licenses/MIT",
  "title": "plinkQC",
  "version": "0.3.0",
  "name": "plinkQC: Genotype Quality Control with 'PLINK'",
  "issueTracker": "https://github.com/meyer-lab-cshl/plinkQC/issues",
  "programmingLanguage": {
    "@type": "ComputerLanguage",
    "name": "R",
    "version": "3.6.1",
    "url": "https://r-project.org"
  },
  "runtimePlatform": "R version 3.6.1 (2019-07-05)",
  "provider": {
    "@id": "https://cran.r-project.org",
    "@type": "Organization",
    "name": "Comprehensive R Archive Network (CRAN)",
    "url": "https://cran.r-project.org"
  },
  "contributor": {},
  "copyrightHolder": {},
  "funder": {},
  "maintainer": [
    {
      "@type": "Person",
      "givenName": "Hannah",
      "familyName": "Meyer",
      "email": "hannah.v.meyer@gmail.com",
      "@id": "https://orcid.org/0000-0003-4564-0899"
    }
  ],
  "softwareSuggestions": [
    {
      "@type": "SoftwareApplication",
      "identifier": "testthat",
      "name": "testthat",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=testthat"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "knitr",
      "name": "knitr",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=knitr"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "rmarkdown",
      "name": "rmarkdown",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=rmarkdown"
    }
  ],
  "softwareRequirements": [
    {
      "@type": "SoftwareApplication",
      "identifier": "R",
      "name": "R",
      "version": ">= 3.6.0"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "methods",
      "name": "methods"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "optparse",
      "name": "optparse",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=optparse"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "data.table",
      "name": "data.table",
      "version": ">= 1.11.0",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=data.table"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "R.utils",
      "name": "R.utils",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=R.utils"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "ggplot2",
      "name": "ggplot2",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=ggplot2"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "ggforce",
      "name": "ggforce",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=ggforce"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "ggrepel",
      "name": "ggrepel",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=ggrepel"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "cowplot",
      "name": "cowplot",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=cowplot"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "UpSetR",
      "name": "UpSetR",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=UpSetR"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "dplyr",
      "name": "dplyr",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=dplyr"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "igraph",
      "name": "igraph",
      "version": ">= 1.2.4",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=igraph"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "sys",
      "name": "sys",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=sys"
    }
  ],
  "releaseNotes": "https://github.com/meyer-lab-cshl/plinkQC/blob/master/NEWS.md",
  "readme": "https://github.com/meyer-lab-cshl/plinkQC/blob/master/README.md",
  "contIntegration": "https://travis-ci.org/meyer-lab-cshl/plinkQC",
  "relatedLink": "https://CRAN.R-project.org/package=plinkQC"
}

GitHub Events

Total
  • Issues event: 3
  • Watch event: 4
  • Issue comment event: 2
  • Push event: 8
Last Year
  • Issues event: 3
  • Watch event: 4
  • Issue comment event: 2
  • Push event: 8

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 369
  • Total Committers: 5
  • Avg Commits per committer: 73.8
  • Development Distribution Score (DDS): 0.187
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
HannahVMeyer h****r@g****m 300
HannahVMeyer H****r@g****m 30
Hannah h****r@g****m 19
HannahVMeyer H****r 19
Elika Garg e****r 1

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 45
  • Total pull requests: 12
  • Average time to close issues: about 2 months
  • Average time to close pull requests: about 2 hours
  • Total issue authors: 31
  • Total pull request authors: 2
  • Average comments per issue: 4.58
  • Average comments per pull request: 0.08
  • Merged pull requests: 12
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 3
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 2
  • Pull request authors: 0
  • Average comments per issue: 0.33
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • blueskypie (6)
  • rskl92 (3)
  • HannahVMeyer (3)
  • montenegrina (2)
  • fastforward35 (2)
  • evagradovich (2)
  • fransbudiman (2)
  • MichaelCummingsBCH (2)
  • carolinewalter5 (1)
  • umasaxena (1)
  • pwuUCL (1)
  • nickorr78 (1)
  • dietmarfdz (1)
  • pasiunaite (1)
  • GabrieleNocchi (1)
Pull Request Authors
  • HannahVMeyer (11)
  • eg-r (1)
Top Labels
Issue Labels
enhancement (3) help wanted (2) bug (2) question (1) invalid (1)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 436 last-month
  • Total docker downloads: 1,438
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 10
  • Total maintainers: 1
cran.r-project.org: plinkQC

Genotype Quality Control with 'PLINK'

  • Versions: 10
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 436 Last month
  • Docker Downloads: 1,438
Rankings
Forks count: 3.3%
Stargazers count: 8.0%
Average: 20.8%
Downloads: 27.6%
Dependent packages count: 29.8%
Dependent repos count: 35.5%
Maintainers (1)
Last synced: 7 months ago

Dependencies

DESCRIPTION cran
  • R >= 3.6.0 depends
  • R.utils * imports
  • UpSetR * imports
  • cowplot * imports
  • data.table >= 1.11.0 imports
  • dplyr * imports
  • ggforce * imports
  • ggplot2 * imports
  • ggrepel * imports
  • igraph >= 1.2.4 imports
  • methods * imports
  • optparse * imports
  • sys * imports
  • formatR * suggests
  • knitr * suggests
  • mockery * suggests
  • rmarkdown * suggests
  • testthat * suggests