castgen

castgen: R package with methods for estimating the number of individuals to sample from a population in order to capture a predefined percentage of diversity

https://github.com/alex-sandercock/castgen

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.6%) to scientific vocabulary

Keywords

breeding conservation cran-r genomics
Last synced: 6 months ago · JSON representation

Repository


Basic Info
  • Host: GitHub
  • Owner: alex-sandercock
  • License: apache-2.0
  • Language: R
  • Default Branch: main
  • Homepage:
  • Size: 140 KB
Statistics
  • Stars: 1
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 1
Created over 1 year ago · Last pushed 11 months ago
Metadata Files
Readme License

README.md

R-CMD-check CRAN Status Badge GitHub License

castgen

This package and included functions were developed for the American chestnut landscape genomics manuscript. The aim was to develop a method for estimating the number of individuals to sample from a population in order to capture a predefined percentage of diversity within a breeding population.

Getting Started

A VCF file with GT (genotype) information or a genotype matrix can be used as input.
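To make the genotype matrix input more concrete, the sketch below simulates a small matrix in base R and runs a simplified subsampling estimate on it. It does not call castgen. The function name `estimate_sample_size`, the squared-correlation criterion, and the 0.9 threshold are assumptions made for illustration, not the package's documented API or method.

```r
# Hypothetical sketch: not castgen's API. Rows are individuals, columns are
# markers, and each cell is the alternate-allele dosage (0, 1, or 2 for a diploid).
set.seed(42)
n_ind <- 100
n_mark <- 200
ploidy <- 2
pop_freq <- runif(n_mark, 0.05, 0.95)
geno <- sapply(pop_freq, function(p) rbinom(n_ind, ploidy, p))
rownames(geno) <- paste0("ind", seq_len(n_ind))
colnames(geno) <- paste0("snp", seq_len(n_mark))

# Per-marker allele frequency computed from dosages
allele_freq <- function(g, ploidy) colMeans(g, na.rm = TRUE) / ploidy

# Add one randomly ordered individual at a time until the subsample's allele
# frequencies reach an assumed r^2 threshold against the full-population frequencies.
estimate_sample_size <- function(g, ploidy, r2_threshold = 0.9) {
  full <- allele_freq(g, ploidy)
  ord <- sample(nrow(g))
  for (k in 2:nrow(g)) {
    sub <- allele_freq(g[ord[1:k], , drop = FALSE], ploidy)
    r2 <- suppressWarnings(cor(sub, full))^2
    if (!is.na(r2) && r2 >= r2_threshold) return(k)
  }
  nrow(g)
}

# Each replicate uses a different random ordering and counts as one iteration
estimates <- replicate(5, estimate_sample_size(geno, ploidy))
print(estimates)
```

Repeating the estimate over several random orderings gives a spread of values that can then be summarized as a mean with an interval.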

Installation

castgen is available from CRAN and can be installed using the following command:

```{R}
install.packages("castgen")
```

Output data

The output will provide the following information:

```
Number of individuals to sample = 22.0
95% Confidence Intervals = (16.378127142052662, 27.621872857947338)
Iterations performed = 5
```
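One way a summary of this shape could be derived from per-iteration estimates is a mean with a normal-approximation interval. The sketch below is an assumption about the calculation, not the package's confirmed method, and the `iteration_estimates` values are made up for illustration.

```r
# Hypothetical per-iteration estimates (made-up values, not package output)
iteration_estimates <- c(18, 20, 22, 24, 26)

n_iter <- length(iteration_estimates)
mean_n <- mean(iteration_estimates)
# Standard error of the mean across iterations
se_n <- sd(iteration_estimates) / sqrt(n_iter)
# Normal-approximation 95% interval (assumed; a t-based interval would be wider)
z <- qnorm(0.975)
ci <- c(lower = mean_n - z * se_n, upper = mean_n + z * se_n)

cat("Number of individuals to sample =", mean_n, "\n")
cat("95% Confidence Intervals = (", ci["lower"], ", ", ci["upper"], ")\n", sep = "")
cat("Iterations performed =", n_iter, "\n")
```

With only a handful of iterations the interval is sensitive to each individual estimate, so running more iterations should tighten it.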

Citation

If you use this package, please cite:

A.M. Sandercock, J.W. Westbrook, Q. Zhang, & J.A. Holliday, A genome-guided strategy for climate resilience in American chestnut restoration populations, Proc. Natl. Acad. Sci. U.S.A. 121 (30) e2403505121, https://doi.org/10.1073/pnas.2403505121 (2024).

Notes

The capturediversity functions were converted from a Python function <https://github.com/alex-sandercock/Capturegenomic_diversity>. The R versions run much faster and can optionally run in parallel, depending on the number of cores available.


Owner

  • Name: Alexander Sandercock
  • Login: alex-sandercock
  • Kind: user

PhD Student in Genetics, Bioinformatics, and Computational Biology @virginia_tech | ICTAS Doctoral Scholar

GitHub Events

Total
  • Release event: 1
  • Push event: 16
  • Pull request event: 3
  • Create event: 2
Last Year
  • Release event: 1
  • Push event: 16
  • Pull request event: 3
  • Create event: 2

Packages

  • Total packages: 1
  • Total downloads:
    • cran 195 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 1
  • Total maintainers: 1
cran.r-project.org: castgen

Estimate Sample Size for Population Genomic Studies

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 195 Last month
Rankings
Dependent packages count: 26.9%
Dependent repos count: 33.1%
Average: 48.9%
Downloads: 86.9%
Maintainers (1)
Last synced: 6 months ago

Dependencies

DESCRIPTION cran
  • Rdpack >= 0.7 imports
  • doParallel >= 1.0.17 imports
  • dplyr >= 1.1.2 imports
  • foreach >= 1.5.2 imports
  • vcfR >= 1.15.0 imports