Science Score: 36.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
✓DOI references
Found 9 DOI reference(s) in README -
○Academic publication links
-
✓Committers with academic emails
5 of 14 committers (35.7%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (15.9%) to scientific vocabulary
Keywords
Repository
Tools to work with variant call format files
Statistics
- Stars: 261
- Watchers: 16
- Forks: 53
- Open Issues: 41
- Releases: 15
Topics
Metadata Files
README.md
VcfR: a package to manipulate and visualize VCF data in R

VcfR is an R package intended to allow easy manipulation and visualization of variant call format (VCF) data. Functions are provided to rapidly read from and write to VCF files. Once VCF data is read into R a parser function extracts matrices from the VCF data for use with typical R functions. This information can then be used for quality control or other purposes. Additional functions provide visualization of genomic data. Once processing is complete data may be written to a VCF file or converted into other popular R objects (e.g., genlight, DNAbin). VcfR provides a link between VCF data and the R environment connecting familiar software with genomic data.
VcfR is built upon two data structures.
vcfR - S4 class to contain data from a VCF file.
chromR - S4 class to contain variant information (VCF) and optional sequence (FASTA) and annotation (GFF) information.
Functions in vcfR provide the ability to subset VCF data as well as to extract and parse the data. For example, individual genotypes, sequence depths or genotype likelihoods (when provided in the VCF file) can easily be accessed. These tools are provided to aid researchers in rapidly surveying the quality and other characteristics of data provided as VCF data. With this information in hand, researchers should be able to determine criteria for hard filtering in order to attempt to maximize biological variation and minimize technical variation.
Documentation
Documentation for vcfR can now be found here: vcfR_documentation.
We also have Population genetics and genomics in R which is more general and provides examples of analyses.
If you think you've found a bug, please see reporting an issue.
Publication
vcfR articles
Knaus, Brian J., and Niklaus J. Grunwald. 2017. VCFR: a package to manipulate and visualize variant call format data in R. Molecular Ecology Resources 17(1):44-53. http://dx.doi.org/10.1111/1755-0998.12549.
Knaus, Brian J., and Niklaus J. Grunwald. 2016. VcfR: an R package to manipulate and visualize VCF format data. bioRxiv: 041277. http://dx.doi.org/10.1101/041277.
Copy number variation article
Knaus, Brian, and Niklaus J. Grünwald. 2018. Inferring variation in copy number using high throughput sequencing data in R. Frontiers in Genetics 9: 123. http://dx.doi.org/10.3389/fgene.2018.00123.
Download
vcfR is available at CRAN. To install use:
install.packages('vcfR')
The development version can be installed through github:
devtools::install_github(repo="knausb/vcfR")
library(vcfR)
If you would like the vignettes use:
devtools::install_github(repo="knausb/vcfR", build_vignettes=TRUE)
If you've built the vignettes, you can browse them with:
browseVignettes(package="vcfR")
If you've installed this package with devtools you will probably need to run:
devtools::install(build_vignettes = TRUE)
Devel branch
The devel branch (which may not be stable) can also be installed:
devtools::install_github(repo="knausb/vcfR@devel")
library(vcfR)
And to build the vignettes:
devtools::install_github(repo="knausb/vcfR@devel", build_vignettes=TRUE)
Software that produce VCF files
A fun part of this project has been learning about how people use vcfR. One facet of this is learning about the software that create VCF files. So I've decided to make a list of these software. If you know of a software that I have not included on this list, particularly if you can report that vcfR works with its files, feel free to let me know!
Genomic variant callers:
Restriction site associated DNA markers (e.g., RADseq, GBS):
Manipulation of VCF data:
- Beagle v4.1 Inputs VCF genotypes and outputs phased genotypes to VCF format
- pegas::read.vcf Population and Evolutionary Genetics Analysis System
- PyVCF
- SnpEff Genetic variant annotation and effect prediction toolbox
- Picard A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF
- VCF-kit VCF-kit is a command-line based collection of utilities for performing analysis on Variant Call Format (VCF) files.
- VCFtools General manipulation and analysis
- VariantAnnotation::readVcf Bioconductor package for annotating variants
R packages that read VCF data
Enjoy!
Owner
- Name: Brian Knaus
- Login: knausb
- Kind: user
- Location: Corvallis, OR, USA
- Company: Oregon State University
- Repositories: 67
- Profile: https://github.com/knausb
Evolutionary genomicist, computational biologist. Born and raised in Ann Arbor, MI. Currently living in Corvallis, OR.
GitHub Events
Total
- Issues event: 5
- Watch event: 13
- Issue comment event: 8
- Push event: 7
Last Year
- Issues event: 5
- Watch event: 13
- Issue comment event: 8
- Push event: 7
Committers
Last synced: over 2 years ago
Top Committers
| Name | Commits | |
|---|---|---|
| Brian J. Knaus | b****s@g****m | 1,293 |
| Zhian Kamvar | k****z@s****u | 15 |
| Eric C. Anderson | e****n@n****v | 10 |
| Zhian Kamvar | z****r@g****m | 3 |
| shankarkshakya | k****a@y****m | 3 |
| Hadley Wickham | h****m@g****m | 2 |
| David Winter | d****r@g****m | 2 |
| Javier F. Tabima | c****1@g****m | 1 |
| Brian Knaus | k****b@1****u | 1 |
| Neato-Nick | n****n@g****m | 1 |
| Brian Knaus | k****b@a****u | 1 |
| Nick Kennedy | r@n****m | 1 |
| Nick Kennedy | n****y@e****k | 1 |
| Brian Knaus | k****b@B****l | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 99
- Total pull requests: 6
- Average time to close issues: 2 months
- Average time to close pull requests: 17 days
- Total issue authors: 86
- Total pull request authors: 6
- Average comments per issue: 3.84
- Average comments per pull request: 3.67
- Merged pull requests: 3
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 3
- Pull requests: 0
- Average time to close issues: 4 days
- Average time to close pull requests: N/A
- Issue authors: 3
- Pull request authors: 0
- Average comments per issue: 2.33
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- knausb (5)
- fdchevalier (4)
- mattbareno (2)
- gevro (2)
- Neato-Nick (2)
- tedtoal (2)
- gimeno99 (2)
- ghost (2)
- JimWhiting91 (2)
- katiesaund (1)
- redgcko7 (1)
- romanzenka (1)
- OJWatson (1)
- TomJamesW (1)
- audyavar (1)
Pull Request Authors
- gorgitko (1)
- hadley (1)
- jan-glx (1)
- RichardJActon (1)
- zkamvar (1)
- Neato-Nick (1)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 3
-
Total downloads:
- cran 5,803 last-month
- Total docker downloads: 1,203
-
Total dependent packages: 25
(may contain duplicates) -
Total dependent repositories: 55
(may contain duplicates) - Total versions: 46
- Total maintainers: 1
proxy.golang.org: github.com/knausb/vcfR
- Documentation: https://pkg.go.dev/github.com/knausb/vcfR#section-documentation
-
Latest release: v1.15.0
published about 2 years ago
Rankings
proxy.golang.org: github.com/knausb/vcfr
- Documentation: https://pkg.go.dev/github.com/knausb/vcfr#section-documentation
-
Latest release: v1.15.0
published about 2 years ago
Rankings
cran.r-project.org: vcfR
Manipulate and Visualize VCF Data
- Homepage: https://github.com/knausb/vcfR
- Documentation: http://cran.r-project.org/web/packages/vcfR/vcfR.pdf
- License: GPL-3
-
Latest release: 1.15.0
published about 2 years ago
Rankings
Maintainers (1)
Dependencies
- actions/checkout v2 composite
- r-lib/actions/check-r-package v1 composite
- r-lib/actions/setup-pandoc v1 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
- R >= 3.0.1 depends
- Rcpp * imports
- ape * imports
- dplyr * imports
- grDevices * imports
- graphics * imports
- magrittr * imports
- memuse * imports
- methods * imports
- pinfsc50 * imports
- stats * imports
- stringr * imports
- tibble * imports
- utils * imports
- vegan * imports
- viridisLite * imports
- adegenet * suggests
- ggplot2 * suggests
- knitr * suggests
- poppr * suggests
- reshape2 * suggests
- rmarkdown * suggests
- scales * suggests
- testthat * suggests
- tidyr * suggests