seqArchR
seqArchR: Identifying (promoter) sequence architectures de novo using NMF
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: zenodo.org -
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.7%) to scientific vocabulary
Keywords
Repository
seqArchR: Identifying (promoter) sequence architectures de novo using NMF
Basic Info
- Host: GitHub
- Owner: snikumbh
- License: gpl-3.0
- Language: R
- Default Branch: main
- Homepage: https://snikumbh.github.io/seqArchR
- Size: 11.7 MB
Statistics
- Stars: 1
- Watchers: 1
- Forks: 1
- Open Issues: 2
- Releases: 0
Topics
Metadata Files
README.md
seqArchR
<!-- bioc badges: start -->
<!-- bioc badges: end -->
seqArchR is an unsupervised, non-negative matrix factorization (NMF)-based algorithm for discovery of sequence architectures de novo. Below is a schematic of seqArchR's algorithm.

Installation
Python scikit-learn dependency
This package requires the Python module scikit-learn. Please see installation instructions here.
To install this package, use
```r
if (!requireNamespace("remotes", quietly = TRUE)) {
install.packages("remotes")
}
remotes::installgithub("snikumbh/seqArchR", buildvignettes = FALSE) ```
Usage
```r
load package
library(seqArchR) library(Biostrings)
Creation of one-hot encoded data matrix from FASTA file
You can use your own FASTA file instead
inputFastaFilename <- system.file("extdata", "example_data.fa", package = "seqArchR", mustWork = TRUE)
Specifying dinuc generates dinucleotide features
inputSeqsMat <- seqArchR::preparedatafromFASTA(inputFastaFilename, sinucor_dinuc = "dinuc")
inputSeqsRaw <- seqArchR::preparedatafromFASTA(inputFastaFilename, rawseq = TRUE)
nSeqs <- length(inputSeqsRaw) positions <- seq(1, Biostrings::width(inputSeqsRaw[1]))
Set seqArchR configuration
Most arguments have default values
seqArchRconfig <- seqArchR::setconfig( parallelize = TRUE, ncores = 2, nruns = 100, kmin = 1, kmax = 20, modseltype = "stability", bound = 10^-6, chunksize = 100, resultaggl = "ward.D", resultdist = "euclid", flags = list(debug = FALSE, time = TRUE, verbose = TRUE, plot = FALSE) )
Call/Run seqArchR
seqArchRresult <- seqArchR::seqArchR(config = seqArchRconfig, seqsohemat = inputSeqsMat, seqsraw = inputSeqsRaw, seqspos = positions, totalitr = 2, setocollation = c(TRUE, FALSE))
```
Contact
Comments, suggestions, enquiries/requests are welcome! Feel free to email sarvesh.nikumbh@gmail.com or create an new issue
Owner
- Name: Sarvesh Nikumbh
- Login: snikumbh
- Kind: user
- Location: London
- Website: https://snikumbh.github.io
- Twitter: sarveshnikumbh
- Repositories: 20
- Profile: https://github.com/snikumbh
Post-Doc@MRC LMS; MPI-INF PhD. Other profile at MPI-GitHub: https://github.molgen.mpg.de/snikumbh
GitHub Events
Total
Last Year
Committers
Last synced: about 2 years ago
Top Committers
| Name | Commits | |
|---|---|---|
| snikumbh | s****h@g****m | 125 |
| Nitesh Turaga | n****a@g****m | 2 |
Issues and Pull Requests
Last synced: 8 months ago
All Time
- Total issues: 3
- Total pull requests: 0
- Average time to close issues: 9 days
- Average time to close pull requests: N/A
- Total issue authors: 3
- Total pull request authors: 0
- Average comments per issue: 1.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- johannesnicolaus (1)
- snikumbh (1)
- parsboy66 (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- bioconductor 6,615 total
- Total dependent packages: 1
- Total dependent repositories: 0
- Total versions: 5
- Total maintainers: 1
bioconductor.org: seqArchR
Identify Different Architectures of Sequence Elements
- Homepage: https://snikumbh.github.io/seqArchR/ https://github.com/snikumbh/seqArchR
- Documentation: https://bioconductor.org/packages/release/bioc/vignettes/seqArchR/inst/doc/seqArchR.pdf
- License: GPL-3 | file LICENSE
-
Latest release: 1.12.0
published 10 months ago
Rankings
Maintainers (1)
Dependencies
- actions/cache v2 composite
- actions/checkout v2 composite
- actions/upload-artifact main composite
- r-lib/actions/setup-pandoc v1 composite
- r-lib/actions/setup-r v2 composite
- actions/cache v2 composite
- actions/checkout v2 composite
- r-lib/actions/setup-pandoc v1 composite
- r-lib/actions/setup-r v1 composite
- actions/cache v2 composite
- actions/checkout v2 composite
- r-lib/actions/setup-pandoc v1 composite
- r-lib/actions/setup-r v1 composite
- R >= 4.2.0 depends
- BiocParallel * imports
- Biostrings * imports
- MASS * imports
- Matrix * imports
- cli * imports
- cluster * imports
- cvTools >= 0.3.2 imports
- fpc * imports
- ggplot2 >= 3.1.1 imports
- ggseqlogo >= 0.1 imports
- grDevices * imports
- graphics * imports
- matrixStats * imports
- methods * imports
- prettyunits * imports
- reshape2 >= 1.4.3 imports
- reticulate >= 1.22 imports
- stats * imports
- utils * imports
- BiocStyle * suggests
- covr * suggests
- cowplot * suggests
- hopach >= 2.42.0 suggests
- knitr >= 1.22 suggests
- rmarkdown >= 1.12 suggests
- testthat >= 3.0.2 suggests
- vdiffr >= 0.3.0 suggests