dcTensor

dcTensor: An R package for discrete matrix/tensor decomposition - Published in JOSS (2023)

https://github.com/rikenbit/dctensor

Science Score: 93.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README and JOSS metadata
  • Academic publication links
    Links to: joss.theoj.org, zenodo.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
    Published in Journal of Open Source Software

Scientific Fields

Earth and Environmental Sciences Physical Sciences - 40% confidence
Last synced: 6 months ago · JSON representation

Repository

R package for Discrete Matrix/Tensor Decomposition

Basic Info
  • Host: GitHub
  • Owner: rikenbit
  • License: other
  • Language: R
  • Default Branch: main
  • Size: 975 KB
Statistics
  • Stars: 3
  • Watchers: 4
  • Forks: 0
  • Open Issues: 0
  • Releases: 8
Created about 3 years ago · Last pushed 6 months ago
Metadata Files
Readme Contributing License Code of conduct

README.md

DOI CRAN_Status_Badge Downloads Total Downloads :name status badge :registry status badge :total status badge dcTensor status badge GitHub Actions status

dcTensor

dcTensor is an R package for Discrete Matrix/Tensor Decomposition. dcTensor provides the discretized version of matrix and tensor decomposition algorithms such as:

  • Discretized Non-negative Matrix Factorization Algorithms (dNMF)
  • Discretized Non-negative Matrix Tri-Factorization Algorithms (dNMTF)
  • Discretized Singular Value Decomposition (dSVD)
  • Discretized Simultaneous Non-negative Matrix Factorization Algorithms (dsiNMF)
  • Discretized Joint Non-negative Matrix Factorization Algorithms (djNMF)
  • Discretized Partial Least Squares (dPLS)
  • Discretized Non-negative CP Decomposition Algorithms (dNTF)
  • Discretized Non-negative Tucker Decomposition Algorithms (dNTD)

Here "discretized" means that the factor matrices extracted from the data are estimated with discretizing regularization, so that the values are binary (e.g., {0,1}) or ternary (e.g., {0,1,2}), as much as possible. Binary data analysis is recently featured in some data science domains such as market basket data, document-term data, Web click-stream data, DNA microarray expression profiles, or protein-protein complex interaction networks.

Installation (for users)

To install dcTensor from CRAN, type as follows:

~~~~ install.packages("dcTensor") ~~~~

Installation (for developers)

To install the latest dcTensor from GitHub, type as follows:

~~~~ git clone https://github.com/rikenbit/dcTensor/ R CMD INSTALL dcTensor ~~~~

or type the code below in the R console window

~~~~ library(devtools) devtools::install_github("rikenbit/dcTensor") ~~~~

How to perform dcTensor

For the details of dcTensor's functions, see the help page of each function as follows.

~~~~ library("dcTensor")

?toyModel ?dNMF ?dNMTF ?dSVD ?dsiNMF ?djNMF ?dPLS ?dNTF ?dNTD ~~~~

References

  • Binary Matrix Factorization (BMF)
    • Z. Zhang, T. Li, C. Ding and X. Zhang, "Binary Matrix Factorization with Applications," Seventh IEEE International Conference on Data Mining (ICDM 2007), Omaha, NE, USA, 2007, pp. 391-400, doi: 10.1109/ICDM.2007.99.
  • Non-negative Matrix Tri-Factorization (NMTF)
    • Copar, A. et al., Fast Optimization of Non-Negative Matrix Tri-Factorization: Supporting Information, PLOS ONE, 14(6), e0217994, 2019
    • Long, B. et al., Co-clustering by Block Value Decomposition, SIGKDD'05, 635–640, 2005
    • Ding, C. et al., Orthogonal Nonnegative Matrix Tri-Factorizations for Clustering, 12th ACM SIGKDD'06, 126–135, 2006
  • Singular Value Decomposition (SVD) based on Gradient Descent
    • Tsuyuzaki K, et al., Benchmarking principal component analysis for large-scale single-cell RNA-sequencing. BMC Genome Biology. 21(1), 9, 2020
  • Simultaneous Non-negative Matrix Factorization (siNMF)
    • Badea, L. Extracting Gene Expression Profiles Common to Colon and Pancreatic Adenocarcinoma using Simultaneous nonnegative matrix factorization, Pacific Symposium on Biocomputing, 279-290, 2008
    • Zhang, S. et al., Discovery of multi-dimensional modules by integrative analysis of cancer genomic data. Nucleic Acids Research, 40(19), 9379-9391, 2012
    • Yilmaz, Y. K. et al., Probabilistic Latent Tensor Factorization, IVA/ICA 2010, 346-353, 2010
  • Joint Non-negative Matrix Factorization (jNMF)
    • Zi, Yang, et al., A non-negative matrix factorization method for detecting modules in heterogeneous omics multi-modal data, Bioinformatics, 32(1), 1-8, 2016
  • Partial Least Squares (PLS) based on Gradient Descent
    • Arora, R. et al., Stochastic Optimization for PCA and PLS, 2012 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton), 861-868, 2012
  • Non-negative CP Decomposition (NTF)
    • α-Divergence (KL, Pearson, Hellinger, Neyman) / β-Divergence (KL, Frobenius, IS)
      • Cichocki, A. et al., Non-negative Tensor Factorization using Alpha and Beta Divergence, ICASSP '07, III-1393-III-1396, 2007
      • mathieubray/TensorKPD.R
    • Fast HALS
      • Phan, A. H. et al., Multi-way Nonnegative Tensor Factorization Using Fast Hierarchical Alternating Least Squares Algorithm (HALS), NOLTA 2008, 2008
    • α-HALS/β-HALS
      • Cichocki, A. et al., Fast Local Algorithms for Large Scale Nonnegative Matrix and Tensor Factorizations, IEICE Transactions, 92-A, 708-721, 2009
  • Non-negative Tucker Decomposition (NTD)
    • Frobenius/KL
      • Kim, Y.-D. et al., Nonnegative Tucker Decomposition, IEEE CVPR, 1-8, 2007
    • α-Divergence (KL, Pearson, Hellinger, Neyman) / β-Divergence (KL, Frobenius, IS)
      • Kim, Y.-D. et al., Nonneegative Tucker Decomposition with Alpha-Divergence, 2008
      • Phan, A. H. et al., Fast and efficient algorithms for nonnegative Tucker decomposition, ISNN 2008, 772-782, 2008
    • Fast HALS
      • Phan, A. H. et al., Extended HALS algorithm for nonnegative Tucker decomposition and its applications for multiway analysis and classification, Neurocomputing, 74(11), 1956-1969, 2011

Contributing

If you have suggestions for how dcTensor could be improved, or want to report a bug, open an issue! We'd love all and any contributions.

For more, check out the Contributing Guide.

Authors

  • Koki Tsuyuzaki

Owner

  • Name: RIKEN BiT
  • Login: rikenbit
  • Kind: organization
  • Location: Japan

Laboratory for Bioinformatics Research, RIKEN Center for Biosystems Dynamics Research

JOSS Publication

dcTensor: An R package for discrete matrix/tensor decomposition
Published
August 25, 2023
Volume 8, Issue 88, Page 5664
Authors
Koki Tsuyuzaki ORCID
Department of Artificial Intelligence Medicine, Graduate School of Medicine, Chiba University, Japan, Laboratory for Bioinformatics Research, RIKEN Center for Biosystems Dynamics Research, Japan
Editor
Patrick Diehl ORCID
Tags
discrete matrix factorization discrete tensor factorization dimension reduction

GitHub Events

Total
Last Year

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 21
  • Total Committers: 2
  • Avg Commits per committer: 10.5
  • Development Distribution Score (DDS): 0.143
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
koki k****r@h****p 18
露崎弘毅 k****i@d****p 3
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 8
  • Total pull requests: 0
  • Average time to close issues: 19 days
  • Average time to close pull requests: N/A
  • Total issue authors: 1
  • Total pull request authors: 0
  • Average comments per issue: 2.63
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • kokitsuyuzaki (7)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 315 last-month
  • Total docker downloads: 325
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 7
  • Total maintainers: 1
cran.r-project.org: dcTensor

Discrete Matrix/Tensor Decomposition

  • Versions: 7
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 315 Last month
  • Docker Downloads: 325
Rankings
Forks count: 28.8%
Dependent packages count: 29.8%
Average: 32.6%
Downloads: 33.6%
Stargazers count: 35.2%
Dependent repos count: 35.5%
Last synced: 6 months ago

Dependencies

.github/workflows/build_test_push.yml actions
  • actions/checkout v2 composite
  • docker/build-push-action v1 composite
  • docker/login-action v1 composite
DESCRIPTION cran
  • R >= 3.4.0 depends
  • MASS * imports
  • fields * imports
  • methods * imports
  • nnTensor * imports
  • rTensor * imports
  • knitr * suggests
  • rmarkdown * suggests
  • testthat * suggests
Dockerfile docker
  • bioconductor/bioconductor_docker devel build