Recent Releases of eigencuda

eigencuda - Version 0.4.0

Offload the Eigen3 matrix-matrix multiplication to an Nvidia GPU using CUBLAS.

Changed

Split the memory management (CudaMatrix) from the CUBLAS invocation (CudaPipeline)
Moved all the allocation to the smart pointers inside CudaMatrix
Removed unused headers
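
The split described above can be illustrated with a minimal host-only sketch. It is not the eigencuda implementation: `OwnedMatrix`, `Pipeline`, and `DeviceDeleter` are hypothetical names, `std::malloc`/`std::free` stand in for `cudaMalloc`/`cudaFree`, and a naive loop stands in for the CUBLAS call. The point is only the division of labour: one class owns the buffer through a smart pointer, the other borrows it to compute.

```cpp
#include <cstddef>
#include <cstdlib>
#include <memory>
#include <new>

// Stand-in deleter (assumption): real CUDA code would call cudaFree here.
struct DeviceDeleter {
  void operator()(double* ptr) const noexcept { std::free(ptr); }
};

using DevicePtr = std::unique_ptr<double, DeviceDeleter>;

// Hypothetical analogue of the memory-management side: owns the buffer.
// Data is stored column-major, matching Eigen's default layout.
class OwnedMatrix {
 public:
  OwnedMatrix(std::size_t rows, std::size_t cols)
      : rows_(rows),
        cols_(cols),
        data_(static_cast<double*>(std::malloc(rows * cols * sizeof(double)))) {
    if (!data_ && rows * cols > 0) throw std::bad_alloc();
  }

  std::size_t rows() const { return rows_; }
  std::size_t cols() const { return cols_; }
  double* data() { return data_.get(); }
  const double* data() const { return data_.get(); }

 private:
  std::size_t rows_;
  std::size_t cols_;
  DevicePtr data_;  // released automatically when the matrix goes out of scope
};

// Hypothetical analogue of the compute side: borrows matrices, owns no data.
// The triple loop stands in for a CUBLAS gemm call.
struct Pipeline {
  void gemm(const OwnedMatrix& A, const OwnedMatrix& B, OwnedMatrix& C) const {
    for (std::size_t j = 0; j < B.cols(); ++j) {
      for (std::size_t i = 0; i < A.rows(); ++i) {
        double sum = 0.0;
        for (std::size_t k = 0; k < A.cols(); ++k) {
          sum += A.data()[i + k * A.rows()] * B.data()[k + j * B.rows()];
        }
        C.data()[i + j * C.rows()] = sum;
      }
    }
  }
};
```

Because the deleter is part of the pointer type, no explicit free call is needed on any exit path, including when an exception is thrown between allocation and use.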

- C++
Published by felipeZ over 6 years ago

eigencuda - Introduced smart pointer

[0.3.0] 26/09/2019

Added

Smart pointers to handle cuda resources
New CudaMatrix class
Check available memory in the GPU before computing
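
A memory check of this kind might look roughly like the sketch below. It is an illustration under stated assumptions, not the project's code: the function names are hypothetical, and the free-memory figure is passed in as a parameter so the example runs without a GPU. In real CUDA code that figure would typically come from `cudaMemGetInfo`.

```cpp
#include <cstddef>
#include <stdexcept>
#include <string>

// Bytes needed for a dense rows x cols matrix of doubles.
constexpr std::size_t matrix_bytes(std::size_t rows, std::size_t cols) {
  return rows * cols * sizeof(double);
}

// Bytes needed to hold A (m x k), B (k x n) and the result C (m x n).
constexpr std::size_t gemm_bytes(std::size_t m, std::size_t k, std::size_t n) {
  return matrix_bytes(m, k) + matrix_bytes(k, n) + matrix_bytes(m, n);
}

// Hypothetical guard: throws before any allocation if the operands cannot fit.
// free_bytes is supplied by the caller (assumption: queried from the device).
void check_available_memory(std::size_t free_bytes, std::size_t required_bytes) {
  if (required_bytes > free_bytes) {
    throw std::runtime_error("Not enough GPU memory: need " +
                             std::to_string(required_bytes) + " bytes, " +
                             std::to_string(free_bytes) + " available");
  }
}
```

Failing early with a descriptive error is usually easier to diagnose than an allocation failure in the middle of a sequence of copies and kernel launches.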

Removed

Template class (the implementation is now only available for double)
Triple tensor product
Shapes struct

- C++
Published by felipeZ over 6 years ago

eigencuda - Tensor matrix multiplication

[0.2.0] 27/08/2019

Added

Tensor matrix multiplication using gemmbatched
Async calls to memory copies.
Properly free memory after the tensor operation is done.
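
As a rough host-side reference for what a batched multiplication computes, the sketch below multiplies every matrix in a batch by the same right-hand matrix. It assumes that "tensor" here means a list of equally sized matrices and that the operation is C[i] = A[i] * B; the release notes do not spell this out. `tensor_times_matrix` is a hypothetical name, and the loops run on the CPU rather than calling CUBLAS.

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Column-major dense matrix stored in a flat vector (Eigen's default layout).
using Matrix = std::vector<double>;

// Computes C[i] = A[i] * B for every A[i] in the batch.
// Each A[i] is m x k, B is k x n, and each C[i] is m x n.
std::vector<Matrix> tensor_times_matrix(const std::vector<Matrix>& tensor,
                                        const Matrix& B, std::size_t m,
                                        std::size_t k, std::size_t n) {
  std::vector<Matrix> result;
  result.reserve(tensor.size());
  for (const Matrix& A : tensor) {
    Matrix C(m * n, 0.0);
    for (std::size_t j = 0; j < n; ++j) {
      for (std::size_t p = 0; p < k; ++p) {
        for (std::size_t i = 0; i < m; ++i) {
          C[i + j * m] += A[i + p * m] * B[p + j * k];
        }
      }
    }
    result.push_back(std::move(C));
  }
  return result;
}
```

A batched GPU routine performs the same set of independent products in a single call, which tends to pay off when the individual matrices are too small to keep the device busy on their own.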

- C++
Published by felipeZ almost 7 years ago