Recent Releases of eigencuda
eigencuda - Version 0.4.0
Offload the Eigen3 matrix-matrix multiplication to an Nvidia GPU using CUBLAS.
Changed
- Split the memory management (
CudaMatrix) from the CUBLAS invocation (CudaPipeline) - Moved all the allocation to the smart pointers inside
CudaMatrix- Removed unused headers
- C++
Published by felipeZ over 6 years ago
eigencuda - Introduced smart pointer
[0.3.0] 26/09/2019
Added
- Smart pointers to handle cuda resources
- New CudaMatrix class
- Check available memory in the GPU before computing
Removed
- Template class, implementation only available for double
- Triple tensor product
- Shapes struct
- C++
Published by felipeZ over 6 years ago
eigencuda - Tensor matrix multiplication
[0.2.0] 27/08/2019
Added
- Tensor matrix multiplacation using gemmbatched
- Async calls to memory copies.
- Properly free memory after the tensor operation is done.
- C++
Published by felipeZ almost 7 years ago