Recent Releases of simulateqcd
simulateqcd - v1.2.0
This release primarily includes improvements for AMD GPUs
Changelog: - Communication streams have been moved to CommunicationBase - Added launch bounds for HIP kernel - Improved RHMC for AMD GPUs - Manual loop unrolling in Dslash for AMD GPUs - Updated profiling applications - added more profiling applications for development: - axpy - triad - 7linkprof - cgprofiling
- C++
Published by lukas-mazur about 2 years ago
simulateqcd - v1.1.0
Changelog:
- Simplified Haloloop
- Added P2P support on AMD GPUs
- Added Marker API for Profiling for NVIDIA and AMD GPUs
- blocksize fix in runfunctor
- CMakeList.txt update
- Multi-RHS dslash improvements
- Fixed clang warnings
- Various bug fixes
- C++
Published by lukas-mazur over 2 years ago
simulateqcd - v1.0.1
Improved performance for AMD GPUs.
- C++
Published by lukas-mazur over 2 years ago
simulateqcd - v1.0.0
This is SIMULATeQCD version 1.0.0. The first release version.
- C++
Published by lukas-mazur almost 3 years ago