Recent Releases of simulateqcd

simulateqcd - v1.2.0

This release primarily includes improvements for AMD GPUs.

Changelog:

  • Communication streams have been moved to CommunicationBase
  • Added launch bounds for HIP kernel
  • Improved RHMC for AMD GPUs
  • Manual loop unrolling in Dslash for AMD GPUs
  • Updated profiling applications
  • Added more profiling applications for development:
      • axpy
      • triad
      • 7linkprof
      • cgprofiling

- C++
Published by lukas-mazur about 2 years ago

simulateqcd - v1.1.0

Changelog:

  • Simplified Haloloop
  • Added P2P support on AMD GPUs
  • Added Marker API for Profiling for NVIDIA and AMD GPUs
  • Blocksize fix in runfunctor
  • CMakeLists.txt update
  • Multi-RHS dslash improvements
  • Fixed clang warnings
  • Various bug fixes

- C++
Published by lukas-mazur over 2 years ago

simulateqcd - v1.0.1

Improved performance for AMD GPUs.

- C++
Published by lukas-mazur over 2 years ago

simulateqcd - v1.0.0

This is SIMULATeQCD version 1.0.0, the first release version.

- C++
Published by lukas-mazur almost 3 years ago