Recent Releases of rocprofiler-compute
rocprofiler-compute - rocprofiler-compute 3.1.1 for rocm-6.4.3
ROCm release v6.4.3
- Python
Published by rocm-ci 10 months ago
rocprofiler-compute - rocprofiler-compute 3.1.1 for rocm-6.4.2
ROCm release v6.4.2
- Python
Published by rocm-ci 11 months ago
rocprofiler-compute - rocprofiler-compute 3.1.0 for rocm-6.4.1
ROCm release v6.4.1
- Python
Published by rocm-ci about 1 year ago
rocprofiler-compute - rocprofiler-compute 3.1.0 for rocm-6.4.0
ROCm release v6.4.0
- Python
Published by rocm-ci about 1 year ago
rocprofiler-compute - https://github.com/ROCm/rocprofiler-compute/releases/tag/rocm-6.2.4
ROCm release v6.2.4
- Python
Published by rocm-ci over 1 year ago
rocprofiler-compute - Omniperf 2.1.0 for ROCm 6.2.2
Changes
- enable rocprofv1 for MI300 hardware (#391)
- refactoring and updating documentation (#362, #394, #398, #414, #420)
- branch renaming and workflow updates (#389, #404, #409)
- bug fix for analysis output
- add dependency checks on application launch (#393)
- patch for profiling multi-process/multi-GPU applications (#376, #396)
- packaging updates (#386)
- rename CHANGES to CHANGELOG.md (#410)
- rollback Grafana version in Dockerfile for Angular plugin compatibility (#416)
- enable CI triggers for Azure CI (#426)
- add GPU model distinction for MI300 systems (#423)
- new MAINTAINERS.md guide for omniperf publishing procedures (#402)
Optimizations
- reduced running time of Omniperf when profiling (#384)
- console logging improvements
- Python
Published by github-actions[bot] over 1 year ago
rocprofiler-compute - https://github.com/ROCm/rocprofiler-compute/releases/tag/rocm-6.2.2
ROCm release v6.2.2
- Python
Published by rocm-ci over 1 year ago
rocprofiler-compute - https://github.com/ROCm/rocprofiler-compute/releases/tag/rocm-6.2.1
ROCm release v6.2.1
- Python
Published by rocm-ci over 1 year ago
rocprofiler-compute - https://github.com/ROCm/rocprofiler-compute/releases/tag/rocm-6.2.0
ROCm release v6.2.0
- Python
Published by rocm-ci almost 2 years ago
rocprofiler-compute - v2.0.1 (03 June 2024)
Updates
- new option to force hardware target via OMNIPERF_ARCH_OVERRIDE global (#370)
- CI/CD support for MI300 hardware (#373)
- cmake build improvements (#374)
- support for MI308X hardware (#375)
Documentation available at https://rocm.github.io/omniperf/
Associated release tarball: omniperf-v2.0.1.tar.gz
- Python
Published by coleramos425 almost 2 years ago
rocprofiler-compute - v2.0.0 (17 May 2024)
Updates
- improved logging that spans all modes (#177) (#317) (#335) (#341)
- overhauled CI/CD that spans all modes (#179)
- extensible SoC classes to support easy addition of new hardware configs (#180)
- --kernel-verbose no longer overwrites kernel names (#193)
- general cleanup and improved organization of source code (#200) (#210)
- separate requirement files for docs and testing dependencies (#205) (#262) (#358)
- add support for MI300 hardware (#231)
- upgrade Grafana assets and build script to latest release (#235)
- update minimum ROCm and Python requirements (#277)
- sort rocprofiler input files prior to profiling (#304)
- new --quiet option will suppress verbose output and show a progress bar (#308)
- roofline support for Ubuntu 22.04 (#319)
Documentation available at https://rocm.github.io/omniperf/
Associated release tarball: omniperf-v2.0.0.tar.gz
- Python
Published by coleramos425 about 2 years ago
rocprofiler-compute - v2.0.0 RC #1 (15 March 2024)
This is the first release candidate for a forthcoming v2.0.0 release expected in March 2024. The 2.0 release provides a significant refactor of the underlying code base and introduces support for MI300.
Tech Preview documentation available at: https://amdresearch.github.io/omniperf/2.x
Associated release tarball: omniperf-2.0.0-RC1.tar.gz
- Python
Published by koomie about 2 years ago
rocprofiler-compute - v2.0.0 Tech Preview #1 (03 March 2024)
This is a tech preview release for a forthcoming v2.0.0 release expected in March 2024. The 2.0 release provides a significant refactor of the underlying code base and introduces support for MI300.
Tech Preview documentation available at: https://amdresearch.github.io/omniperf/2.x
Associated release tarball: omniperf-2.0.0-Tech-Preview1.tar.gz
- Python
Published by koomie about 2 years ago
rocprofiler-compute - v1.1.0-PR1 (13 October 2023)
Updates
- standardize headers to use 'avg' instead of 'mean'
- add color code thresholds to standalone gui to match grafana
- modify kernel name shortener to use cpp_filt (#168)
- enable stochastic kernel dispatch selection (#183)
- patch grafana plugin module to address a known issue in the latest version (#186)
- enhanced communication between analyze mode kernel flags (#187)
Documentation available at https://amdresearch.github.io/omniperf
Associated release tarball: omniperf-v1.1.0-PR1.tar.gz
- Python
Published by coleramos425 over 2 years ago
rocprofiler-compute - v1.0.10 (22 August 2023)
Updates
- critical patch for detection of llvm in rocm installs on SLURM systems
Documentation available at https://amdresearch.github.io/omniperf
Associated release tarball: omniperf-v1.0.10.tar.gz
- Python
Published by coleramos425 almost 3 years ago
rocprofiler-compute - v1.0.9 (17 August 2023)
Updates
- add units to L2 per-channel panel (#133)
- new quickstart guide for Grafana setup in docs (#135)
- more detail on kernel and dispatch filtering in docs (#136, #137)
- patch manual join utility for ROCm >5.2.x (#139)
- add % of peak values to low level speed-of-light panels (#140)
- patch critical bug in Grafana by removing a deprecated plugin (#141)
- enhancements to KernelName demangler (#142)
- general metric updates and enhancements (#144, #155, #159)
- add min/max/avg breakdown to instruction mix panel (#154)
Documentation available at https://amdresearch.github.io/omniperf
Associated release tarball: omniperf-v1.0.9.tar.gz
- Python
Published by coleramos425 almost 3 years ago
rocprofiler-compute - v1.0.8 (30 May 2023)
PR1 Updates
- add --kernel-names option to toggle kernelName overlay in standalone roofline plot (#93)
- remove unused python modules (#96)
- fix empirical roofline calculation for single dispatch workloads (#97)
- match color of arithmetic intensity points to corresponding bw lines
PR2 Updates
- ux improvements in standalone GUI (#101)
- enhanced readability for filtering dropdowns in standalone GUI (#102)
- new logfile to capture rocprofiler output (#106)
- roofline support for sles15 sp4 and future service packs (#109)
- adding dockerfiles for all supported Linux distros
- new examples for --roof-only and --kernel options added to documentation
Additional Updates
- enable cli analysis in Windows (#110)
- optional random port number in standalone GUI (#111)
- limit length of visible kernelName in --kernel-names option (#115)
- adjust metric definitions (#117, #130)
- manually merge rocprof runs, overriding default rocprofiler implementation (#125)
- fixed compatibility issues with Python 3.11 (#131)
Documentation available at https://amdresearch.github.io/omniperf
Associated release tarball: omniperf-v1.0.8.tar.gz
- Python
Published by coleramos425 about 3 years ago
rocprofiler-compute - v1.0.8-PR2 (17 April 2023)
Updates
- ux improvements in standalone GUI (#101)
- enhanced readability for filtering dropdowns in standalone GUI (#102)
- new logfile to capture rocprofiler output (#106)
- roofline support for sles15 sp4 and future service packs (#109)
- adding dockerfiles for all supported Linux distros
- new examples for --roof-only and --kernel options added to documentation
Documentation available at https://amdresearch.github.io/omniperf
Associated release tarball: omniperf-v1.0.8-PR2.tar.gz
- Python
Published by coleramos425 about 3 years ago
rocprofiler-compute - v1.0.8-PR1 (13 March 2023)
Updates
- add --kernel-names option to toggle kernelName overlay in standalone roofline plot (#93)
- remove unused python modules (#96)
- fix empirical roofline calculation for single dispatch workloads (#97)
- match color of arithmetic intensity points to corresponding bw lines
Documentation available at https://amdresearch.github.io/omniperf
Associated release tarball: omniperf-v1.0.8-PR1.tar.gz
- Python
Published by coleramos425 about 3 years ago
rocprofiler-compute - v1.0.7 (22 February 2023)
Updates
- update documentation (#52, #64)
- improved detection of invalid command line arguments (#58, #76)
- enhancements to standalone roofline (#61)
- enable Omniperf on systems with X-server (#62)
- raise minimum version requirement for rocm (#64)
- enable baseline comparison in CLI analysis (#65)
- add multi-normalization to new metrics (#68, #81)
- support alternative profilers (#70)
- add MI100 configs to override rocprofiler's incomplete default (#75)
- improve error message when no GPU(s) detected (#85)
- separate CI tests by Linux distro and add status badges
Documentation available at https://amdresearch.github.io/omniperf
Associated release tarball: omniperf-v1.0.7.tar.gz
- Python
Published by coleramos425 over 3 years ago
rocprofiler-compute - v1.0.6 (21 December 2022)
Updates
- CI update: documentation now published via github action (#22)
- better error detection for incomplete ROCm installs (#56)
Documentation available at https://amdresearch.github.io/omniperf
Associated release tarball: omniperf-v1.0.6.tar.gz
- Python
Published by koomie over 3 years ago
rocprofiler-compute - v1.0.5 (13 December 2022)
Updates
- store application command-line parameters in profiling output (#27)
- enable additional normalizations in CLI mode (#30)
- add missing ubuntu 20.04 roofline binary to packaging (#34)
- update L1 bandwidth metric calculations (#36)
- add L1 <-> L2 bandwidth calculation (#37)
- documentation updates (#38, #41)
- enhanced subprocess logging to identify critical errors in rocprofiler (#50)
- maintain git sha in production installs from tarball (#53)
Documentation available at https://amdresearch.github.io/omniperf
Associated release tarball: omniperf-v1.0.5.tar.gz
- Python
Published by koomie over 3 years ago
rocprofiler-compute - v1.0.4 (11 November 2022)
Updates
- update python requirements.txt with minimum versions for numpy and pandas
- addition of progress bar indicator in web-based GUI (#8)
- reduced default content for web-based GUI to reduce load times (#9)
- minor packaging and CI updates
- variety of documentation updates
- added an optional argument to vcopy.cpp workload example to specify device id
Documentation available at https://amdresearch.github.io/omniperf
Associated release tarball: omniperf-v1.0.4.tar.gz
- Python
Published by koomie over 3 years ago
rocprofiler-compute - v1.0.3 (07 November 2022)
Initial release of the Omniperf performance profiling tool:
- Documentation available at https://amdresearch.github.io/omniperf
- Associated release tarball: omniperf-v1.0.3.tar.gz
- Python
Published by koomie over 3 years ago
rocprofiler-compute - 1.0.3 RC1
Release candidate #1 for v1.0.3 release of the Omniperf performance profiling tool:
- Documentation available at https://amdresearch.github.io/omniperf
- Associated release tarball: omniperf-v1.0.3-RC1.tar.gz
- Python
Published by koomie over 3 years ago