Recent Releases of dwarf-p-cloudsc

dwarf-p-cloudsc - v1.5.3

What's Changed

  • CLOUDSCFIELDSTATE_MOD: fix device memory leak for packed storage option by @awnawab in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/90
  • Single-precision support for CUDA variants by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/91
  • Single precision and architecture files for SYCL variants by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/92
  • Use single precision math functions for CUDA variants by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/95
  • loki_transform for CUF variants, bundle.yml fixup by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/99
  • Single-precision support for HIP variants by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/93
  • Add Lumi CCE17 (17.0.1 with rocm 6.0.3) toolchain and env file by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/97
  • Loki transformations from config file by @mlange05 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/94
  • Updated Field API variants of the clouds dwarf (CPU and GPU) by @wertysas in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/96
  • new variants OpenACC and OpenMP by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/88
  • OpenMP update module vars by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/101
  • omp k caching fix by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/102
  • New Field API Loki variant with state types in driver by @wertysas in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/100
  • Loki: Always inline stmt functions from C-headers by @mlange05 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/105
  • CUDA variant optimised beyond k-caching by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/104
  • HDF5 file access rights by @7schroet in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/108
  • Loki: C transpilation from pipeline in config file by @mlange05 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/106
  • Loki: Small fixes and clean-up; EC_PMON removal by @mlange05 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/111
  • CLOUDSC: Purge Loki-CLAW and GPU-CLAW variants from dwarf by @mlange05 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/110
  • Fix single-precision for manually implemented SCC-STACK OpenACC GPU variant by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/109
  • Add CLA to README by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/114

New Contributors

  • @wertysas made their first contribution in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/96
  • @7schroet made their first contribution in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/108

Full Changelog: https://github.com/ecmwf-ifs/dwarf-p-cloudsc/compare/v1.5.2...v1.5.3

- Fortran
Published by reuterbal 12 months ago

dwarf-p-cloudsc - v1.5.2

This bugfix release points to the correct version tag for FIELD_API and fixes Loki processing when the OMNI frontend is used instead of the default FParser2.

What's Changed

  • Ensure yoecldp.F90 is parsed with FParser by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/82
  • Mark projects no longer as optional and fix field-api to v0.3.0 by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/83
  • Version 1.5.2 by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/84

Full Changelog: https://github.com/ecmwf-ifs/dwarf-p-cloudsc/compare/v1.5.1...v1.5.2

Published by reuterbal almost 2 years ago

dwarf-p-cloudsc - v1.5.1

This bugfix release fixes a performance bug in the Loki-generated variants, restoring the expected throughput on GPUs.

What's Changed

  • Loki: enable imports and mark headers as ignored instead of blocked by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/80
  • v1.5.1 by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/81

Full Changelog: https://github.com/ecmwf-ifs/dwarf-p-cloudsc/compare/v1.5.0...v1.5.1

Published by reuterbal almost 2 years ago

dwarf-p-cloudsc - v1.5.0

What's New

  • A new variant based on Atlas, supporting FieldSet and MultiField storage backends (#60)
  • A new GPU-optimised OpenACC variant "k-caching" (#46)
  • HIP implementations for SCC, SCC-HOIST, SCC-K-CACHING (#59)
  • A new Python-interface variant with a Python driver that calls the Fortran code (#38)
  • A Python implementation generated via Loki (#51)
  • SYCL implementations for SCC, SCC-HOIST, SCC-K-CACHING (#64)
  • HDF5 input support for C-based variants
  • Update to the FIELD_API variant to use the new open-source FIELD_API library
  • Several updated or new architecture files

What's Changed

  • Remove obsolete architectures, add march=znver2 option to gnu by @piotrows in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/45
  • Cloudsc variant using SCC optimization with field_api (standalone repo) by @awnawab in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/25
  • Openacc k-caching variant by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/46
  • PyIface - Fortran-Python bridging with CMake compilation by @piotrows in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/38
  • Python: Adapt to latest GT4Py release by @stubbiali in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/50
  • Pure Python variant with test setup by @mlange05 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/51
  • Python-gt4py: Removing GT4Py-based implementation of CLOUDSC by @mlange05 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/52
  • Pool allocator via Loki by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/49
  • Enable gvmode and add nvidia/22.11 arch files by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/55
  • Switch to CTest for CI and add NVHPC 23.5 to CI testing by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/56
  • Architecture files for Leonardo by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/58
  • Bundle integration for Atlas by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/57
  • New Atlas-based variant using BlockStructured FunctionSpace by @sbrdar in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/54
  • CLOUDSC HIP (SCC, SCC-HOIST, SCC-K-CACHING) by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/59
  • Loki config update by @mlange05 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/61
  • HIP updates by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/63
  • SYCL-specific env and README entry by @mlange05 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/68
  • Introducing SYCL implementations/variants by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/64
  • Cuda update: metric col/s by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/67
  • Update to new open-source FIELD_API by @awnawab in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/70
  • use Atlas structures with an IFS-like variable batching by @sbrdar in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/60
  • Loki F2C using loki_transform instead of loki_transform_transpile by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/69
  • Emancipation from serialbox via HDF5 for C-style variants by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/62
  • Fix LUMI-G archs for HDF5 in HIP variants by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/72
  • HDF5 support for CUDA/HIP/SYCL variants by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/71
  • Adjust CUDA-specific loki config file to Loki v0.2.0 by @mlange05 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/75
  • Add semantic for reading thread count from $OMP_NUM_THREADS by @antoine-morvan in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/73
  • Compatibility with Loki v0.2 by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/74
  • Update Github actions versions to remove node.js warnings by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/76
  • Update loki version to 0.2.0 and field_api to 0.3.0 by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/77
  • Version 1.5.0 by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/78
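
As an illustration of the thread-count change listed above, the following is a minimal Python sketch of reading a thread count from the standard OpenMP environment variable OMP_NUM_THREADS. It is not the dwarf's own Fortran or C implementation. The function name `thread_count_from_env`, the fallback default of 1, and the handling of comma-separated values are assumptions made for this example.

```python
import os


def thread_count_from_env(default=1, env=None):
    """Return the thread count from OMP_NUM_THREADS, or `default` if unset or invalid.

    Hypothetical helper for illustration only; not part of dwarf-p-cloudsc.
    """
    env = os.environ if env is None else env
    raw = env.get("OMP_NUM_THREADS", "")
    # OpenMP allows a comma-separated list for nested parallelism;
    # this sketch only looks at the first (outermost) entry.
    first = raw.split(",")[0].strip()
    try:
        value = int(first)
    except ValueError:
        return default
    # Reject zero or negative values and fall back to the default.
    return value if value > 0 else default


if __name__ == "__main__":
    print(thread_count_from_env())
```

Passing `env` explicitly keeps the helper easy to test without modifying the process environment.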

New Contributors

  • @awnawab made their first contribution in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/25
  • @sbrdar made their first contribution in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/54
  • @antoine-morvan made their first contribution in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/73

Full Changelog: https://github.com/ecmwf-ifs/dwarf-p-cloudsc/compare/v1.4.0...v1.5.0

Published by reuterbal almost 2 years ago

dwarf-p-cloudsc - Version 1.4.0

What's Changed

  • Remove dependence on the global modules from the cloudsc-fortran by @piotrows in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/35
  • Bug-fix for Loki-SCC/H array demotion by @mlange05 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/39
  • New Loki SCC-CUF variant by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/31
  • CUDA Fortran (CUF) 'k-caching' version (further optimized version) by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/33
  • Add col/s throughput metric in performance reports, add vector_length(NPROMA) to gang loops by @reuterbal in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/42
  • Initial FIELD API and gpu-scc-field variant by @mlange05 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/41
  • CLOUDSC SCC-CUDA-C (semi-automatic) implementations by @MichaelSt98 in https://github.com/ecmwf-ifs/dwarf-p-cloudsc/pull/40
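
The col/s throughput metric mentioned in the list above can be read as grid columns processed per second of wall-clock time. The sketch below assumes exactly that definition (total columns divided by elapsed seconds); the dwarf's actual performance report may compute or aggregate it differently. The function name and the sample numbers are hypothetical.

```python
def columns_per_second(num_columns, elapsed_seconds):
    """Throughput in columns per second, assuming col/s = columns / wall-clock time.

    Hypothetical helper for illustration only; not part of dwarf-p-cloudsc.
    """
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return num_columns / elapsed_seconds


if __name__ == "__main__":
    # Example: 163840 columns processed in 0.5 s of wall-clock time.
    print(columns_per_second(163840, 0.5))  # prints 327680.0
```

A metric of this form is independent of the floating-point operation count, so it stays comparable across variants that restructure the arithmetic.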

Full Changelog: https://github.com/ecmwf-ifs/dwarf-p-cloudsc/compare/v1.3.0...v1.4.0

Published by reuterbal about 3 years ago

dwarf-p-cloudsc - Version 1.3.0

This release includes a significant number of changes and new developments:

  • An OpenMP-offload variant (dwarf-cloudsc-gpu-omp-scc-hoist), derived from the OpenACC SCC-hoist variant, developed by L. Lucido (Atos).
  • A new CUDA Fortran implementation (dwarf-cloudsc-gpu-scc-cuf), implementing the SCC loop layout. This requires the --with-cuda flag to build.
  • A new Python implementation, based on GT4Py, capable of generating CPU and GPU code (using cupy and, optionally, DaCe).
  • Deprecation of the dwarf-cloudsc-gpu-claw variant, which no longer works correctly on recent NVIDIA software stacks. Building this variant requires adding an explicit --with-claw flag to the build command.
  • With Loki publicly available, the references to the Loki repository have been updated and testing of the source-to-source translation variants is now possible.
  • A JUBE benchmark configuration has been added to ease testing across various platforms.
  • Clean-up of the CMake scripts.
  • New arch files for:
    • HPC2020, the ECMWF Atos system in Bologna
    • LUMI
    • MeluXina
    • Additional Isambard partitions

Published by reuterbal about 3 years ago

dwarf-p-cloudsc - Version 1.2.0

Published by reuterbal about 4 years ago