Skip to content

AOMP Release 0.6-4

Pre-release
Pre-release
Compare
Choose a tag to compare
@gregrodgers gregrodgers released this 17 Jun 13:31
· 976 commits to master since this release

Like 0.6-3, this release 0.6-4 of aomp is based off the stable version of clang/llvm 8.0.

These are the changes found in 0.6-4 compared to the previous 0.6-3 release.

  • support for building on SLES15 SP1
  • rpm package for SLES15 SP1
  • do not create a host thread for GPU hostcall services if no services are used by any kernel in the application. This fixes a performance regression we saw with openmpapps in 0.6-3 because none of those apps currently use printf on the device. This still needs more study.
  • Reorganized the github README and linked pages to make it less confusing and to ready support for more platforms.
  • removed hip wrapper scripts such as hipcc. Users must compile hip with clang++ as demonstrated in the examples to get openmp support with hip.
  • properly set amdgpu-flat-work-group-size for generic mode: add wave_size
  • add -lelf to link step of libomptarget.rtl.hsa.so
  • more gracefully exit when gpu arch of kernel does not match device arch
  • refine LIBPOMPTARGET_KERNEL_TRACE 1=>minimal, 2=>verbos'er