Skip to content

AOMP Release 0.6-3

Pre-release
Pre-release
Compare
Choose a tag to compare
@gregrodgers gregrodgers released this 28 May 18:20
· 1017 commits to master since this release

Like 0.6-2, this release is based off the stable version of clang/llvm 8.0.

These changes are from 0.6-2.

  • New support for synchronous services called hostcall.
  • The source to support hostcall can be found in a new repository called aomp-extras in the hostcall directory
  • There are minor changes to atmi to support hostcall. These are in branch atmi-0.5-063.
  • Removed printf end-of-kernel service and added to hostcall. printf is now much more reliable from the gpu.
  • Enhancements to toolchain to support static device libraries
  • fix to correctly pickup math functions from libm-.bc . Previously it was seeing math functions as builtins.
  • Suppress calls to __kmpc_push_target_count for host code, resolves undefined reference.
  • Allow -frtti flag to be honored if user requests it on command line.
  • Add AOMP/include path before /usr/local/include to pick up correct header for omp.h.
  • Generate Metadata for both SPMD and Generic offload targets.
  • Honor OMP_TEAM_LIMIT for work groups, just like OMP_NUM_TEAMS.
  • Added *_wg_size symbol to reflect compile time known thread limit for a kernel.
  • Added support to openmp runtimes to support 1024 threads per team/work group.
  • Reenabled SILoadStoreOptimizer pass after pulling upstream fix for scalar carry corruption.
  • Fixed amdgcn noinline and alwaysinline incompatibility issue for the Parallel Data Sharing Wrapper