AOMP Release 0.6-3
Pre-release
Pre-release
gregrodgers
released this
28 May 18:20
·
1017 commits
to master
since this release
Like 0.6-2, this release is based off the stable version of clang/llvm 8.0.
These changes are from 0.6-2.
- New support for synchronous services called hostcall.
- The source to support hostcall can be found in a new repository called aomp-extras in the hostcall directory
- There are minor changes to atmi to support hostcall. These are in branch atmi-0.5-063.
- Removed printf end-of-kernel service and added to hostcall. printf is now much more reliable from the gpu.
- Enhancements to toolchain to support static device libraries
- fix to correctly pickup math functions from libm-.bc . Previously it was seeing math functions as builtins.
- Suppress calls to __kmpc_push_target_count for host code, resolves undefined reference.
- Allow -frtti flag to be honored if user requests it on command line.
- Add AOMP/include path before /usr/local/include to pick up correct header for omp.h.
- Generate Metadata for both SPMD and Generic offload targets.
- Honor OMP_TEAM_LIMIT for work groups, just like OMP_NUM_TEAMS.
- Added *_wg_size symbol to reflect compile time known thread limit for a kernel.
- Added support to openmp runtimes to support 1024 threads per team/work group.
- Reenabled SILoadStoreOptimizer pass after pulling upstream fix for scalar carry corruption.
- Fixed amdgcn noinline and alwaysinline incompatibility issue for the Parallel Data Sharing Wrapper