[SYCL][UR] Make v2 L0 the default (L0) adapter for BMG and newer #19333

igchor · 2025-07-07T21:18:42Z

The loader will iterate over all adapters and call urAdapterGet() on them. The adapters will return numAdapters, which will always be 1 for CUDA, HIP, OpenCL and NativeCPU and either 1 or 0 for L0. If we are on BMG or newer, the v2 adapter will return 1, otherwise, the legacy will return 1.

llvm-lit still respects sycl_devices: level_zero:gpu means legacy adapter and level_zero_v2:gpu means v2 adapter.

This can be changed in later PRs.

pbalcer

lgtm in general. But I think this patch should also consolidate building v1/v2 under the same CMake flag. When this is merged, both .so files will be required for the runtime to function properly on all systems.

This is needed so that we can enable V2 adapter by default on certain platforms: #19333 The reason is that we need to load both adapters (legacy and v2) to check the device version. However, loading v2 adapter causes L0 loader to emit logs for all API calls (if ZE_DEBUG=1 is set). Since the legacy adapter used different logic for printing API calls, this would result in printing the same logs twice. This patch fixes that.

Whenever we detect ANY device newer than BMG on the platform we will use V2, otherwise we will use V1. The default behavior can still be overwritten by setting SYCL_UR_USE_LEVEL_ZERO_V2=1 to use V2 adapter or SYCL_UR_USE_LEVEL_ZERO_V2=0 to use V1 adapter.

igchor · 2025-07-10T16:24:43Z

lgtm in general. But I think this patch should also consolidate building v1/v2 under the same CMake flag. When this is merged, both .so files will be required for the runtime to function properly on all systems.

That's a good point, but I we'll need to modify the logic for UR conformance tests as they rely on the cmake var to decide which adapter to use. Ideally, we could specify that during runtime or just have the tests be run for all adapters (both legacy and v2). I think I would prefer to do that in a separate PR.

igchor · 2025-07-10T17:39:11Z

lgtm in general. But I think this patch should also consolidate building v1/v2 under the same CMake flag. When this is merged, both .so files will be required for the runtime to function properly on all systems.

That's a good point, but I we'll need to modify the logic for UR conformance tests as they rely on the cmake var to decide which adapter to use. Ideally, we could specify that during runtime or just have the tests be run for all adapters (both legacy and v2). I think I would prefer to do that in a separate PR.

Unless we just remove the option to select the adapter for testing entirely and just always use the default one for a given adapter. One problem with this though is that we still expect to run v2 on older hardware if there are multiple devices on the platform.

This is needed so that we can enable V2 adapter by default on certain platforms: intel/llvm#19333 The reason is that we need to load both adapters (legacy and v2) to check the device version. However, loading v2 adapter causes L0 loader to emit logs for all API calls (if ZE_DEBUG=1 is set). Since the legacy adapter used different logic for printing API calls, this would result in printing the same logs twice. This patch fixes that.

igchor · 2025-07-11T23:44:03Z

I decided to do the adapter filtering on urAdapterGet. The logic now looks like this:

The loader will iterate over all adapters and call urAdapterGet() on them. The adapters will return numAdapters, which will always be 1 for CUDA, HIP, OpenCL and NativeCPU and either 1 or 0 for L0. If we are on BMG or newer, the v2 adapter will return 1, otherwise, the legacy will return 1.

@nrspruit could you take a look at changes in level_zero/adapter.cpp? I removed the PlatformsCache and now, I initialize all platforms during adapter init (in urAdapterGet). This is needed so that we can check the device id and decide whether we should use V1 or V2.

igchor · 2025-07-11T23:44:49Z

Regression/reduction_resource_leak_dw.cpp started passing after my changes. I removed the XFAIL. As far as I can tell this is due to the adapter being initialized earlier, so the construction/destruction order changed and now the context is destroyed before the leak checker is triggered.

Unexpectedly Passed Tests (1):
  SYCL :: Regression/reduction_resource_leak_dw.cpp

This is needed so that we can enable V2 adapter by default on certain platforms: intel/llvm#19333 The reason is that we need to load both adapters (legacy and v2) to check the device version. However, loading v2 adapter causes L0 loader to emit logs for all API calls (if ZE_DEBUG=1 is set). Since the legacy adapter used different logic for printing API calls, this would result in printing the same logs twice. This patch fixes that.

nrspruit

This looks good to me, the split makes sense and the shared init looks clean, thanks for the changes!

igchor · 2025-07-14T20:38:22Z

@intel/sycl-graphs-reviewers could you please take a look? There are no graph-sepcific changes in this PR, but I do modify the logic for e2e tests and adapter loading logic.

againull

LGTM, I assume this means that v2 functionality is on par with v1 on BMG or newer. Also, probably it makes sense to do some testing downstream to see if there are any issues, if it hasn't been done yet.

igchor · 2025-07-14T20:46:37Z

LGTM, I assume this means that v2 functionality is on par with v1 on BMG or newer. Also, probably it makes sense to do some testing downstream to see if there are any issues, if it hasn't been done yet.

@againull Yes, we've already done testing on BMG and addressed most of the issues. We do have functional parity.
@intel/llvm-reviewers-runtime could you please review/approve?

igchor temporarily deployed to WindowsCILock July 7, 2025 21:18 — with GitHub Actions Inactive

igchor had a problem deploying to WindowsCILock July 7, 2025 21:46 — with GitHub Actions Failure

pbalcer reviewed Jul 8, 2025

View reviewed changes

igchor mentioned this pull request Jul 9, 2025

[SYCL][UR] Unify logging and leak checking for L0 v1 and v2 #19328

Merged

igchor force-pushed the v2_by_default branch from b1877ae to 7170ae0 Compare July 10, 2025 15:24

igchor temporarily deployed to WindowsCILock July 10, 2025 15:24 — with GitHub Actions Inactive

igchor changed the title ~~V2 by default~~ [SYCL][UR] Make v2 L0 the default (L0) adapter for BMG and newer Jul 10, 2025

fix warning

abb0daf

igchor had a problem deploying to WindowsCILock July 10, 2025 16:13 — with GitHub Actions Error

igchor had a problem deploying to WindowsCILock July 10, 2025 16:14 — with GitHub Actions Error

igchor marked this pull request as ready for review July 10, 2025 16:24

igchor requested review from a team as code owners July 10, 2025 16:24

igchor requested review from a team as code owners July 10, 2025 17:30

igchor requested review from reble and againull July 10, 2025 17:30

igchor temporarily deployed to WindowsCILock July 10, 2025 17:30 — with GitHub Actions Inactive

igchor temporarily deployed to WindowsCILock July 10, 2025 18:04 — with GitHub Actions Inactive

igchor had a problem deploying to WindowsCILock July 10, 2025 18:04 — with GitHub Actions Failure

igchor had a problem deploying to WindowsCILock July 10, 2025 20:48 — with GitHub Actions Error

igchor had a problem deploying to WindowsCILock July 10, 2025 20:57 — with GitHub Actions Failure

igchor temporarily deployed to WindowsCILock July 10, 2025 21:23 — with GitHub Actions Inactive

Use single var to enable both adapters

30fa2d8

igchor force-pushed the v2_by_default branch from 8adbfa5 to ba42634 Compare July 11, 2025 22:36

igchor had a problem deploying to WindowsCILock July 11, 2025 22:36 — with GitHub Actions Error

Skip adapters with no platforms in urAdapterGet

081b512

igchor force-pushed the v2_by_default branch from ba42634 to 081b512 Compare July 11, 2025 22:50

igchor temporarily deployed to WindowsCILock July 11, 2025 22:51 — with GitHub Actions Inactive

igchor had a problem deploying to WindowsCILock July 11, 2025 23:14 — with GitHub Actions Failure

igchor temporarily deployed to WindowsCILock July 14, 2025 15:19 — with GitHub Actions Inactive

igchor temporarily deployed to WindowsCILock July 14, 2025 15:55 — with GitHub Actions Inactive

Remove XFAIL

0466f0d

igchor mentioned this pull request Jul 14, 2025

reduction_resource_leak_dw.cpp failing on windows #16418

Closed

nrspruit approved these changes Jul 14, 2025

View reviewed changes

againull reviewed Jul 14, 2025

View reviewed changes

PatKamin approved these changes Jul 15, 2025

View reviewed changes

reble approved these changes Jul 15, 2025

View reviewed changes

againull self-requested a review July 15, 2025 15:47

againull approved these changes Jul 15, 2025

View reviewed changes

againull merged commit 1833f44 into intel:sycl Jul 15, 2025
35 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL][UR] Make v2 L0 the default (L0) adapter for BMG and newer #19333

[SYCL][UR] Make v2 L0 the default (L0) adapter for BMG and newer #19333

Uh oh!

igchor commented Jul 7, 2025 •

edited

Loading

Uh oh!

pbalcer left a comment

Uh oh!

igchor commented Jul 10, 2025

Uh oh!

igchor commented Jul 10, 2025 •

edited

Loading

Uh oh!

igchor commented Jul 11, 2025

Uh oh!

igchor commented Jul 11, 2025 •

edited

Loading

Uh oh!

nrspruit left a comment

Uh oh!

igchor commented Jul 14, 2025

Uh oh!

againull left a comment

Uh oh!

igchor commented Jul 14, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

[SYCL][UR] Make v2 L0 the default (L0) adapter for BMG and newer #19333

[SYCL][UR] Make v2 L0 the default (L0) adapter for BMG and newer #19333

Uh oh!

Conversation

igchor commented Jul 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pbalcer left a comment

Choose a reason for hiding this comment

Uh oh!

igchor commented Jul 10, 2025

Uh oh!

igchor commented Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

igchor commented Jul 11, 2025

Uh oh!

igchor commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nrspruit left a comment

Choose a reason for hiding this comment

Uh oh!

igchor commented Jul 14, 2025

Uh oh!

againull left a comment

Choose a reason for hiding this comment

Uh oh!

igchor commented Jul 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

igchor commented Jul 7, 2025 •

edited

Loading

igchor commented Jul 10, 2025 •

edited

Loading

igchor commented Jul 11, 2025 •

edited

Loading

igchor commented Jul 14, 2025 •

edited

Loading