
[Vulkan] ti.sync() changes program behavior on Vulkan #3791

Closed
AmesingFlank opened this issue Dec 14, 2021 · 5 comments · Fixed by #3818
Labels: potential bug (Something that looks like a bug but not yet confirmed)

Comments

@AmesingFlank (Collaborator) commented Dec 14, 2021

An example of ti.sync() changing program behavior on Vulkan: #3790. In that example, the program only behaves correctly when ti.sync() is added.

On Vulkan, we aggressively batch consecutive kernel launches into a single command buffer. We probably need to add a buffer_barrier (or something similar) between consecutive compute dispatches. I will investigate this.
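Roughly, the idea is a buffer memory barrier between two back-to-back dispatches recorded into the same command buffer. A minimal raw-Vulkan sketch, not the actual Taichi backend code; cmd, root_buffer, and the group counts are hypothetical placeholders:

// Sketch: make the writes of dispatch #1 available and visible to dispatch #2,
// all inside a single command buffer.
VkBufferMemoryBarrier barrier{};
barrier.sType = VK_STRUCTURE_TYPE_BUFFER_MEMORY_BARRIER;
barrier.srcAccessMask = VK_ACCESS_SHADER_WRITE_BIT;                              // writes of dispatch #1
barrier.dstAccessMask = VK_ACCESS_SHADER_READ_BIT | VK_ACCESS_SHADER_WRITE_BIT;  // accesses of dispatch #2
barrier.srcQueueFamilyIndex = VK_QUEUE_FAMILY_IGNORED;
barrier.dstQueueFamilyIndex = VK_QUEUE_FAMILY_IGNORED;
barrier.buffer = root_buffer;   // hypothetical: the buffer both kernels touch
barrier.offset = 0;
barrier.size = VK_WHOLE_SIZE;

vkCmdDispatch(cmd, groups_x, 1, 1);                         // kernel launch #1
vkCmdPipelineBarrier(cmd,
                     VK_PIPELINE_STAGE_COMPUTE_SHADER_BIT,  // src stage
                     VK_PIPELINE_STAGE_COMPUTE_SHADER_BIT,  // dst stage
                     0,
                     0, nullptr,
                     1, &barrier,
                     0, nullptr);
vkCmdDispatch(cmd, groups_x, 1, 1);                         // kernel launch #2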

AmesingFlank added the potential bug label Dec 14, 2021
AmesingFlank self-assigned this Dec 14, 2021
@bobcao3 (Collaborator) commented Dec 14, 2021

We do have barriers between them.

cmdlist->memory_barrier();
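For reference, a compute-to-compute global memory barrier in raw Vulkan would normally look like the sketch below; whether memory_barrier() maps to exactly these stage and access masks is an assumption, not something this thread confirms.

// Sketch of the usual compute-to-compute global barrier
// (assumed mapping of memory_barrier(); cmd is a placeholder command buffer).
VkMemoryBarrier barrier{};
barrier.sType = VK_STRUCTURE_TYPE_MEMORY_BARRIER;
barrier.srcAccessMask = VK_ACCESS_SHADER_WRITE_BIT;
barrier.dstAccessMask = VK_ACCESS_SHADER_READ_BIT | VK_ACCESS_SHADER_WRITE_BIT;

vkCmdPipelineBarrier(cmd,
                     VK_PIPELINE_STAGE_COMPUTE_SHADER_BIT,  // src_stage_mask
                     VK_PIPELINE_STAGE_COMPUTE_SHADER_BIT,  // dst_stage_mask
                     0, 1, &barrier, 0, nullptr, 0, nullptr);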

@AmesingFlank (Collaborator, Author)

Yeah, I saw it. So why is this happening :((

@AmesingFlank (Collaborator, Author) commented Dec 14, 2021

To repro:

import taichi as ti

ti.init(ti.vulkan)

# Parallel sort on field x; each sort_stage launch reads results written by the previous launch.
def parallel_sort(x):
    N = x.shape[0]

    # One stage of the sort: compare-and-swap pairs of elements that are k apart.
    @ti.kernel
    def sort_stage(x:ti.template(), N:int, p:int, k:int, invocations:int):
        for inv in range(invocations):
            j = k%p + inv * 2 * k
            for i in range(0, min(k,N-j-k)):
                a = i+j
                b = i+j+k
                if int(a / (p*2)) == int(b / (p*2)):
                    val_a = x[a]
                    val_b = x[b]
                    if val_a > val_b:
                        x[a] = val_b
                        x[b] = val_a

    p = 1
    while p < N:
        k = p
        while k >= 1:
            invocations = int((N-k-k%p) / (2*k)) + 1
            sort_stage(x,N,p,k, invocations)
            # ti.sync()  # the sort is only correct when this sync is added
            k = int(k/2)
        p = int(p * 2)
  
def test_sort():
    def test_sort_for_dtype(dtype,N):
        x = ti.field(dtype, N)

        @ti.kernel
        def fill():
            for i in x:
                x[i] = ti.random()*N
        
        fill()
        parallel_sort(x)

        x_host = x.to_numpy()
        
        for i in range(N-1):
            assert x_host[i] <= x_host[i+1]

    test_sort_for_dtype(ti.i32,1)
    test_sort_for_dtype(ti.i32,256)
    test_sort_for_dtype(ti.i32,100001)
    test_sort_for_dtype(ti.f32,1)
    test_sort_for_dtype(ti.f32,256)
    test_sort_for_dtype(ti.f32,100001)

test_sort()

@bobcao3 (Collaborator) commented Dec 14, 2021

What if we change the stage masks? Change them to src_stage_mask=BOTTOM_OF_PIPE and dst_stage_mask=TOP_OF_PIPE.
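Applied to a vkCmdPipelineBarrier call like the sketch above, the suggestion would read roughly:

// Suggested variant: widen the execution dependency to the whole pipeline.
vkCmdPipelineBarrier(cmd,
                     VK_PIPELINE_STAGE_BOTTOM_OF_PIPE_BIT,  // src_stage_mask
                     VK_PIPELINE_STAGE_TOP_OF_PIPE_BIT,     // dst_stage_mask
                     0, 1, &barrier, 0, nullptr, 0, nullptr);

(Side note: per the Vulkan spec, TOP_OF_PIPE and BOTTOM_OF_PIPE perform no memory accesses, so access masks paired with them add no memory visibility; they only order execution.)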

@AmesingFlank (Collaborator, Author)

Doesn't seem to fix it :<
