Feature Description
Hi community, following discussion #3965, we plan to contribute a native SYCL backend to llama.cpp.
Motivation
Intel Arc series GPUs provide substantial VRAM capacity and bandwidth, which the current OpenCL backend cannot fully utilize, especially for LLM inference. We expect a significant performance improvement from a native SYCL backend.
Possible Implementation
Native Kernels
We will implement the key GGML operators in SYCL, similar to the approach used for the Metal and Vulkan backends. The steps are outlined below; a minimal sketch of the data-transfer and kernel pattern follows the list:
new backend integration; host-to-device (h2d) and device-to-host (d2h) transfers
oneMKL (DPC++)-based FP32 & FP16 GEMM
native SYCL kernels for de-quantization
native SYCL kernels for other operators
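For illustration only, here is a minimal, self-contained sketch (not llama.cpp code) of the first and last steps: allocating device memory, an h2d copy, a native SYCL kernel, and a d2h copy. The SiLU operator and all names here are chosen just for the example; in the actual backend, FP32/FP16 GEMM would instead be dispatched to oneMKL.

```cpp
// Sketch, assuming a SYCL 2020 compiler (e.g., Intel DPC++) and a visible GPU.
#include <sycl/sycl.hpp>
#include <vector>

int main() {
    sycl::queue q{sycl::gpu_selector_v};   // throws if no GPU device is available

    const size_t n = 1024;
    std::vector<float> host_src(n, 1.0f), host_dst(n);

    // Device allocations (USM) and h2d transfer
    float *dev_src = sycl::malloc_device<float>(n, q);
    float *dev_dst = sycl::malloc_device<float>(n, q);
    q.memcpy(dev_src, host_src.data(), n * sizeof(float)).wait();

    // Native SYCL kernel: element-wise SiLU, x * sigmoid(x), as an example GGML-style op
    q.parallel_for(sycl::range<1>{n}, [=](sycl::id<1> i) {
        float x = dev_src[i];
        dev_dst[i] = x / (1.0f + sycl::exp(-x));
    }).wait();

    // d2h transfer of the result
    q.memcpy(host_dst.data(), dev_dst, n * sizeof(float)).wait();

    sycl::free(dev_src, q);
    sycl::free(dev_dst, q);
    return 0;
}
```

The real backend would follow the same pattern per tensor operation, but with the buffers and queues managed by the GGML backend interface rather than in main().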
Note:
Since llama.cpp is evolving rapidly and new features will probably be supported through CUDA first, we plan to use SYCLomatic to help migrate code from CUDA to SYCL.
As a next stage we plan to introduce a template-based library such as XeTLA, as mentioned in #3965; in this proposal we will focus on native SYCL support.
Summary
We have started working on native SYCL kernels and on enabling a SYCL backend in llama.cpp for Intel GPUs. Please feel free to drop a note. Thanks.