
[Pallas] Introduce make_kernel_from_pallas #6713

Merged: 5 commits merged into master from alanwaketan/pallas_api on Mar 13, 2024

Conversation

alanwaketan (Collaborator)

Summary:
This pull request introduces the make_kernel_from_pallas API, the top-level API for interacting with the Pallas integration. It takes a pallas_call wrapper and turns it into a custom PyTorch op.

Test Plan:
python test/test_pallas.py
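
For illustration, here is a minimal usage sketch of the new API. This is not part of the PR's diff: the torch_xla import path, the add_vectors kernel, and the variable names are assumptions; the output shape/dtype lambda mirrors the one used in the tests.

import jax
from jax.experimental import pallas as pl
import torch
import torch_xla.core.xla_model as xm
# Assumed import path for the new API; adjust to wherever make_kernel_from_pallas is exposed.
from torch_xla.experimental.custom_kernel import make_kernel_from_pallas

def add_vectors_kernel(x_ref, y_ref, o_ref):
  # Pallas kernel body: element-wise add of the two input blocks.
  o_ref[...] = x_ref[...] + y_ref[...]

def add_vectors(x: jax.Array, y: jax.Array) -> jax.Array:
  # The pallas_call wrapper that make_kernel_from_pallas consumes.
  return pl.pallas_call(
      add_vectors_kernel,
      out_shape=jax.ShapeDtypeStruct(x.shape, x.dtype))(x, y)

# Turn the pallas_call wrapper into a custom PyTorch op; the second argument
# maps the torch inputs to the output's (shape, dtype).
pt_add = make_kernel_from_pallas(add_vectors, lambda x, y: (x.shape, x.dtype))

x = torch.arange(8, dtype=torch.float32, device=xm.xla_device())
y = torch.arange(8, dtype=torch.float32, device=xm.xla_device())
out = pt_add(x, y)  # runs the Pallas kernel as a PyTorch/XLA op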

alanwaketan requested review from JackCaoG and qihqi on March 11, 2024 19:04
@@ -56,3 +62,50 @@ def _extract_backend_config(
if op.name == "stablehlo.custom_call":
return op.backend_config.value
return None


def convert_torch_dtype_to_jax(dtype: torch.dtype) -> jnp.dtype:
Collaborator

@qihqi do we already have such a conversion somewhere in torchxla2?

alanwaketan (Collaborator Author)

It's generated by copilot, lol
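
For reference, the obvious shape of such a conversion is a small lookup table; the following is a sketch under that assumption, not necessarily the body that landed in this PR.

import torch
import jax.numpy as jnp

# Illustrative mapping; the dtypes the PR actually exercises are narrower
# (see the TODO about float64/bfloat16/float16 below).
_TORCH_TO_JAX_DTYPE = {
    torch.float32: jnp.float32,
    torch.float64: jnp.float64,
    torch.float16: jnp.float16,
    torch.bfloat16: jnp.bfloat16,
    torch.int32: jnp.int32,
    torch.int64: jnp.int64,
}

def convert_torch_dtype_to_jax(dtype: torch.dtype) -> jnp.dtype:
  # Fail loudly on anything not mapped instead of guessing a default.
  try:
    return _TORCH_TO_JAX_DTYPE[dtype]
  except KeyError:
    raise ValueError(f"Unsupported torch dtype: {dtype}")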

(x.shape, x.dtype))

dtypes = [torch.float32, torch.float]  # TODO: torch.float64, torch.bfloat16, torch.float16 don't work.
Collaborator

why won't bf16 work?

alanwaketan (Collaborator Author)

Mosaic complains. Need to dig more into it.

Comment on lines +141 to +143
import jax
import jax.numpy as jnp
import jax._src.pallas.mosaic.pallas_call_registration
Collaborator

seems like this is repeated in multiple tests, maybe just move it to the top?

alanwaketan (Collaborator Author)

There is a compatibility issue where jax will try to lock the TPU devices if we import it before any PT/XLA computations... I will need to resolve that...
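
In other words, the jax imports stay local to each test so that a PT/XLA computation touches the device first; a rough sketch of that ordering, assuming a trivial warm-up computation:

import torch
import torch_xla.core.xla_model as xm

def run_pallas_test():
  # Touch PT/XLA first so torch_xla claims the TPU devices ...
  _ = torch.ones(1, device=xm.xla_device()) + 1
  # ... and only then import jax, so it does not try to lock them first.
  import jax
  import jax.numpy as jnp
  import jax._src.pallas.mosaic.pallas_call_registration
  return jnp.add(1, 1)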

JackCaoG (Collaborator) left a comment

Do you need this PR in 2.3?

alanwaketan (Collaborator Author)

Do you need this PR in 2.3?

Yea, will also need a couple for the TODOs.

@unittest.skipIf(xr.device_type() != 'TPU', "This test only works on TPU.")
# TODO: This test cannot be run individually; let's fix it.
def test_tpu_custom_call_pallas_wrap_add_payload(self):
import jax
miladm (Collaborator) commented on Mar 12, 2024

I am concerned that JAX-based tests will cause failures due to libtpu version inconsistencies, and in turn CI hiccups. How do we resolve this concern?

alanwaketan (Collaborator Author)

That's resolved in the last PR: #6696



def make_kernel_from_pallas(kernel: Callable, output_shape_dtype_fn: Callable):
# TODO: Maybe we can cache the payload for the same input.
Collaborator

The payload may change if the input is dynamic. We need to confirm this with pallas folks.

alanwaketan (Collaborator Author)

Right, the cache itself should deal with the dynamism.
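
One way to reconcile the caching TODO with the dynamism concern is to key the cache on the input shapes and dtypes, so a new shape simply re-traces into a new entry. A hedged sketch follows; PayloadCache and trace_fn are illustrative names, not code from this PR.

from typing import Callable, Dict, Tuple

import torch

def _payload_key(tensors: Tuple[torch.Tensor, ...]) -> Tuple:
  # Shapes and dtypes determine the lowered Mosaic payload for a static kernel.
  return tuple((tuple(t.shape), t.dtype) for t in tensors)

class PayloadCache:
  """Caches the traced payload per input signature (illustrative only)."""

  def __init__(self, trace_fn: Callable[..., str]):
    self._trace_fn = trace_fn  # lowers the pallas_call wrapper to a payload string
    self._cache: Dict[Tuple, str] = {}

  def get(self, *tensors: torch.Tensor) -> str:
    key = _payload_key(tensors)
    if key not in self._cache:
      # Re-trace when a new shape/dtype combination shows up (the dynamism above).
      self._cache[key] = self._trace_fn(*tensors)
    return self._cache[key]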

alanwaketan force-pushed the alanwaketan/pallas_api branch from 8b8be2e to ae6b62b on March 12, 2024 23:38
alanwaketan (Collaborator Author)

Can I get any reviews?

JackCaoG (Collaborator) left a comment

I still think we should refactor convert_torch_dtype_to_jax and investigate bf16 (which I assume most people will use); approving to unblock.

alanwaketan (Collaborator Author)

I still think we should refactor convert_torch_dtype_to_jax and investigate bf16 (which I assume most people will use); approving to unblock.

Yea, for sure. Let me follow up with that.

alanwaketan merged commit 1bbe333 into master on Mar 13, 2024
19 checks passed
alanwaketan deleted the alanwaketan/pallas_api branch on March 13, 2024 18:39
lsy323 pushed a commit that referenced this pull request Mar 13, 2024
lsy323 added a commit that referenced this pull request Mar 13, 2024
Co-authored-by: Jiewen Tan <jwtan@google.com>