
Lower embedding bag forward only #6951

Merged: bhavya01 merged 2 commits into master from embeddingbag on Apr 22, 2024
Conversation

bhavya01 (Collaborator)

No description provided.

@bhavya01 requested a review from wonjoolee95 on April 22, 2024, 18:57
@bhavya01 closed this on Apr 22, 2024
@bhavya01 reopened this on Apr 22, 2024
@wonjoolee95 (Collaborator) left a comment

LGTM!

@bhavya01 merged commit 46919a4 into master on Apr 22, 2024
21 checks passed
@bhavya01 deleted the embeddingbag branch on April 22, 2024, 22:28
Comment on lines +1293 to +1308
std::tuple<at::Tensor, at::Tensor, at::Tensor, at::Tensor>
XLANativeFunctions::_embedding_bag_forward_only(
    const at::Tensor& weight, const at::Tensor& indices,
    const at::Tensor& offsets, bool scale_grad_by_freq, int64_t mode,
    bool sparse, const c10::optional<at::Tensor>& per_sample_weights,
    bool include_last_offset, int64_t padding_idx) {
  TORCH_LAZY_FN_COUNTER_TIMED_TRACING("xla::");
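  // Configurations not handled by the XLA lowering (mean mode, gradient
  // scaling by frequency, sparse gradients, or an explicit padding_idx)
  // fall back to the CPU implementation below.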
  if (mode == 1 || scale_grad_by_freq || sparse || padding_idx != -1) {
    return at::native::call_fallback_fn<
        &xla_cpu_fallback,
        ATEN_OP(_embedding_bag_forward_only)>::call(weight, indices, offsets,
                                                    scale_grad_by_freq, mode,
                                                    sparse, per_sample_weights,
                                                    include_last_offset,
                                                    padding_idx);
  }
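
For context on the condition above, here is a minimal usage sketch (not part of this PR) of which calls take the new lowering versus the CPU fallback. It assumes F.embedding_bag dispatches to _embedding_bag_forward_only when no gradient is required; names and shapes below are illustrative.

# Minimal sketch: sum mode avoids the fallback, mean mode (mode == 1) does not.
import torch
import torch.nn.functional as F
import torch_xla.core.xla_model as xm

device = xm.xla_device()
weight = torch.randn(10, 4, device=device)
indices = torch.tensor([1, 2, 4, 5, 4, 3, 2, 9], device=device)
offsets = torch.tensor([0, 4], device=device)

with torch.no_grad():
    # Dense sum mode, no scale_grad_by_freq, default padding_idx:
    # the fallback condition is false, so the new XLA lowering runs.
    out_sum = F.embedding_bag(indices, weight, offsets, mode="sum")

    # Mean mode maps to mode == 1, so this call goes through xla_cpu_fallback.
    out_mean = F.embedding_bag(indices, weight, offsets, mode="mean")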
Collaborator

@bhavya01 Is there a reason we lower only _embedding_bag_forward_only instead of also lowering _embedding_bag? And about the fallback condition: is there a specific reason we are not lowering those cases, too?

Collaborator

If I recall correctly, the code for the rest of _embedding_bag was overly complicated, so we deemed it out of scope for this PR. @bhavya01, please correct me if I'm wrong.

Collaborator Author

That's right! We still need to lower _embedding_bag.

Collaborator

Hey @ysiraichi, would lowering _embedding_bag entirely be something that you need?

Collaborator

Well, it would be nice to have that. I won't be working on it right now, since I have higher-priority things to do. Anyway, I have this draft branch that can help us lower and maintain composite operations. It mainly does 2 things:

  • Allow us to write lowerings for composite operations in Python
  • Check for an implemented decomposition (PyTorch or PyTorch/XLA) whenever we hit the fallback function; if one is found, use it to decompose the operation into operations that are possibly already lowered (see the sketch below)
    • At the moment, PyTorch decompositions are only used in dynamo
    • This would allow us to use PyTorch decompositions on non-dynamo experiments
    • It would also allow us to use PyTorch/XLA-specific decompositions on both dynamo and non-dynamo experiments
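
To make the second bullet concrete, here is a rough, hypothetical sketch of a fallback-time decomposition lookup; the names maybe_decompose and xla_decomposition_table are illustrative and not taken from the draft branch.

# Hypothetical sketch of checking for a registered decomposition before
# resorting to the CPU fallback; not the draft branch's actual API.
import torch
from torch._decomp import decomposition_table  # PyTorch's registered decompositions

# Imagine PyTorch/XLA-specific decompositions (written in Python) living here.
xla_decomposition_table = {}

def maybe_decompose(op, *args, **kwargs):
    # Prefer a PyTorch/XLA-specific decomposition, then a stock PyTorch one,
    # so the op is re-expressed in terms of operations that may already lower.
    decomp = xla_decomposition_table.get(op) or decomposition_table.get(op)
    if decomp is not None:
        return decomp(*args, **kwargs)
    # No decomposition found: the caller proceeds to the existing CPU fallback.
    return None

# Lookups would be keyed by the ATen OpOverload, e.g.:
print(torch.ops.aten._embedding_bag_forward_only.default in decomposition_table)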

I won't be working on that draft PR for a while, so if anyone wants to take it over, I don't mind. If it gets merged, we would probably have an easier time lowering operations.
