[ROCm][Codegen] llama 8b fp8 with attention segfault #19921
Comments
Can you share the full stacktrace? I only see Edit: ah, it's in the first gist link. Thanks.
This is the exact dispatch that is failing: https://gist.github.com/pashu123/0b261b96af91e893e055c662d9e8079b. I'm looking into it.
Possibly unrelated to the crash, but there shouldn't be any collapse/expand ops in the dispatch like that.
Oh! I just ran the compile command and dumped the dispatches. Could you take a look at that part? The above dispatch is not running because of
I just opened a PR that clears the tensor reshape ops, but I'm still hitting the same error with the attention dispatch.
After applying llvm/llvm-project#126102 this dispatch is still failing with
@krzysz00 Do you have any suggestions for this?
I had a chat with @krzysz00 offline. We need to create another pass, similar to https://github.com/iree-org/iree/blob/main/compiler/src/iree/compiler/Codegen/Common/ConvertBf16ArithToF32.cpp, that handles operations like arith.negf on fp8 types.
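To make the idea concrete, here is a rough sketch of the rewrite such a pass would presumably perform: extend the fp8 operand to f32, run the op in f32, and truncate the result back. This is not IREE or MLIR code. The `Op` record, the `FP8_TYPES` set, and `widen_fp8_op` are hypothetical names for a toy IR, used only to show the shape of the transformation.

```python
from dataclasses import dataclass
from typing import List

# Assumption: an illustrative set of fp8 element type names; the real pass
# would decide for itself which types to widen.
FP8_TYPES = {"f8E4M3FNUZ", "f8E5M2FNUZ", "f8E4M3FN", "f8E5M2"}

# Cast ops must stay in their original types, so they are never widened.
CAST_OPS = {"arith.extf", "arith.truncf"}


@dataclass(frozen=True)
class Op:
    """Toy unary op (hypothetical, not an MLIR data structure)."""
    name: str     # e.g. "arith.negf"
    operand: str  # SSA-style name of the input value
    result: str   # SSA-style name of the output value
    type: str     # element type of the result


def widen_fp8_op(op: Op) -> List[Op]:
    """Rewrite a unary fp8 op as extf -> same op in f32 -> truncf."""
    if op.type not in FP8_TYPES or op.name in CAST_OPS:
        return [op]  # nothing to do for non-fp8 ops or casts
    wide_in = f"{op.operand}_f32"
    wide_out = f"{op.result}_f32"
    return [
        Op("arith.extf", op.operand, wide_in, "f32"),     # fp8 -> f32
        Op(op.name, wide_in, wide_out, "f32"),            # compute in f32
        Op("arith.truncf", wide_out, op.result, op.type),  # f32 -> fp8
    ]


if __name__ == "__main__":
    for new_op in widen_fp8_op(Op("arith.negf", "%x", "%y", "f8E4M3FNUZ")):
        print(new_op)
```

The original result name and type are preserved on the final truncf, so users of the value would not need to change. A real pass would presumably do this through MLIR's type conversion and rewrite pattern infrastructure rather than by renaming values.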
New codegen issue llama_f8_attn_bug_log_0213.txt after I rebase iree to
Should we create a new issue or continue in this one? @pashu123
Please create a new issue! The error is from the vector distribute pipeline.
Filed a new issue for the vector distribute pipeline: #19991
What happened?
iree compile bug llama_8b_fp8_atten_iree-comiple_gdbbug.txt
gdb shows the bug here, so I guess it's a ROCm codegen issue
Steps to reproduce your issue
Run the following command:
What component(s) does this issue relate to?
Compiler
Version information
commit 3f713f5 (HEAD -> main, upstream/main)
Author: Jakub Kuderski jakub@nod-labs.com
Date: Wed Jan 29 12:49:42 2025 -0500
Additional context
No response