fix: adds causal to attention params #2408

Merged on Aug 13, 2024 (1 commit)

drbh (Collaborator) commented on Aug 13, 2024

This PR adds a causal=None parameter to the attention function used with flash attention v1.

Accepting the parameter avoids throwing when callers pass causal. It also lets the function return clearer errors when flash attention v1 is used with window_size_left != -1 or with softcap set to something other than None.
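
The behaviour described above can be illustrated with a minimal sketch. This is not the code from the PR: the function name check_flash_v1_params, the NotImplementedError exception type, and the error messages are all assumptions made for illustration, and the real attention function also takes tensor arguments that are omitted here.

```python
from typing import Optional


def check_flash_v1_params(
    window_size_left: int = -1,
    causal: Optional[bool] = None,
    softcap: Optional[float] = None,
) -> None:
    """Hypothetical guard for a flash attention v1 code path.

    `causal` is accepted so that callers passing it do not hit a
    TypeError for an unexpected keyword argument. In this sketch it is
    not otherwise used (assumption).
    """
    # Sliding-window attention is assumed to be unsupported on v1.
    if window_size_left != -1:
        raise NotImplementedError(
            "window_size_left is not supported with flash attention v1"
        )
    # Logit soft-capping is assumed to be unsupported on v1.
    if softcap is not None:
        raise NotImplementedError(
            "softcap is not supported with flash attention v1"
        )


# Passing causal no longer fails; unsupported options fail with a clear message.
check_flash_v1_params(causal=True)
try:
    check_flash_v1_params(window_size_left=128)
except NotImplementedError as err:
    print(f"rejected: {err}")
```

The point of the design is that an explicit check produces an error naming the unsupported option, instead of a generic failure about an unexpected keyword argument.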

Narsil merged commit 1cebccc into main on Aug 13, 2024 (11 checks passed)
Narsil deleted the add-causal-param-for-flash-v1-check branch on August 13, 2024 14:19
yuanwu2017 pushed a commit to yuanwu2017/tgi-gaudi that referenced this pull request on Sep 26, 2024:
fix: adds causal to attention params to check when using flash attn v1