
[Bugfix] Clean up some cruft in mamba.py #9343

Merged (4 commits, Oct 15, 2024)

Conversation

tlrmchlsmth
Collaborator

Deletes some unnecessary code I noticed while reviewing #7478.


👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, they only run fastcheck CI, which runs a small, essential subset of CI tests to quickly catch errors. You can run additional CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

  • Add the ready label to the PR
  • Enable auto-merge

🚀

@DarkLight1337 DarkLight1337 enabled auto-merge (squash) October 14, 2024 14:59
@github-actions github-actions bot added the "ready" label (ONLY add when PR is ready to merge/full CI is needed) on Oct 14, 2024
Review comments on vllm/model_executor/models/mamba.py (outdated; resolved)
@tlrmchlsmth
Collaborator Author

Ok, this was extremely boneheaded -- just removed an entire feed-forward section that I had copy-pasted from Jamba, which isn't present in the Mamba model.
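To illustrate the point above: Jamba-style decoder layers pair the mixer with a separate feed-forward sub-block, while a pure Mamba layer has no feed-forward at all, so a copy-pasted FFN in mamba.py was dead code. A minimal sketch (class and parameter names here are illustrative, not vLLM's actual classes):

```python
class JambaLayer:
    """Jamba-style layer: mixer followed by a feed-forward sub-block."""
    def __init__(self, mixer, feed_forward, norm1, norm2):
        self.mixer, self.feed_forward = mixer, feed_forward
        self.norm1, self.norm2 = norm1, norm2

    def __call__(self, x):
        x = x + self.mixer(self.norm1(x))           # mixer sub-block
        return x + self.feed_forward(self.norm2(x))  # FFN sub-block

class MambaLayer:
    """Pure Mamba layer: just the mixer and its norm, no FFN."""
    def __init__(self, mixer, norm):
        self.mixer, self.norm = mixer, norm

    def __call__(self, x):
        return x + self.mixer(self.norm(x))          # no FFN at all
```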

@mgoin (Collaborator) left a comment

Now that is clean 😎

@@ -332,40 +282,20 @@ def forward(
             current_ssm_state = ssm_state[i]
             current_conv_state = conv_state[i]

-            hidden_states, residual = layer(
+            hidden_states = layer(
                 positions=positions,
                 hidden_states=hidden_states,
                 attn_metadata=attn_metadata,
                 residual=residual,
Collaborator

Residual is still passed in here. I personally think it is fine to keep the previous structure of passing residual into rmsnorm, but it's up to you.

Collaborator Author

I saw modeling_mamba.py do it this way and liked that it's super clean, but you're right -- I'm going to revert to the previous way so that we fuse. Side note: this is the kind of optimization we should be relying on torch.compile to do for us, IMO.
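The fusion being discussed can be sketched as follows: instead of adding the residual outside the norm and then normalizing, the residual add is passed into the norm call so a fused kernel can do both in one pass, returning the updated residual for the next layer. This is a minimal NumPy sketch of that pattern; the function name, signature, and eps default are illustrative, not vLLM's actual RMSNorm API:

```python
import numpy as np

def rms_norm(x, weight, residual=None, eps=1e-6):
    """RMSNorm with an optional fused residual add (illustrative sketch).

    With residual=None, this is plain RMSNorm. With a residual, the add is
    folded into the same call and the pre-norm sum is returned alongside
    the output, mirroring the structure the PR reverts to.
    """
    if residual is not None:
        x = x + residual      # fused residual add
        residual = x          # carry the pre-norm sum to the next layer
    variance = np.mean(x * x, axis=-1, keepdims=True)
    out = x / np.sqrt(variance + eps) * weight
    return out if residual is None else (out, residual)
```

With this shape, each decoder layer can call `hidden_states, residual = norm(hidden_states, weight, residual)`, which is equivalent to the unfused add-then-normalize but leaves both operations in one call site for a fused kernel.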

@tlrmchlsmth tlrmchlsmth enabled auto-merge (squash) October 14, 2024 19:10
@tlrmchlsmth tlrmchlsmth merged commit 169b530 into vllm-project:main Oct 15, 2024
56 checks passed
3 participants