Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

nf4_detach missing args error when calling fully_shard on latest nightly #380

Closed
ebsmothers opened this issue Jun 15, 2024 · 4 comments
Closed
Assignees
Labels
bug Something isn't working

Comments

@ebsmothers
Copy link
Contributor

Hi, I am seeing the error TypeError: nf4_detach() missing 1 required positional argument: 'args' when running torchtune's QLoRA FSDP2 recipe on the latest ao nightlies. Full stack trace

Repro:

tune download meta-llama/Llama-2-7b-hf --output-dir /tmp/Llama-2-7b-hf --hf-token <HF_TOKEN>
tune run --nnodes 1 --nproc_per_node 4 lora_finetune_fsdp2 --config llama2/7B_qlora enable_activation_checkpointing=False

Looks like it's happening when we try to call fully_shard on a TransformerDecoderLayer. When I downgrade to e.g. the 6/8 nightly things seem to work.

@msaroufim msaroufim added the bug Something isn't working label Jun 15, 2024
@msaroufim
Copy link
Member

msaroufim commented Jun 15, 2024

This might have been caused by #360 - cc @jerryzh168
@weifengpy did we ever end up merging some sanity tests in ao to detect these kinds of breakages cause relying purely on tune for testing seems less than ideal?

@weifengpy
Copy link
Contributor

good catch! @ebsmothers

@msaroufim you are right. I will also check FSDP e2e in TorchAO just like how we do it in TorchTune

This might have been caused by #360 - cc @jerryzh168 @weifengpy did we ever end up merging some sanity tests in ao to detect these kinds of breakages cause relying purely on tune for testing seems less than ideal?

@supriyar
Copy link
Contributor

@weifengpy can we close this issue now?

@weifengpy
Copy link
Contributor

@weifengpy can we close this issue now?

Yes, we can close it as it's landed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants