-
Notifications
You must be signed in to change notification settings - Fork 28.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
run_summarization_no_trainer #18189
Comments
Did you run |
accelerate env Copy-and-paste the text below in your GitHub issue
accelerate test Running: accelerate-launch --config_file=None /home/arij/anaconda3/envs/sum/lib/python3.9/site-packages/accelerate/test_utils/test_script.py The combined stderr from workers follows: |
That was a typo, sorry. You need to run |
one of the questions is Do you want to use DeepSpeed? [yes/NO]: |
could you please send any link that helps how to figure the questionaire using deepspeed? |
Any way these are my steps
Still something wrong) |
I think there should be full instructions on how to use accelerate , it is not clear. Thanks for your reply |
Interesting that I was facing the exact same issue right now. The fix for me was to pass the local config I created.
|
@soumyasanyal could you please tell the steps I am absolutely new) or post your config |
Sure! I just followed the steps in this link. The steps I followed are:
My config file is as follows (but it can change as per your requirements. I just wanted to run a job on 8 GPUs in a single node, without DeepSpeed or mixed precision):
I was previously running Also FYI, note that the doc says that integration of accelerate with DeepSpeed is experimental. |
@sgugger sorry for reopenning the issue while using this script using T5 over cnn-dialy dataset
I got the error
For this error replacing gather_for_metrics with just with this configuration
I get this error
|
@Arij-Aladel in this case you should reduce your batch size most likely, but I'll be running it myself in just a moment |
I did already still problem of not finding gather_for_metric attribute |
You can simply run the example as is |
Thanks @Arij-Aladel, I think I have found the fix. Can you try running the following training script on your end to verify? (I have wget to make your life easy): (Also as mentioned in the other post please make sure you have a pypi version of accelerate >= 0.12.0 to run the scripts, a PR was just merged yesterday to make them a requirement for all these scripts) wget https://raw.githubusercontent.com/huggingface/transformers/muellerzr-fix-no-trainer/examples/pytorch/summarization/run_summarization_no_trainer.py |
@muellerzr thanks for your response! As I understand your fix is just deleting this line
my life with wget was not easier)))
|
@Arij-Aladel yes the fix got merged yesterday, you can find it here: https://github.com/huggingface/transformers/blob/main/examples/pytorch/summarization/run_summarization_no_trainer.py I would highly recommend doing |
Ok seems it was package installation issue after your fix, I have uninstalled all packages then reinstall packages according to requirements file. It works now thanks @muellerzr |
Great! Can this be closed now @Arij-Aladel? :) |
Yes , thanks . I am closing it. |
@sgugger Hello! I just tried to run the code to explore this example https://github.com/huggingface/transformers/blob/main/examples/pytorch/summarization/run_summarization_no_trainer.py
this is my yml file to build the env
then pip install rouge-score
after that simply I ran thhe command
accelerate launch run_summarization_no_trainer.py --model_name_or_path t5-small --dataset_name cnn_dailymail --dataset_config '3.0.0' --source_prefix 'summarize: ' --output_dir output/tst-summarization
and got the error
How to fix it?
The text was updated successfully, but these errors were encountered: