
[shardformer] write a shardformer example with bert finetuning #4111

Conversation

flybird11111
Contributor

📌 Checklist before creating the PR

  • I have created an issue for this PR for traceability
  • The title follows the standard format: [doc/gemini/tensor/...]: A concise description
  • I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

Link this PR to your issue with words like fixed to automatically close the linked issue upon merge

e.g. fixed #1234, closed #1234, resolved #1234

#4110

📝 What does this PR do?

Summarize your work here.
If you have any plots/diagrams/screenshots/tables, please attach them here.

Write a shardformer example with BERT fine-tuning.

💥 Checklist before requesting a review

  • I have linked my PR to an issue (instruction)
  • My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
  • I have performed a self-review of my code
  • I have added thorough tests
  • I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

  • 🌝 Yes, I do.
  • 🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

@flybird11111 flybird11111 added the example (example-related issue or pull request) label Jun 28, 2023
@FrankLeeeee FrankLeeeee linked an issue Jun 28, 2023 that may be closed by this pull request
@@ -6,3 +6,5 @@ pip install -r requirements.txt
for plugin in "torch_ddp" "torch_ddp_fp16" "gemini" "low_level_zero"; do
torchrun --standalone --nproc_per_node 4 finetune.py --target_f1 0.86 --plugin $plugin --model_type "bert"
done

torchrun --standalone --nproc_per_node=1 shardformer_benchmark.py
Contributor

@FrankLeeeee FrankLeeeee Jun 30, 2023


In order to test the sharding, please use more than 1 GPU and set a small number of iterations so that it can finish running within 1-2 minutes.
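The reviewer's suggestion, capping the iteration count so a multi-GPU sharding test finishes quickly, can be sketched as a small wrapper around the training loop. This is a minimal illustrative sketch, not Colossal-AI's actual API; `step_fn` and `max_iters` are hypothetical names.

```python
# Hypothetical sketch: limit the number of fine-tuning iterations so a
# sharding smoke test completes within 1-2 minutes, per the review comment.
def run_epoch(dataloader, step_fn, max_iters=None):
    """Run at most `max_iters` training steps over `dataloader`.

    `step_fn` stands in for one forward/backward/optimizer step;
    `max_iters=None` means run the full epoch.
    """
    steps = 0
    for batch in dataloader:
        if max_iters is not None and steps >= max_iters:
            break  # stop early so the CI/test run stays short
        step_fn(batch)
        steps += 1
    return steps


# Example: a 10-step smoke run over a 1000-batch "epoch".
completed = run_epoch(range(1000), step_fn=lambda batch: None, max_iters=10)
print(completed)  # → 10
```

In the actual example script this would correspond to passing a small step budget (e.g. via a CLI flag) when launching `torchrun` across multiple GPUs.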

Labels
example (example-related issue or pull request)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[shardformer] write a shardformer example with bert finetuning
2 participants