support effective tokens calculation on sft/dpo #6078

wtmlon · 2024-11-19T09:47:59Z

What does this PR do?

support efficient tokens calculation on sft and dpo.

hiyouga

Thanks for adding this feature, please see the above comments

src/llamafactory/train/dpo/workflow.py

hiyouga

LGTM

support efficient tokens calculation on sft/dpo

b9f0028

wtmlon changed the title ~~support efficient tokens calculation on sft/dpo~~ support effective tokens calculation on sft/dpo Nov 19, 2024

wtmlon added 2 commits November 19, 2024 19:10

update

ef6e145

update

f566ecc

hiyouga requested changes Nov 19, 2024

View reviewed changes

src/llamafactory/train/dpo/workflow.py Outdated Show resolved Hide resolved

src/llamafactory/train/dpo/workflow.py Show resolved Hide resolved

code refactor

40627c6

hiyouga approved these changes Nov 20, 2024

View reviewed changes

wtmlon temporarily deployed to tests November 20, 2024 04:11 — with GitHub Actions Inactive

hiyouga merged commit bd639a1 into hiyouga:main Nov 20, 2024
12 checks passed

hiyouga added the solved This problem has been already solved label Nov 20, 2024