
[BUG] finetune w/o lora, inference gets error: Negative code found: {codes} #370

Open
sairin1202 opened this issue Jul 10, 2024 · 5 comments
Labels
bug (Something isn't working), stale

Comments

@sairin1202

With a checkpoint trained without lora, loading it for inference gives Negative code found: {codes}; 0s appear in the generated codes, even though the training loss looks normal.

If I instead use a model trained with lora and then merged, everything works fine.

sairin1202 added the bug label on Jul 10, 2024
@yy524

yy524 commented Jul 11, 2024

I'm running into the same problem. Have you found the cause?

@sairin1202
Author

Not yet...

@Jielin-Qiu

Same issue here. Pretraining from scratch or finetuning (not lora) on a new dataset --> Negative code found: {codes}

So far:

  • official weight --> ok
  • official weight + lora --> ok
  • our pretrain weight --> error
  • our pretrain weight + lora --> error
  • official weight + finetune on our data (not lora) --> error

Some examples:

  • Pretrain from scratch - Step 50k: Audio
  • Finetune (not lora) - Step 10k: Audio

I am able to detect that there are indeed negative values:
Negative values found: tensor([-1, -1, -1, -1, -1, -1, -1, -1], device='cuda:0', dtype=torch.int32)

Not sure if it is an error in the training pipeline; my guess is that -1 shouldn't be decoded as output?
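For reference, a minimal sketch of that check (the tensor contents and the treatment of -1 as a padding/EOS sentinel are assumptions, not the project's actual pipeline):

```python
import torch

# Illustrative stand-in for the generated semantic codes ([codebooks, time]);
# the real tensor comes from the model's sampling step.
codes = torch.tensor([[12, 87, -1, -1],
                      [34, 56, -1, -1]], dtype=torch.int32)

negative = codes < 0
if negative.any():
    print("Negative values found:", codes[negative])
    # Guess/workaround only: treat -1 as padding/EOS and drop those frames
    # instead of handing them to the codec decoder.
    keep = (codes >= 0).all(dim=0)
    codes = codes[:, keep]

print("Codes passed to the decoder:", codes)
```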

@liulangdeyeshou

I ran into the same problem a few days ago. It went away after I changed n_local_heads in config.json. My guess is that after I changed the network structure, some parameters no longer matched, which corrupted the inference results. I noticed this while using lora to re-finetune pretrain weights I had trained myself. You could try checking the parameters in your config.json.
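A rough sketch of that kind of sanity check (the file paths, config keys, and the `attn` name filter are assumptions about the layout, not the project's actual API):

```python
import json
import torch

# Assumed file names and key names for illustration; adjust to your own layout.
with open("checkpoints/config.json") as f:
    config = json.load(f)

ckpt = torch.load("checkpoints/model.pth", map_location="cpu")
state_dict = ckpt.get("state_dict", ckpt) if isinstance(ckpt, dict) else ckpt

# Head-related fields (e.g. n_local_heads) are what went stale in this case.
print({k: v for k, v in config.items() if "head" in k or "dim" in k})

# Attention projection shapes depend on the head counts, so a mismatched
# config usually shows up here as an unexpected tensor shape.
for name, tensor in state_dict.items():
    if "attn" in name:
        print(name, tuple(tensor.shape))
```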


This issue is stale because it has been open for 30 days with no activity.

github-actions bot added the stale label on Sep 16, 2024