
Asking for questions about evaluation #6

Open · mengmeng18 opened this issue Sep 26, 2023 · 10 comments

Comments

@mengmeng18 commented Sep 26, 2023

Thanks for your great work! I ran into an issue during testing.
When running `python main.py --function test --config configs/cub_stage2.yml --opt "{'test': {'load_token_path': 'ckpts/cub983/tokens/', 'load_unet_path': 'ckpts/cub983/unet/', 'save_log_path': 'ckpts/cub983/log.txt'}}"` for evaluation, I found that self.step_store, self.attention_store, and self.attention_maps are all empty. Could you tell me where the problem is?
Looking forward to your reply!

@callsys (Owner) commented Sep 26, 2023

The most likely reason is that the register_attention_control function at line 67 of attn.py is not working properly.
At line 115 of attn.py, we replace the get_attention_scores method of every CrossAttention module in the UNet. A different diffusers version may ship a CrossAttention module that no longer has a get_attention_scores method, in which case nothing gets recorded. So the problem you describe is most likely caused by an incompatible diffusers version.
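For reference, the patching pattern looks roughly like this (a hedged sketch of the idea, not the repo's exact code; it assumes diffusers==0.13.1, and the is_cross heuristic and the place naming are my own assumptions):

```python
# Sketch: swap get_attention_scores on every CrossAttention module so the
# controller sees each attention map. Assumes diffusers==0.13.1.
from diffusers.models.cross_attention import CrossAttention

def register_attention_control_sketch(controller, unet):
    def make_patch(module, place_in_unet):
        original = module.get_attention_scores

        def get_attention_scores(query, key, attention_mask=None):
            attention_probs = original(query, key, attention_mask)
            # Heuristic: cross-attention keys come from the 77-token text
            # sequence, so their length differs from the query's.
            is_cross = query.shape[1] != key.shape[1]
            # The controller records the map (filling step_store etc.)
            # and returns it unchanged.
            return controller(attention_probs, is_cross, place_in_unet)

        return get_attention_scores

    patched = 0
    for name, module in unet.named_modules():
        if isinstance(module, CrossAttention):
            place = "down" if "down" in name else ("up" if "up" in name else "mid")
            module.get_attention_scores = make_patch(module, place)
            patched += 1
    # patched == 0 means this diffusers version has no CrossAttention with
    # get_attention_scores, and the stores will stay empty.
    print(f"Patched {patched} CrossAttention modules")
```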

@mengmeng18 (Author)

Thanks a lot! Would you please tell me how to fix this error?

@callsys (Owner) commented Sep 26, 2023

1. Try `pip install --upgrade diffusers[torch]==0.13.1`, which is the version we use.
2. Check whether the code actually runs through the get_attention_scores method at line 71 of attn.py. This method adds attention maps to self.step_store, self.attention_store, and self.attention_maps. A quick sanity check is sketched below.
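Something like this (my own debugging sketch, assuming the standard Stable Diffusion 1.x UNet; swap in whatever checkpoint main.py actually loads) tells you whether the patch can attach at all:

```python
from diffusers import UNet2DConditionModel

# Assumed checkpoint for illustration; use the one your config points to.
unet = UNet2DConditionModel.from_pretrained(
    "CompVis/stable-diffusion-v1-4", subfolder="unet"
)

total = patchable = 0
for _, module in unet.named_modules():
    # Check by class name so this also runs on newer diffusers,
    # where CrossAttention was renamed to Attention.
    if type(module).__name__ in ("CrossAttention", "Attention"):
        total += 1
        patchable += hasattr(module, "get_attention_scores")
print(f"{patchable}/{total} attention modules expose get_attention_scores")
# 0/N would explain the empty stores: the patch in attn.py never attaches.
```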

@mengmeng18 (Author) commented Sep 26, 2023

1. I have checked that the version is diffusers[torch]==0.13.1.
2. The code runs AttentionStore.register_attention_control(controller, unet) at line 227 of main.py, and it does run through the get_attention_scores method at line 71 of attn.py. However, after running these lines, I find that self.step_store, self.attention_store, and self.attention_maps are still empty.
Could you give me some other advice to help me fix this error?

@callsys (Owner) commented Sep 26, 2023

At line 106 of attn.py, `attention_probs = controller(attention_probs, is_cross, place_in_unet)` is what adds attention_probs to self.step_store inside the controller. You can check whether the code actually reaches this line.
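Roughly, the controller call does the following (paraphrased from the standard prompt-to-prompt AttentionStore pattern; the attribute names mirror this thread, and the repo's actual code may differ):

```python
class AttentionStore:
    def __init__(self):
        self.step_store = self.get_empty_store()   # maps for the current step
        self.attention_store = {}                  # maps accumulated over steps
        self.attention_maps = {}                   # e.g. averaged maps

    @staticmethod
    def get_empty_store():
        return {f"{place}_{kind}": []
                for place in ("down", "mid", "up")
                for kind in ("cross", "self")}

    def __call__(self, attention_probs, is_cross, place_in_unet):
        key = f"{place_in_unet}_{'cross' if is_cross else 'self'}"
        # This append is what fills step_store; if __call__ is never
        # reached, all three stores stay empty.
        self.step_store[key].append(attention_probs)
        return attention_probs
```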

@mengmeng18 (Author)

Thanks a lot! I will check it again.

@KevinLi-167

I'm having a similar issue.

I use the CUB dataset and reduced the batch size for the two-stage training.

For train_token I use the default float32.
Since I only have 8 GB of GPU memory, I changed train_unet to float16 with batch_size=1.
By default, float16 is used for inference.

After 250 steps of training, inference raises an error.
The cause is that the CAM-like attention map contains NaN values.
I traced it back to the CLIP output: the last 4 of the 6 fr embeddings here are all NaN.

[Screenshots: "bug of clip nan2", "bug of clip nan3"]
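For anyone debugging the same thing, a tiny helper like this (a generic sketch; `fr` names the embedding tensor discussed above) can locate where the NaNs first appear:

```python
import torch

def report_nans(name, tensor):
    """Print how many entries of `tensor` are NaN, if any."""
    n = torch.isnan(tensor).sum().item()
    if n:
        print(f"{name}: {n}/{tensor.numel()} values are NaN")

# e.g. right after the text encoder / token lookup:
# report_nans("fr (representative embeddings)", fr)
```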

@callsys (Owner) commented Apr 11, 2024

Since CLIP (the text encoder) is frozen the whole time, it seems there is a problem with the representative embeddings trained in stage 1. Does the model you trained in stage 1 produce NaNs?

Besides, the model requires a large batch size for stage 2 training. If your machine does not have enough memory, using a large gradient accumulation works fine; see the sketch below.
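A minimal gradient-accumulation sketch (generic PyTorch with toy stand-ins for the repo's model, data, and loss; only the accumulation pattern is the point):

```python
import torch

# Hypothetical stand-ins: replace with the repo's UNet, dataloader, and loss.
model = torch.nn.Linear(8, 1)
data = [(torch.randn(1, 8), torch.randn(1, 1)) for _ in range(64)]
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

accum_steps = 16  # effective batch = per-step batch (1) x accum_steps
optimizer.zero_grad()
for i, (x, y) in enumerate(data):
    loss = torch.nn.functional.mse_loss(model(x), y)
    (loss / accum_steps).backward()   # scale so accumulated grads average
    if (i + 1) % accum_steps == 0:    # step only every accum_steps batches
        optimizer.step()
        optimizer.zero_grad()
```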

@KevinLi-167

Thank you for your reply!
I did confirm the problem is with fr.
(I also just found out that the loss is always NaN in the UNet training log, so my stage 2 may be completely invalid as well.)
I'm already retrying the training.
(At stage 1 I can't use float16 because the loss becomes NaN, so I still use float32.)

I'd like to confirm that fr relies only on the first stage, train_token, right?
(fr is then used as frozen content for the subsequent UNet training and inference.)

I have one more question: z0 in the paper is encoded by a VQGAN, but a VAE is used in the code.
What is the possible reason for changing the image encoder from VQGAN to VAE, i.e., why does it differ from the paper?

Thanks again for your reply. I'll read the source code carefully again and try to train.

@callsys (Owner) commented Apr 12, 2024

1. fr relies only on train_token.

2. VQGAN is an improved version of VAE, and they are similar in structure; see the encoding sketch below.
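For context, obtaining z0 with the diffusers VAE typically looks like this (a sketch assuming diffusers==0.13.1 and the Stable Diffusion 1.x AutoencoderKL; the checkpoint and scaling used in this repo may differ):

```python
import torch
from diffusers import AutoencoderKL

# Assumed SD 1.x VAE checkpoint for illustration.
vae = AutoencoderKL.from_pretrained("CompVis/stable-diffusion-v1-4", subfolder="vae")
image = torch.randn(1, 3, 512, 512)  # stand-in for a normalized [-1, 1] image
with torch.no_grad():
    # Encode to the latent z0; 0.18215 is SD's usual latent scaling factor.
    z0 = vae.encode(image).latent_dist.sample() * 0.18215
print(z0.shape)  # torch.Size([1, 4, 64, 64])
```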
