When using temporal context network, my labeled data is the first or last frame, is this okay? #203

Open
wyclearnpy opened this issue Oct 11, 2024 · 6 comments

@wyclearnpy

When I use the temporal context network, some of my labeled frames are the first or last frame of the video, so they cannot satisfy the [t-2, t-1, t, t+1, t+2] context window. Do I need to modify my data, or is there another solution?
The terminal prints the following error:
RuntimeError: Not enough valid frames to make a context representation.

@themattinthehatt
Collaborator

@wyclearnpy there shouldn't be any issue with your labeled data. If you are using frame 0, for example, the code will automatically load frames [0, 0, 0, 1, 2]. If you are using the last frame (say n), the code will automatically load frames [n-2, n-1, n, n, n].
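
To illustrate, here's a rough sketch of that boundary handling (not the actual library code; the function name is made up for illustration):

```python
def context_frame_indices(t, n_frames, width=2):
    # Build [t-2, t-1, t, t+1, t+2] and clamp out-of-range indices to the
    # first/last valid frame, so boundary frames are simply repeated.
    return [min(max(i, 0), n_frames - 1) for i in range(t - width, t + width + 1)]

print(context_frame_indices(0, 100))   # frame 0  -> [0, 0, 0, 1, 2]
print(context_frame_indices(99, 100))  # frame 99 -> [97, 98, 99, 99, 99]
```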

The error you're getting appears to come from the unlabeled data - are you fitting a semi-supervised model? If so, I would first recommend fitting a supervised context model to make sure the above error does not appear (you don't need to train it fully). If that works, then the issue is that your unlabeled batch size is too small. Since the context model requires two frames before and after the frame being processed, you'll get this error if your unlabeled batch size (in the config file under dali.context.train.batch_size) is <= 4. So it needs to be at least 5.
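
As a quick sanity check, you could validate that constraint before launching training - a rough sketch, where "config.yaml" is just a placeholder for your own config file:

```python
import yaml

CONTEXT_WIDTH = 2  # two frames on each side of the processed frame

with open("config.yaml") as f:  # path to your config file
    cfg = yaml.safe_load(f)

batch_size = cfg["dali"]["context"]["train"]["batch_size"]
min_batch = 2 * CONTEXT_WIDTH + 1  # [t-2, t-1, t, t+1, t+2] -> 5
if batch_size < min_batch:
    raise ValueError(
        f"dali.context.train.batch_size={batch_size} is too small for a "
        f"context model; it must be at least {min_batch}."
    )
```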

I'll make that error message more descriptive, thanks for the flag. Let me know how it goes!

@themattinthehatt
Collaborator

@wyclearnpy just wanted to check in on this and see if you were able to train with an increased unlabeled batch size?

@wyclearnpy
Author

Yes, increasing it to 5 allowed training to run successfully; the problem was mainly insufficient device memory. I'm really looking forward to multi-GPU training support. Additionally, the semi-supervised model seems to perform slightly worse than the supervised one. What could be the reasons for this? In the semi-supervised setup my image size is 256, while in the supervised setup it is 384.
[attached image: labeled_img]

@themattinthehatt
Collaborator

@wyclearnpy glad you were able to get the semi-supervised model training. One comment: to really compare the semi-supervised and supervised models you should set the resizing to be the same, otherwise the comparison will be confounded by that (very important) factor. So maybe you could try supervised vs semi-supervised on 256x256 frames first, just to get an idea of how they compare?

I have a few other questions:

  • how big are your original images?
  • how much GPU memory do you have?
  • how many labeled frames do you have?

@wyclearnpy
Author

My original image size is 1280x1024. For GPU memory I have two 11 GB 1080 Ti cards, but I can only train with a single GPU. I have 420 labeled frames in total.

@wyclearnpy
Author

Also, I'm now trying semi-supervised training at 384x384 resolution.
