Replies: 5 comments
-
I have attached the images as an attachment here |
Beta Was this translation helpful? Give feedback.
-
@saket424 OK, I will try to help you solve your problem here. ^_^ |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
To resize a JPEG image using FFmpeg, you can use the following command: Replace width and height with the desired dimensions. To maintain the aspect ratio, set either the width or height to -1. For example, to set the width to 1280 pixels while preserving the aspect ratio, use: This command will automatically calculate the height based on the original aspect ratio. |
Beta Was this translation helpful? Give feedback.
-
@saket424 The problem is that in the previous code of llava, when the interpolation function calculates the coordinates, the function used is float, which is wrong and will cause coordinate access errors. When this happens at the end, it will even crash due to array out of bounds. Considering that this problem actually affects all multimodal models, I tested and submitted a small update PR. I hope this helps you. Thanks again for your discovery. Very cool. ^_^ |
Beta Was this translation helpful? Give feedback.
-
@tc-mb
Rather than call up your video understanding PR I thought I should send you the crash report here instead
Crash
No crash with smaller image
Beta Was this translation helpful? Give feedback.
All reactions