Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training Details #8

Open
anilbatra2185 opened this issue Aug 13, 2024 · 2 comments
Open

Training Details #8

anilbatra2185 opened this issue Aug 13, 2024 · 2 comments

Comments

@anilbatra2185
Copy link

hi @loveofguoke @longqian-zju

I have the following queries on training the model.

  • Do we train the model with image-text or video-text pairs? If yes, then which datasets are used for the same.
image
  • How to train with GESM data? The converted data do not have conversations key, however, the source code expect this information. Moreover after conversion, the only present keys are id, data_type, data as used in convert_data_gesm.py.
image
  • I assume we first need to train with GESM data and then perform supervised instruction fine-tuning with Moment-10M. Please confirm.

It would be really helpful if you can share the information to train the model along with appropriate dataset format.

Thanks in advance!

@longqian-zju
Copy link
Contributor

Sorry for the inconvenience.

Do we train the model with image-text or video-text pairs? If yes, then which datasets are used for the same.

By utilizing the checkpoint of Video-ChatGPT, we skip the stage of modality alignment. Some of the results from other projects indicate that it will be better to train from sketch with more recent image-text and video-text datasets, but it will be much more expensive.

How to train with GESM data? The converted data do not have conversations key, however, the source code expect this information. Moreover after conversion, the only present keys are id, data_type, data as used in convert_data_gesm.py.

I will recheck the data format. Thank you for your interest in our work!

@skhcjh231
Copy link

Have you checked the GESM data format? I also want to know how to train using GESM annotations!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants