EventGPT: Event Stream Understanding with Multimodal Large Language Models

XduSyL/EventGPT

Shaoyu Liu¹, Jianing Li², Guanghui Zhao¹, Yunjian Zhang³,
Xin Meng⁴, Fei Richard Yu⁵, Xiangyang Ji², Ming Li⁵

¹Xidian University, ²Tsinghua University, ³UCAS, ⁴Peking University,
⁵Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ)

EventGPT is a multimodal large language model (MLLM) that integrates event streams and text, built on spatio-temporal representations and advanced language modeling methods.

🏠 About

Video demo of EventGPT.

The EventGPT model, along with the N-ImageNet-Chat and Event-Chat datasets, will be released after the acceptance of our paper.

🔥 News

[2024-12-03] 🎥 The video demo of EventGPT is live! See it in action!

[2024-12-02] 🌐 The project page is now online. Explore EventGPT in depth.

[2024-12-01] 📄 Check out our paper on arXiv and discover the details of EventGPT! 🎉