The InsTrack dataset is available on Google Drive.
TrackGPT is a new tracking architecture capable of complex reasoning-based tracking, achieved by injecting a Large Vision-Language Model's (LVLM's) comprehension of the multi-modal world. Alongside it, we propose a new tracking task, Instruction Tracking (InsT), set in a context where perception and comprehension tasks are no longer clearly demarcated. We undertake this modest attempt to advance next-generation object tracking with more intelligence.
- [2023/12/31] We make TrackGPT publicly available.
- A new task called instruction tracking (InsT) is proposed, in which a tracker must have self-reasoning capability: it autonomously interprets an implicit instruction and tracks the target object. This human-tracker interaction paradigm aligns better with the way humans ask questions.
- A benchmark, InsTrack, is also constructed for instruction tuning and evaluation.
- This work presents TrackGPT, a tracker that can comprehend human intent by leveraging the reasoning capability of an LVLM. TrackGPT is designed following a simple-yet-effective principle; we hope this work can catalyze more compelling research in the future.
- Referring Tracking
- Instruction Tracking
- Install the conda environment:
```bash
conda create -n trackgpt python=3.9
conda activate trackgpt
```
- Install the required packages:
```bash
pip install -r requirements.txt
pip install flash-attn --no-build-isolation
```
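As a quick sanity check of the environment, you can verify that PyTorch sees the GPU and that flash-attn built correctly (a minimal sketch; it assumes PyTorch is pulled in by `requirements.txt`):

```bash
# Check that CUDA is visible to PyTorch and that flash-attn imports cleanly.
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"
python -c "import flash_attn; print(flash_attn.__version__)"
```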
Run the demo:
```bash
sh TrackGPT_demo.sh
```
For example:
```
Please input your tracking instruction: I'd like to focus on the protagonist of this street event. Please track the object.
Please input the video path: test_videos/breakdance
```
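Note that the example path points at a directory of frames rather than a single video file. If your source is an .mp4, a frame folder can be produced with ffmpeg (a hedged sketch: `test_videos/my_video` and the zero-padded JPEG naming are assumptions, since the exact frame layout the demo expects is not documented here):

```bash
# Split an .mp4 into a folder of zero-padded JPEG frames (assumed demo layout).
mkdir -p test_videos/my_video
ffmpeg -i my_video.mp4 -q:v 2 test_videos/my_video/%05d.jpg
```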
This project is based on LISA and LLaVA. Thanks to these excellent works.
If you find TrackGPT useful, please consider citing 📣
```bibtex
@misc{trackgpt,
      title={Tracking with Human-Intent Reasoning},
      author={Jiawen Zhu and Zhi-Qi Cheng and Jun-Yan He and Chenyang Li and Bin Luo and Huchuan Lu and Yifeng Geng and Xuansong Xie},
      year={2023},
      eprint={2312.17448},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
```