This repository is an implementation of StereoTracking, presented in the paper "Drone-based Depth-Aware Airborne Object Detection and Tracking".
Our code is based on MMTracking.
We present StereoTracking, a novel detection-based tracking framework designed to enhance the perception and tracking of airborne targets by leveraging depth cues from stereo vision systems. Its key components include a Multi-modal Detector, Local-to-Global Motion Field Estimation (LGME), Adaptive Truncated Mean (ATM), and Distance-aware Online Scaling (DAOS). These components can be seamlessly integrated into existing tracking systems and work synergistically to enhance detection performance, compensate for camera motion, and adapt to varying target distances.
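To give a flavor of how the depth-aware components operate, below is a minimal, illustrative Python sketch of a truncated-mean disparity estimate over a detected box. The window and truncation fraction are hypothetical choices for illustration, not the exact ATM formulation from the paper.

```python
import numpy as np

def truncated_mean_disparity(disparity_patch, trunc_frac=0.2):
    """Robust disparity estimate for a target patch (illustrative only).

    Sorts the valid disparity values inside a detection box and discards
    a fraction of the smallest and largest values (likely background or
    outliers) before averaging. `trunc_frac` is a hypothetical setting,
    not the paper's exact ATM parameterization.
    """
    values = np.sort(disparity_patch[disparity_patch > 0].ravel())
    if values.size == 0:
        return None
    k = int(values.size * trunc_frac)
    kept = values[k:values.size - k] if values.size > 2 * k else values
    return float(kept.mean())

# Example: estimate the disparity of a detected box on a disparity map.
disp = np.random.rand(480, 640).astype(np.float32) * 64.0
x, y, w, h = 300, 200, 40, 30  # (x, y, w, h) of a detection
print(truncated_mean_disparity(disp[y:y + h, x:x + w]))
```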
- Installation
- AirSim-Drone Dataset
- Testing
- Training
- Export to ONNX
- A toy video from the Drone-vs-Bird dataset
conda create -n stereoTracking python=3.9.0 pip=22.2.2
conda activate stereoTracking
# Install pytorch
conda install pytorch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 pytorch-cuda=11.7 -c pytorch -c nvidia
# Install MMEngine, MMCV, MMDet, MMYOLO. Be aware that this must be done _after_ installing PyTorch, otherwise openmim will skip the installation of some components.
pip install -U openmim
mim install "mmengine==0.10.3" "mmcls==1.0.0rc4" "mmcv==2.0.0rc3" "mmdet==3.0.0rc4" "mmyolo==0.2.0"
# Clone the repository
git clone https://github.com/Superjie13/StereoTracking.git
cd StereoTracking
# Install build requirements and build StereoTracking
pip install -r requirements/build.txt
pip install -v -e . # or "python setup.py develop"
# For MOT Evaluation
pip install git+https://github.com/JonathonLuiten/TrackEval.git
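After installation, a quick sanity check (our suggestion, not part of the official setup) is to import the core packages and print their versions:

```python
# Sanity check that the pinned versions resolved correctly.
import torch
import mmengine, mmcv, mmdet, mmyolo

print("torch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("mmengine:", mmengine.__version__)
print("mmcv:", mmcv.__version__)
print("mmdet:", mmdet.__version__)
print("mmyolo:", mmyolo.__version__)
```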
- Ubuntu 20.04
- Python 3.9
- PyTorch 1.13.1
- GPU: RTX 4090 (24 GB)
Please download our dataset from the links below (all files), unzip the left/right nested zips, and create a soft link to the data folder.
- Sample of 1 sequence: download (1.6 GB)
- Video previews of all sequences: download (330 MB)
- Full dataset, 64 sequences: download (141 GB, 15 files)
- To unzip, place all files in the same directory, then run
zip -FF airdrone_full.zip --out airdrone_full_all.zip; unzip -FF airdrone_full_all.zip
This first reassembles the parts into a single contiguous 141 GB zip and then unzips it, so around 425 GB of free space is required. Alternatively, use 7-Zip to avoid generating the contiguous copy of the zip.
The annotation format is similar to MOT-Challenge (a minimal parsing sketch follows the field list):
frame_id, id, x, y, w, h, confidence, X, Y, Z, class
- frame_id: frame number
- id: object id
- x, y, w, h: bounding box coordinates
- confidence: detection confidence
- X, Y, Z: 3D location of the object
- class: object class
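For reference, here is a small parsing sketch (ours, not shipped with the repo; the class field is assumed to be an integer label):

```python
import csv
from collections import defaultdict

def load_annotations(txt_path):
    """Parse a MOT-style annotation file into {frame_id: [records]}.

    Each line: frame_id, id, x, y, w, h, confidence, X, Y, Z, class
    (`class` is assumed to be an integer label here).
    """
    per_frame = defaultdict(list)
    with open(txt_path, newline="") as f:
        for row in csv.reader(f):
            per_frame[int(row[0])].append(dict(
                id=int(row[1]),
                bbox=tuple(map(float, row[2:6])),   # x, y, w, h
                conf=float(row[6]),
                xyz=tuple(map(float, row[7:10])),   # 3D location X, Y, Z
                cls=int(row[10]),
            ))
    return per_frame
```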
We provide COCO-style annotations in the annotations folder for convenient use.
mkdir data
ln -s /path/to/your/dataset data/
data
├── AirSim_drone
│   ├── annotations
│   │   ├── train_cocoformat_[xx].json
│   │   ├── val_cocoformat_[xx].json
│   │   └── ...
│   ├── train
│   │   ├── video_sequence
│   │   │   ├── depth
│   │   │   ├── disparity
│   │   │   ├── left
│   │   │   ├── right
│   │   │   └── ...
│   │   └── ...
│   └── val
│       ├── video_sequence
│       │   ├── depth
│       │   ├── disparity
│       │   ├── left
│       │   ├── right
│       │   └── ...
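Once the data is linked, the COCO-style files can be read directly with pycocotools; a minimal sketch, where `[xx]` stands for the actual suffix of your annotation file:

```python
from pycocotools.coco import COCO

# Replace [xx] with the actual suffix of your downloaded annotation file.
coco = COCO("data/AirSim_drone/annotations/val_cocoformat_[xx].json")

img_ids = coco.getImgIds()
print(f"{len(img_ids)} images, {len(coco.getAnnIds())} annotations")

# Inspect the first image and its boxes.
img = coco.loadImgs(img_ids[0])[0]
for ann in coco.loadAnns(coco.getAnnIds(imgIds=img["id"])):
    print(img["file_name"], ann["bbox"], ann["category_id"])
```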
We provide the position statistics of the training and validation sets of our AirSim-Drone dataset.
To test StereoTracking on the AirSim-Drone dataset, download the pretrained model from here and run the following command:
python tools/test.py \
configs/stereo_tracking/ocsort/yolox_s_mmyolo_mot_airsim_drone_disp.py \
--checkpoint [path/to/checkpoint] \
--work-dir [path/to/your/workdir]
python tools/train.py \
configs/stereo_tracking/ocsort/yolox_s_mmyolo_mot_airsim_drone_disp.py \
--work-dir [path/to/your/workdir]
To convert the StereoTracking model to ONNX, you can refer to ONNX Model Export in the YOLOX-Deploy-Test---multi-modal-inputs repository.
Note: the StereoTracking model is trained with the 'mmtracking' framework, so to export the detection model you first need to extract the 'detector' part from the full model and then export it to ONNX:
python tools/utils/extract_det_model.py path/to/checkpoint
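Conceptually, the extraction boils down to the following sketch (an illustration only; it assumes the checkpoint stores the detector weights under a `detector.` prefix, as mmtracking-style models typically do):

```python
import torch

# Load the full StereoTracking checkpoint (tracker + detector).
ckpt = torch.load("path/to/checkpoint", map_location="cpu")
state = ckpt.get("state_dict", ckpt)

# Keep only detector weights and strip the 'detector.' prefix so the
# result loads into a standalone detector.
det_state = {k[len("detector."):]: v
             for k, v in state.items()
             if k.startswith("detector.")}

torch.save({"state_dict": det_state}, "detector_only.pth")
```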
To demonstrate the effectiveness of our method even without depth information, i.e., leveraging only our LGME, we provide a toy video from the Drone-vs-Bird dataset.