This repository provides awesome research papers for autonomous driving perception.
I have tried my best to keep this repository up to date. If you do find a problem or have any suggestions, please raise this as
an issue or make a pull request with information (format of the repo): Research paper title, datasets, metrics, objects, source code, publisher, and year.
This summary is categorized into:
- Datasets
- LiDAR-based 3D Object Detection
- Monocular Image-based 3D Object Detection
- LiDAR and RGB Images fusion
- Pseudo-LiDAR
- Training tricks
Abbreviations
- AP-2D: Average Precision for 2D detection (on RGB-image space)
- AP-3D: Average Precision for 3D detection
- AP-BEV: Average Precision for Birds Eye View
- AOS: Average Orientation Similarity (if 2D bounding box available)
Research Paper | Datasets | Metrics | Objects | Source Code | Publisher | Year |
---|---|---|---|---|---|---|
RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving |
|
|
Cars | PyTorch | ECCV | 2020 |
Stereo R-CNN based 3D Object Detection for Autonomous Driving |
|
|
Cars | PyTorch | CVPR | 2019 |
M3D-RPN: Monocular 3D Region Proposal Network for Object Detection |
|
|
Cars, Pedestrians, Cyclists | PyTorch | ICCV | 2019 |
Mono3D++: Monocular 3D Vehicle Detection with Two-Scale 3D Hypotheses and Task Priors |
|
|
Cars, Pedestrians, Cyclists | --- | ArXiv | 2019 |
3D Bounding Box Estimation Using Deep Learning and Geometry |
|
|
Cars, Cyclists | PyTorch | CVPR | 2017 |
Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis from monocular image |
|
Cars | --- | CVPR | 2017 | |
Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis from monocular image |
|
|
Cars | Link | ICRA | 2017 |
Research Paper | Datasets | Metrics | Objects | Source Code | Publisher | Year |
---|---|---|---|---|---|---|
ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes |
|
|
37 object categories | PyTorch | CVPR | 2020 |
Multi-Task Multi-Sensor Fusion for 3D Object Detection |
|
|
Cars, Pedestrians, Cyclists | PyTorch | CVPR | 2019 |
Research Paper | Datasets | Metrics | Objects | Source Code | Publisher | Year |
---|---|---|---|---|---|---|
Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving |
|
|
Cars, Pedestrians, Cyclists | PyTorch | ICLR | 2020 |
Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving |
|
|
Cars, Pedestrians, Cyclists | PyTorch | CVPR | 2019 |
Research Paper | Datasets | Metrics | Objects | Source Code | Publisher | Year |
---|---|---|---|---|---|---|
PPBA: Improving 3D Object Detection through Progressive Population Based Augmentation |
|
|
Cars, Pedestrians, Cyclists | --- | ArXiv | 2020 |
Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection |
|
|
10 object categories | PyTorch | ArXiv | 2019 |
Weighted Point Cloud Augmentation for Neural Network Training Data Class-Imbalance |
|
--- | ArXiv | 2019 |
To do list:
- Add 3D object detection papers based on LiDAR/monocular images/fusion/pseudo-LiDAR.
- Add training tricks papers
- Add object tracking papers.
- Provide
download.py
script to automatically download.pdf
files.
- The format of the README has been referred from RedditSota/state-of-the-art-result-for-machine-learning-problems