GitHub - BayKeremm/thesis-code

Summary
This work presents a pipelined approach for a camera detection system to detect cones in frames and recover their 3D position. It employs an object detection model, a keypoint regression model, and the Perspective-n-Point algorithm.

The system achieves sub-80ms latency and <0.5m errors at 10m.

Requirements:

Ubuntu 20.04 LTS
CUDA 12.1
OpenCV 4.7
Libtorch CUDA 11.7 cxx11 ABI
ROS Noetic
Build tool: Catkin

How to run this code:

Initialize ros using:

roscore
Build the packages: in a separate terminal navigate to the root of your catkin workspace (~/my_catkin_ws/) and build the package using:

catkin build image_acquisition_package image_processing_package
Source the workspace: after successfully building your package, source the workspace to make the package visible to ROS

source ~/my_catkin_ws/devel/setup.bash
Run the exectuable/node defined in CMakeLists.txt:

rosrun image_processing_package vision_node
Play the rosbag: in a separate tab, go to the directory containing rosbags to test and run:

rosbag play your_bag_file.bag
See the output in terminal were packages are running. Alternatively, if output is set to write to file, wait for completition of rosbag and check out folder with outputs.

Note: The trained neural networks are not included in this repository.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
image_acquisition_package		image_acquisition_package
image_processing_package		image_processing_package
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

BayKeremm/thesis-code

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages