Perception and autonomy for UAVs in tight indoor spaces is a highly challenging task, especially under two conditions:
- no prior map of the environment, and
- no GPS coordinates.
The goal is to navigate autonomously through the AirSim environment below, which has circular holes carved into its walls. The robot/agent should traverse the maximum number of holes by predicting continuous actions (v) from input camera observations (I), without colliding with any of the walls.
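The observation-to-action interface can be sketched as a minimal Gym-style environment. The class name, observation shape (84x84 grayscale), and 3-D velocity action are illustrative assumptions for this sketch, not the repository's actual code:

```python
import numpy as np

class HoleTraversalEnv:
    """Illustrative Gym-style interface: camera image I in, velocity v out.

    All names, shapes, and bounds here are assumptions for the sketch,
    not the values used by the actual AirSim environment.
    """

    OBS_SHAPE = (84, 84, 1)  # assumed grayscale camera frame
    ACT_DIM = 3              # assumed continuous velocity (vx, vy, vz)

    def reset(self):
        # The real environment would reset the AirSim simulation
        # and return the first camera frame I.
        return np.zeros(self.OBS_SHAPE, dtype=np.uint8)

    def step(self, action):
        # Clip the continuous action v to the assumed bounds.
        action = np.clip(np.asarray(action, dtype=np.float32), -1.0, 1.0)
        obs = np.zeros(self.OBS_SHAPE, dtype=np.uint8)  # next camera frame I
        reward = 0.0   # e.g. bonus per hole traversed, penalty on collision
        done = False   # True on collision or episode timeout
        return obs, reward, done, {}
```

In the actual project this interface would wrap the AirSim client; the stub above only pins down the shapes the policy sees.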
First, activate the conda environment:
$ conda activate rl0
To start training a policy (PPO or SAC), run one of:
$ python trainPPO.py
$ python trainSAC.py
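A reward that encourages traversing holes while punishing wall collisions could be shaped as below. The terms and weights are illustrative assumptions, not the actual reward used in trainPPO.py or trainSAC.py:

```python
def shaped_reward(dist_to_hole, prev_dist, passed_hole, collided):
    """Illustrative reward shaping for hole traversal (an assumption,
    not the repository's actual reward function):
      + progress toward the next hole center
      + bonus for passing through a hole
      - large penalty on wall collision
    """
    reward = 1.0 * (prev_dist - dist_to_hole)  # progress term
    if passed_hole:
        reward += 10.0                         # traversal bonus
    if collided:
        reward -= 100.0                        # collision penalty
    return reward
```

Dense progress terms like this are a common workaround when the natural reward (holes traversed) is sparse, which is the setting the Wang et al. reference below addresses.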
To monitor training logs with TensorBoard, use the command below, replacing the logdir with your own:
$ tensorboard --logdir tb_logs/ppo_run_1733388980.4439118_1
To run inference with a trained policy:
$ python inference.py
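inference.py presumably loads a trained checkpoint and runs the policy in a closed loop. The sketch below shows that general pattern with a stub `predict` callable standing in for a loaded model's prediction method; the function name and loop structure are assumptions, not the script's actual contents:

```python
def run_episode(env, predict, max_steps=500):
    """Closed-loop rollout: feed each camera frame I to the policy and
    apply the predicted continuous action v until the episode ends.

    `predict` stands in for a trained model's prediction method (an
    assumption about inference.py). Returns total reward and step count.
    """
    obs = env.reset()
    total_reward, steps = 0.0, 0
    for _ in range(max_steps):
        action = predict(obs)                   # v = pi(I)
        obs, reward, done, _ = env.step(action)
        total_reward += reward
        steps += 1
        if done:                                # collision or timeout
            break
    return total_reward, steps
```

The same loop works for evaluating either the PPO or the SAC policy, since both output continuous actions.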
References:
- Y. Song, A. Romero, M. Müller, V. Koltun, and D. Scaramuzza, "Reaching the limit in autonomous racing: Optimal control versus reinforcement learning," Science Robotics, 2023.
- J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, "Proximal policy optimization algorithms," arXiv preprint arXiv:1707.06347, 2017.
- T. Haarnoja, A. Zhou, P. Abbeel, and S. Levine, "Soft Actor-Critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor," in International Conference on Machine Learning, pp. 1861-1870, PMLR, 2018.
- C. Wang, J. Wang, J. Wang, and X. Zhang, "Deep-reinforcement-learning-based autonomous UAV navigation with sparse rewards," IEEE Internet of Things Journal, vol. 7, no. 7, pp. 6180-6190, July 2020.
- B. Kabas, "Autonomous UAV navigation via deep reinforcement learning using PPO," in 2022 30th Signal Processing and Communications Applications Conference (SIU), IEEE, 2022.
- H. X. Pham et al., "Autonomous UAV navigation using reinforcement learning," arXiv preprint arXiv:1801.05086, 2018.
License: MIT