Deep Reinforcement Learning for drone obstacle avoidance

Loc Pham, Thanh

To access the full text, follow this link: https://hdl.handle.net/10256/28367

Author

Loc Pham, Thanh

Other authors

Vasiljević, Goran

Manen, Benjamin van

Publication date

2025

Abstract

Unmanned Aerial Vehicles (UAVs) are increasingly deployed in autonomous missions across complex, cluttered environments where reliable obstacle avoidance is critical. Traditional navigation frameworks rely on modular pipelines (separating perception, mapping, planning, and control), which often suffer from error accumulation, high computational overhead, and poor reactivity in dynamic scenarios. To address these limitations, this thesis investigates an end-to-end deep reinforcement learning (DRL) framework for real-time UAV obstacle avoidance using onboard depth sensing. We compare two state-of-the-art DRL algorithms, Proximal Policy Optimization (PPO) and Twin Delayed DDPG (TD3), in a continuous control setting, evaluating their training dynamics and performance in diverse simulated environments. Our initial experiments highlight key failure modes such as collisions with overhead obstacles and dead-end traps, caused by the policy’s limited temporal awareness. To overcome these, we propose a neural architecture that incorporates both a pretrained ResNet8-based depth encoder and two temporal reasoning mechanisms: (1) an LSTM module for recurrent memory, and (2) a stacked buffer of recent depth observations. This temporal augmentation allows the agent to recover from occlusions and partial observability, significantly improving navigation robustness. Trained in a curriculum-based Gym-PyBullet-Drones environment, our final memory-based policy achieves a 96% success rate across randomized 3D obstacle courses and outperforms EGO-Planner-v2 in both success rate and adaptability. The results demonstrate that DRL policies with temporal context can match or exceed the performance of traditional planning pipelines while offering greater generalization and simplicity in deployment.
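
The stacked buffer of recent depth observations mentioned in the abstract can be pictured as a fixed-length queue of frames. The sketch below is a minimal illustration in Python, not the thesis implementation: the class name DepthFrameStack, the buffer length k, the nested-list frame format, and the choice to pad the buffer with the first frame at reset are all assumptions made for this example.

```python
from collections import deque
from typing import Deque, List

# Hypothetical frame format: a depth image stored as rows of distances.
Frame = List[List[float]]


class DepthFrameStack:
    """Keeps the k most recent depth frames, ordered oldest to newest."""

    def __init__(self, k: int) -> None:
        if k < 1:
            raise ValueError("k must be at least 1")
        self.k = k
        # A deque with maxlen drops the oldest frame when a new one arrives.
        self._frames: Deque[Frame] = deque(maxlen=k)

    def reset(self, first_frame: Frame) -> List[Frame]:
        """Start an episode by filling the buffer with the first frame (assumed padding)."""
        self._frames.clear()
        for _ in range(self.k):
            self._frames.append(first_frame)
        return self.observation()

    def push(self, frame: Frame) -> List[Frame]:
        """Add the newest frame and return the updated stack."""
        if not self._frames:
            return self.reset(frame)
        self._frames.append(frame)
        return self.observation()

    def observation(self) -> List[Frame]:
        """Return the stacked observation that would be fed to the policy."""
        return list(self._frames)
```

With k = 3, for example, the policy would see the current depth frame together with the two before it, which is one simple way to give a feed-forward network short-term context about obstacles that have just left the field of view.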

Document Type

Master's final project

Language

English

Subjects and keywords

DRL (Deep Reinforcement Learning); Machine learning; Deep learning (Machine learning); Autonomous aerial vehicles; UAV (Unmanned aerial vehicle); Drone aircraft; Robots -- Navigation systems; Obstacle avoidance

Publisher

Universitat de Girona. Institut de Recerca en Visió per Computador i Robòtica

Rights

Attribution-NonCommercial-NoDerivatives 4.0 International

http://creativecommons.org/licenses/by-nc-nd/4.0/

This item appears in the following Collection(s)

Treballs de màsters i postgraus [843]
