Session: 08-01-03 AI Driven Autonomous Navigation, Collision Avoidance & Optimization
Submission Number: 180183
A Method for Ship Collision Avoidance Based on Deep Reinforcement Learning Considering Uncertainty
In complex maritime environments, various types of ships coexist, resulting in overlapping routes and unpredictable movements. Such complexity can often lead to serious maritime accidents, including collisions and groundings. To ensure safe navigation, it is essential to accurately perceive the dynamic surroundings of the own ship and secure a safe route through appropriate avoidance maneuvers. Traditional collision avoidance methods, such as the VO (Velocity Obstacles) method, are advantageous for calculating real-time avoidance maneuvers by considering the potential collision risk with surrounding TSs (target ships). However, traditional methods for collision avoidance degrade significantly when the positional or velocity information of the TSs is inaccurate. In complex maritime environments, these methods may generate overly conservative routes with limited maneuverability. Moreover, maritime regulations such as the COLREGs (International Regulations for Preventing Collisions at Sea) often contain ambiguous provisions, making it difficult to integrate them effectively into traditional methods for collision avoidance.
To address these limitations, we propose a method for ship collision avoidance based on DRL (Deep Reinforcement Learning) that accounts for uncertainty. In this method, the SAC (Soft Actor-Critic) algorithm, the DRL framework is known for its stability and sample efficiency. The proposed method introduces an improved collision risk assessment method that uses the AR (Approach Rate) to calculate collision risk more smoothly and reliably than traditional CPA (Closest Point of Approach) methods. Furthermore, an attention mechanism is incorporated to effectively consider information from multiple TSs, enabling the model to prioritize critical ships based on their relative importance during avoidance decisions.
To ensure compliance with real-world navigation rules, the reward function is designed to reflect COLREGs-based constraints, encouraging rule-consistent avoidance maneuvers. During verification, uncertainties in TS positions, speeds, and headings are modeled using a bivariate normal distribution, allowing performance evaluation under realistic sensor noise. Simulation experiments demonstrate that the proposed method achieves safer, more robust collision avoidance maneuvers than traditional methods, maintaining reliability even in uncertain, dynamic maritime environments.
Keywords: Collision avoidance, DRL (Deep Reinforcement Learning), SAC (Soft Actor-Critic), Attention mechanism, Uncertainty
Presenting Author: Seong-Won Choi Seoul National University
Presenting Author Biography: Mr. Choi Seong-Won is a master’s student in the Department of Naval Architecture and Ocean Engineering at Seoul National University, working in the System Design Lab under the supervision of Professor Myung-Il Roh. His research focuses on applying deep learning and reinforcement learning to intelligent maritime systems, with particular interests in collision avoidance and path planning for autonomous surface vessels. He is currently engaged in developing learning-based control and perception algorithms for real-time decision-making in complex maritime environments.
Authors:
Seong-Won Choi Seoul National UniversityMyung-Il Roh Seoul National University
In-Chang Yeo Seoul National University
A Method for Ship Collision Avoidance Based on Deep Reinforcement Learning Considering Uncertainty
Submission Type
Technical Presentation Only