Solving the Traveling Salesman Problem with Drones Using Proximal Policy Optimization and Deep Reinforcement Learning
Abstract
The increasing demand for scalable and efficient last-mile delivery has prompted the integration of drones with trucks in hybrid logistics systems. While reinforcement learning (RL) methods have shown promise in addressing the Traveling Salesman Problem with Drones (TSP-D), most approaches focus on single truck-drone coordination, limiting their real-world applicability. This paper introduces a novel multi-agent reinforcement learning (MARL) framework based on Proximal Policy Optimization (PPO) to address a multi-truck multi-drone TSP-D scenario. Each agent (truck or drone) learns a decentralized policy with a shared global reward, enabling real-time, cooperative route planning. An enhanced state representation captures vehicle positions, battery constraints, and inter-agent interactions. The actor-critic network employs deep residual layers and agent identity encoding to support dynamic adaptation and coordination. A Dijkstra-based module ensures drone reachability under energy constraints, while a task allocation mechanism balances delivery loads and prevents conflicts. Experiments on synthetic and real-world-inspired datasets demonstrate the proposed model’s superiority over single-agent PPO and classical metaheuristics in terms of delivery time, energy efficiency, and scalability. As the number of agents and delivery nodes grows, the system maintains high performance, demonstrating strong potential for real-time autonomous logistics in urban environments.
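The abstract mentions a Dijkstra-based module that verifies drone reachability under energy constraints. The following is a minimal sketch of how such a check could work; the graph representation, edge energy costs, and the `drone_reachable` function name are illustrative assumptions, not the paper's actual implementation.

```python
import heapq

def drone_reachable(graph, start, target, battery):
    """Dijkstra over per-edge energy costs: return True if the
    minimum-energy path from `start` to `target` fits within the
    drone's remaining `battery`.

    `graph` maps node -> list of (neighbor, energy_cost) edges.
    (Illustrative sketch; the paper's module may differ.)
    """
    dist = {start: 0.0}
    pq = [(0.0, start)]  # (accumulated energy cost, node)
    while pq:
        cost, node = heapq.heappop(pq)
        if node == target:
            return cost <= battery
        if cost > dist.get(node, float("inf")):
            continue  # stale queue entry
        for nbr, w in graph.get(node, []):
            new_cost = cost + w
            # Prune paths that already exceed the battery budget.
            if new_cost <= battery and new_cost < dist.get(nbr, float("inf")):
                dist[nbr] = new_cost
                heapq.heappush(pq, (new_cost, nbr))
    return False
```

In a task-allocation loop, this check would filter out delivery nodes a drone cannot reach and return from before the policy assigns them, keeping infeasible actions out of the agent's action space.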