工业工程 ›› 2024, Vol. 27 ›› Issue (1): 36-44,53.doi: 10.3969/j.issn.1007-7375.220247

• 系统建模与优化 • 上一篇    下一篇

基于改进DQN算法的无人仓多AGV路径规划

谢勇1, 郑绥君1, 程念胜2, 朱洪君1   

  1. 1. 华中科技大学 人工智能与自动化学院,湖北 武汉 430074;
    2. 航天信息股份有限公司,北京 100195
  • 收稿日期:2022-12-08 发布日期:2024-03-05
  • 通讯作者: 郑绥君 (1998—),女,湖南省人,硕士研究生,主要研究方向为多智能体的调度优化与路径规划。Email:zheng893724451@163.com E-mail:zheng893724451@163.com
  • 作者简介:谢勇 (1974—),男,湖北省人,副教授,主要研究方向为智慧物流、优化调度、智能制造
  • 基金资助:
    国家自然科学基金资助面上项目 (71771096);国家自然科学基金创新群体资助项目 (71821001)

Multi-AGV Route Planning for Unmanned Warehouses Based on Improved DQN Algorithm

XIE Yong1, ZHENG Suijun1, CHENG Niansheng2, ZHU Hongjun1   

  1. 1. School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China;
    2. Aerospace Information Co., Ltd, Beijing 100195, China
  • Received:2022-12-08 Published:2024-03-05

摘要: 针对无人仓中多AGV路径规划与冲突问题,以最小化总行程时间为目标,建立多AGV路径规划模型,提出一种基于动态决策的改进DQN算法。算法设计了基于单AGV静态路径规划的经验知识模型,指导AGV的学习探索方向,提前规避冲突与障碍物,加快算法收敛。同时提出基于总行程时间最短的冲突消解策略,从根本上解决多AGV路径冲突与死锁问题。最后,建立无人仓栅格地图进行仿真实验。结果表明,本文提出的模型和算法较其他DQN算法收敛速度提升13.3%,平均损失值降低26.3%。这说明该模型和算法有利于规避和化解无人仓多AGV路径规划冲突,减少多AGV总行程时间,对提高无人仓作业效率具有重要指导意义。

关键词: 多AGV, 路径规划, DQN算法, 经验知识, 冲突消解

Abstract: To solve the problem of multi-AGV route planning and conflicts in unmanned warehouses, with the objective of minimizing the total travel time, a multi-AGV route planning model is established, and an improved DQN algorithm based on dynamic decision-making is proposed. An empirical knowledge model based on static route planning of a single AGV is designed to guide the learning and exploration direction of AGVs. It avoids conflicts and obstacles for AGVs in advance, and accelerates the convergence of the proposed algorithm. Also, a conflict resolution strategy based on the shortest total travel time is proposed to fundamentally solve the problem of multi-AGV route conflicts and deadlocks. Finally, a grid map of an unmanned warehouse is established for simulation experiments. Results show that, compared with other DQN algorithms, the convergence speed of the proposed model and algorithm is increased by 13.3%, and the average loss value is reduced by 26.3%. This result indicates that the model and algorithm are conducive to avoiding and resolving the conflicts of multi-AGV route planning in unmanned warehouses, reducing the total travel time of multiple AGVs and having important guiding significance to improve the efficiency of unmanned warehouse operations.

Key words: multiple AGVs, route planning, DQN algorithm, empirical knowledge, conflict resolution

中图分类号: