Industrial Engineering Journal ›› 2024, Vol. 27 ›› Issue (1): 78-85, 103. doi: 10.3969/j.issn.1007-7375.230101

• Systems Modeling and Optimization •

  • Corresponding author: WU Zhaoyun (b. 1981), male, from Liaoning Province; professor, Ph.D. His research interests include manufacturing informatization and intelligent manufacturing. E-mail: wuzhaoyun@haut.edu.cn
  • About the first author: ZHANG Zhongwei (b. 1987), male, from Henan Province; associate professor, Ph.D. His research interests include low-carbon manufacturing and intelligent manufacturing.
  • Funding:
    Supported by the National Natural Science Foundation of China (U1704156), the Science and Technology Research Program of Henan Province (212102210357), the Key Scientific Research Project of Colleges and Universities in Henan Province (23A460003), and the Open Fund of the Henan Key Laboratory of Superhard Abrasives and Grinding Equipment (JDKFJJ2022012)

Energy-efficient Flexible Job-shop Scheduling Based on Deep Reinforcement Learning

ZHANG Zhongwei, LI Yi, GAO Zengen, WU Zhaoyun   

  1. Henan Key Laboratory of Superhard Abrasives and Grinding Equipment, School of Mechanical & Electrical Engineering, Henan University of Technology, Zhengzhou 450001, China
  • Received:2023-05-18 Published:2024-03-05


Abstract: Current research on energy-efficient flexible job-shop scheduling problems (EFJSPs) cannot make full use of historical production data and adapts poorly to complex, dynamic, and variable job-shop production environments. In view of this, deep reinforcement learning is introduced to solve EFJSPs, using the representative deep Q-network (DQN) method. First, the EFJSP is transformed into the Markov decision process that underlies reinforcement learning. State values characterizing the job-shop production state are then extracted as inputs to a neural network; by fitting the state value function, the network outputs compound scheduling action rules that select both jobs and processing machines. The scheduling action rules and the reward function are used jointly to optimize total production energy consumption. Finally, the solutions of the proposed method are compared with those of typical intelligent optimization algorithms, namely the non-dominated sorting genetic algorithm, a hyper-heuristic genetic algorithm, and a multi-objective wolf pack algorithm, on three cases of different scales. The results demonstrate the strong search capability of the DQN algorithm, and the distribution of the optimal solutions is consistent with the energy-consumption objective of the proposed EFJSP model, verifying the effectiveness of the DQN method.
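As a rough illustration of the pipeline the abstract describes (state features in, compound dispatching rules out, energy-driven rewards), the sketch below trains a Q-learning agent on a tiny invented flexible job-shop instance. A linear Q-function stands in for the paper's deep Q-network, and the two compound rules, the state features, and all job/machine data are hypothetical simplifications rather than the paper's actual design.

```python
import random

# Toy EFJSP instance (all data invented): 3 jobs x 2 operations; each
# operation can run on either of 2 machines, with a machine-dependent
# (processing_time, power_kW) pair. Machine 0 is fast but power-hungry.
PROC = {
    (0, 0): {0: (4, 3.0), 1: (6, 1.5)},
    (0, 1): {0: (3, 3.0), 1: (5, 1.5)},
    (1, 0): {0: (2, 3.0), 1: (3, 1.5)},
    (1, 1): {0: (4, 3.0), 1: (6, 1.5)},
    (2, 0): {0: (5, 3.0), 1: (7, 1.5)},
    (2, 1): {0: (2, 3.0), 1: (4, 1.5)},
}
N_JOBS, N_OPS = 3, 2

def reset():
    # next_op[j]: index of the next unscheduled operation of job j
    return {"next_op": [0] * N_JOBS, "mach_free": [0.0, 0.0], "energy": 0.0}

def ready_jobs(state):
    return [j for j in range(N_JOBS) if state["next_op"][j] < N_OPS]

def features(state):
    # State values characterizing the production state:
    # completion ratio and machine-load imbalance, plus a bias term.
    done = sum(state["next_op"]) / (N_JOBS * N_OPS)
    hi, lo = max(state["mach_free"]), min(state["mach_free"])
    return [1.0, done, (hi - lo) / (1.0 + hi)]

# Compound dispatching rules: each action picks a job AND a machine.
def rule_spt_min_power(state):  # shortest-processing-time job, lowest-power machine
    j = min(ready_jobs(state),
            key=lambda j: min(t for t, _ in PROC[(j, state["next_op"][j])].values()))
    alts = PROC[(j, state["next_op"][j])]
    return j, min(alts, key=lambda m: alts[m][1])

def rule_mwr_fastest(state):    # most-work-remaining job, fastest machine
    j = max(ready_jobs(state), key=lambda j: N_OPS - state["next_op"][j])
    alts = PROC[(j, state["next_op"][j])]
    return j, min(alts, key=lambda m: alts[m][0])

RULES = [rule_spt_min_power, rule_mwr_fastest]

def step(state, action):
    j, m = RULES[action](state)
    t, p = PROC[(j, state["next_op"][j])][m]
    state["next_op"][j] += 1
    state["mach_free"][m] += t
    state["energy"] += t * p
    return -t * p, not ready_jobs(state)  # reward: negative energy increment

# Linear Q(s, a) = W[a] . features(s) stands in for the deep Q-network.
W = [[0.0] * 3 for _ in RULES]

def q(state, a):
    return sum(w * f for w, f in zip(W[a], features(state)))

def greedy(state):
    return max(range(len(RULES)), key=lambda a: q(state, a))

def train(episodes=200, eps=0.2, alpha=0.01, gamma=0.95):
    for _ in range(episodes):
        state, done = reset(), False
        while not done:
            a = random.randrange(len(RULES)) if random.random() < eps else greedy(state)
            f = features(state)
            q_old = sum(w * x for w, x in zip(W[a], f))
            reward, done = step(state, a)
            target = reward + (0.0 if done else
                               gamma * max(q(state, b) for b in range(len(RULES))))
            for i in range(len(f)):  # TD(0) update of the linear weights
                W[a][i] += alpha * (target - q_old) * f[i]

random.seed(1)
train()
state, done = reset(), False
while not done:
    _, done = step(state, greedy(state))
print("greedy schedule energy:", state["energy"])
```

In the paper's setting, the linear weights would be replaced by a deep network fitting the state value function, and the action set would contain a richer family of compound rules; the reward shaping above (negative energy increment per dispatch) mirrors the idea of co-optimizing energy consumption through the action rules and the reward function.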

Key words: energy-efficient flexible job-shop scheduling, deep reinforcement learning, deep Q-network, Markov decision process
