基于自适应Stacking集成学习的散货码头船舶作业时间预测

    Prediction of Ship Operation Time in Bulk Cargo Terminals Based on Adaptive Stacking Ensemble Learning

    • 摘要: 船舶作业时间是制定泊位计划的重要依据,现有的关于船舶作业时间的研究多聚焦于集装箱码头,而散货码头由于其货物的特殊性及作业流程的复杂性,鲜有学者深入研究。考虑到散货码头不同泊位之间的相似性与差异性以及单一机器学习模型的局限性,本文首先提出了基于K-means的泊位聚类方法,将具有相似作业特性的泊位进行聚类;然后提出了结合模拟退火算法的自适应Stacking集成学习模型(SA-Stacking),针对不同类别的泊位进行船舶作业时间预测。本文基于青岛港干散货码头真实历史作业数据进行实验,实验结果表明,SA-Stacking模型能够根据不同泊位类别的数据分布特征,自适应地选择最合适的基模型组合,相比单一的机器学习模型具有更好的预测效果和泛化能力。同时,基于泊位特性分类的预测方式能更好地捕捉不同泊位的作业特点,相比于不进行泊位聚类,平均绝对误差降低约2 h,均方根误差及平均绝对百分比误差均显著降低,决定系数提高,模型解释能力和预测精度得到增强。

       

      Abstract: Ship operation time is a crucial factor in developing berth plans. While most existing research focuses on container terminals, few scholars conduct in-depth studies on bulk terminals due to the unique nature of their cargo and complex operational processes. Considering the similarities and differences among different berths at bulk cargo terminals and the limitations of a single machine learning model, this paper first proposes a berth clustering method based on K-means to group berths with similar operational traits. Then, an adaptive stacking ensemble learning combined with simulated annealing algorithm (SA-Stacking) is proposed for predicting ship operation time in various types of berths at bulk cargo terminals. This paper is based on the real historical operation data of Qingdao dry bulk terminal. Experimental results demonstrate that the SA-Stacking model can adapt to select the most appropriate combination of base models according to the data distribution characteristics of different berth categories, and has better prediction effect and generalization ability than a single machine learning model. Furthermore, the prediction approach based on berth characteristic classification can better capture the operational characteristics of different berths. Compared to prediction without berth clustering, the mean absolute error is reduced by nearly 2 hours, and both the root mean square error and the mean absolute percentage error are significantly reduced. At the same time, the R-square is increased, enhancing the model's interpretability and prediction accuracy.

       

    /

    返回文章
    返回