论文标题
监督置换不变网络,用于解决有界机队尺寸的CVRP
Supervised Permutation Invariant Networks for Solving the CVRP with Bounded Fleet Size
论文作者
论文摘要
学习解决组合优化问题(例如车辆路由问题)比经典操作研究求解器和启发式方法具有很大的计算优势。最近开发的深入增强学习方法可以改善最初给定的迭代解决方案或依次构建一组单独的旅行。但是,大多数现有基于学习的方法无法为固定数量的车辆工作,因此绕过客户的复杂分配问题,可以在给定数量的可用车辆上。另一方面,这使它们不适合实际应用,因为许多物流服务提供商都依赖于针对特定有限的车队尺寸的解决方案,并且无法适应车辆数量的短期更改。相比之下,我们提出了一个强大的监督深度学习框架,该框架从头开始构建完整的旅游计划,同时尊重Apriori固定数量的可用车辆。结合有效的后处理计划,我们的监督方法不仅更快,更容易训练,而且还取得了竞争成果,以结合车辆成本的实际方面。在彻底的受控实验中,我们将我们的方法与多种最先进的方法进行了比较,在这些方法中,我们表现出稳定的性能,同时使用较少的车辆并阐明了相关工作的实验协议中存在的不一致之处。
Learning to solve combinatorial optimization problems, such as the vehicle routing problem, offers great computational advantages over classical operations research solvers and heuristics. The recently developed deep reinforcement learning approaches either improve an initially given solution iteratively or sequentially construct a set of individual tours. However, most of the existing learning-based approaches are not able to work for a fixed number of vehicles and thus bypass the complex assignment problem of the customers onto an apriori given number of available vehicles. On the other hand, this makes them less suitable for real applications, as many logistic service providers rely on solutions provided for a specific bounded fleet size and cannot accommodate short term changes to the number of vehicles. In contrast we propose a powerful supervised deep learning framework that constructs a complete tour plan from scratch while respecting an apriori fixed number of available vehicles. In combination with an efficient post-processing scheme, our supervised approach is not only much faster and easier to train but also achieves competitive results that incorporate the practical aspect of vehicle costs. In thorough controlled experiments we compare our method to multiple state-of-the-art approaches where we demonstrate stable performance, while utilizing less vehicles and shed some light on existent inconsistencies in the experimentation protocols of the related work.
