A Scheme Library-Based Ant Colony Optimization with 2-Opt Local Search for Dynamic Traveling Salesman Problem

2023-02-26 10:17ChuanWangRuoyuZhuYiJiangWeiliLiuSangWoonJeonLinSunandHuaWang

Chuan Wang,Ruoyu Zhu,Yi Jiang,Weili Liu,Sang-Woon Jeon,Lin Sun and Hua Wang

1College of Software,Henan Normal University,Xinxiang,453007,China

2College of Computer and Information Engineering,Henan Normal University,Xinxiang,453007,China

3School of Computer Science and Engineering,South China University of Technology,Guangzhou,510006,China

4School of Computer Science,Guangdong Polytechnic Normal University,Guangzhou,510665,China

5Department of Military Information Engineering,Hanyang University,Ansan,15588,South Korea

6College of Engineering and Science,Institute for Sustainable Industries and Liveable Cities,Victoria University,Melbourne,VIC 8001,Australia

ABSTRACT The dynamic traveling salesman problem(DTSP)is significant in logistics distribution in real-world applications in smart cities,but it is uncertain and difficult to solve.This paper proposes a scheme library-based ant colony optimization(ACO)with a two-optimization(2-opt)strategy to solve the DTSP efficiently.The work is novel and contributes to three aspects:problem model,optimization framework,and algorithm design.Firstly,in the problem model,traditional DTSP models often consider the change of travel distance between two nodes over time,while this paper focuses on a special DTSP model in that the node locations change dynamically over time.Secondly,in the optimization framework,the ACO algorithm is carried out in an offline optimization and online application framework to efficiently reuse the historical information to help fast respond to the dynamic environment.The framework of offline optimization and online application is proposed due to the fact that the environmental change in DTSP is caused by the change of node location,and therefore the new environment is somehow similar to certain previous environments.This way,in the offline optimization,the solutions for possible environmental changes are optimized in advance,and are stored in a mode scheme library.In the online application,when an environmental change is detected,the candidate solutions stored in the mode scheme library are reused via ACO to improve search efficiency and reduce computational complexity.Thirdly,in the algorithm design,the ACO cooperates with the 2-opt strategy to enhance search efficiency.To evaluate the performance of ACO with 2-opt,we design two challenging DTSP cases with up to 200 and 1379 nodes and compare them with other ACO and genetic algorithms.The experimental results show that ACO with 2-opt can solve the DTSPs effectively.

KEYWORDS Dynamic traveling salesman problem(DTSP);offline optimization and online application;ant colony optimization(ACO);two-optimization(2-opt)strategy

1 Introduction

The traveling salesman problem(TSP)is one of the most fundamental and intensely studied NPcomplete combinatorial optimization problems [1,2].TSP is very important in operational research and theoretical computer science.It was initially proposed for transportation,such as vehicle routing problems[3]and ship routing and scheduling problems[4].The traditional TSP can be described as follows:given a series of nodes and the distance between each pair of nodes,the objective is to find the shortest path to visit every node once and return to the starting node.In recent years,researchers have developed many different versions of TSP,including multi-objective TSP [5,6],multi-salesman TSP[7,8],multi-solution TSP[9,10],and asymmetric TSP[11,12].However,many real-world optimization problems are dynamic and require optimization methods that can adapt to a dynamically changing environment.Therefore,there has been increasing interest in addressing dynamic versions of the TSP,i.e.,the dynamic TSP(DTSP),where the topology of nodes changes[13,14]or the weights between the nodes change[15].Because DTSP is much closer to real-world situations,these problems have a high practical value.

In short,DTSP can be regarded as a series of different static TSPs over time.Therefore,the methods used to solve TSP can also show their efficiency in solving DTSP.Since the search space of TSP and DTSP is the full arrangement of all vertices (i.e.,the nodes or the cities),a combinatorial explosion can occur with the increase in the number of vertices.Some existing studies used deterministic algorithms [16] to solve TSP,including the branch and cut method [17],linear programming method [18],and dynamic programming method [19].However,with the increase in the scale of the problem,the deterministic optimization algorithms are less satisfying to solve the TSP due to the increase in computational complexity.Therefore,in recent studies,scholars focused on using approximate algorithms[16]or evolutionary computation algorithms[20,21]to solve large-scale TSPs.Evolutionary computation algorithms have been one of the most efficient methods to solve NP-hard optimization problems[22].In recent decades,various types of evolutionary computation algorithms have been developed,mainly including genetic algorithm(GA)[23,24],ant colony optimization(ACO)[25,26],particle swarm optimization[27–29],and differential evolution[30–32].

Various ACO algorithms have been proposed for NP-hard optimization problems [33,34].For instance,ACO has shown its efficiency and reliability in many optimization problems,including software project scheduling and staffing [35,36],vehicle routing problems [37,38],capacitated arc routing problems [39],virtual machine placement [40],supply chain management [41],new energy vehicle scheduling [42],and cloud workflow scheduling [43].More importantly,ACO has shown its efficiency in solving static TSP[44,45].In recent studies,Zhang et al.[46]proposed an improved ACO which evaluated the population according to the membership degree and updated the pheromone in turn to achieve a good balance in solving speed and quality.Skinderowicz[47]presented a novel ACO variant,namely the Focused ACO,to solve TSP,significantly improving performance.Considering the effectiveness of the ACO algorithm and its improved versions in solving static TSP,ACO is also considered to be promising in solving DTSP.

Therefore,this paper focuses on proposing an efficient scheme library-based ACO algorithm with a two-optimization (2-opt) strategy to solve a novel and special DTSP model.The novelties and contributions of this paper mainly include three aspects in terms of problem model,optimization framework,and algorithm design.

Firstly,this paper proposes a novel dynamic change fashion in the problem model to build the DTSP.In most of the existing studies on DTSP,the dynamic factors mainly include the change of weights or the change of distance between two nodes.To solve this kind of DTSP,researchers focus on developing local search operators to optimize the route after the environmental changes quickly.Unlike these DTSP models,this paper proposes to solve a novel and special DTSP model,which mainly focuses on the environmental change of node positions.In this DTSP model,the environmental change is known in advance,and the new environment can be similar to certain previous environments.The new path should be quickly optimized after the nodes’locations change.

Second,in the optimization framework,this paper proposes an idea of offline optimization and online application framework to efficiently reuse the historical information to help fast respond to the dynamic environment.In offline optimization,several candidate solutions are optimized in advance and stored in a mode scheme library,while in the online application,the candidate solutions stored in the mode scheme library are reused to solve the DTSP in a new similar environment.In DTSP,since the environmental changes can be similar,i.e.,after the environmental change,the new environment can be similar to a certain previous environment,the idea of offline optimization and online application that reuses the previous solutions stored in the mode scheme library can help to solve DTSP and reduce the computational complexity effectively.In this paper,a path constructed by each ant is called a solution.

Third,this paper proposes combining ACO and the 2-opt strategy in the algorithm design to find the optimal route after an environmental change efficiently.Many existing studies show that the ACO is efficient in solving the TSP.In ACO,each ant can change the surrounding environment by releasing pheromones,perceiving the changes in the surrounding environment,and communicating indirectly through the environment,which is suitable for solving optimization problems in a dynamic environment.Moreover,this heuristic probability search method does not easily fall into local optimization and is shown to find the optimal global solution of TSP efficiently.Based on these advantages,ACO is adopted as the optimization method for solving the special DTSP.However,the uncertainty of DTSP makes it more complex than traditional TSP.It has been shown that the integration of local search operators can significantly improve the performance of ACO.Therefore,this paper embedded the 2-opt strategy into the ACO to solve this type of DTSP.

The remainder of this paper is organized as follows: Section 2 introduces some existing studies on DTSP and describes the special DTSP model.In Section 3,the overall framework and method for DTSP are elucidated.Subsequently,experiments are conducted,and the results are shown in Section 4.Finally,conclusions are drawn in Section 5.

2 DTSP

This section presents previous scholars’ research on TSP and DTSP and elaborates on the proposed DTSP model.

2.1 Related Work

In recent studies on DTSP,Siemi´nski et al.[48] verified the usefulness of parallel and adaptive ant colony communities for solving the DTSP.Ma et al.[49] considered the dynamic optimization problem as a combination of a series of static optimization problems,and an adaptive ACO is proposed to solve the DTSP.The previous research on ACO for DTSP can be divided into the following four categories,as shown in Fig.1: 1) the change of node position,including adding or deleting a node[50];2)the change in node weights,such as assigning a weight to each node.When a node task must be processed urgently,its weight will change[51];3)the change of the edge endpoint.For example,if there is no passage between two nodes,it can be seen that the node’s endpoint has changed[52];and 4)the change of edge weight between nodes.For example,the time of passing through an edge changes due to weather or other reasons[53].

Figure 1:An example of DTSP classifications

ACO algorithm is one of the most commonly used methods to solve the TSP.Liu[52]proposed a rank-based ACO to solve the environmental changes caused by traffic jams and road closures between cities,which sometimes forced salespeople to change routes.Eyckelhof et al.[53] considered several ways of adapting the pheromone matrix locally and globally and presented a new ant system approach to dealing with DTSP.Mavrovouniotis et al.[54]considered a novel variation of DTSP where traffic jams occurred in a cyclic pattern and proposed a hybrid method that combined memory and immigrant schemes with ACO to address this type of DTSP.Mavrovouniotis et al.[55]also proposed integrating the unstringing and stringing local search operator with the max-min ant system to address symmetric and asymmetric DTSPs.Kuo et al.[51] proposed an algorithm using an improved fuzzy ant colony system to solve the problem of how to determine the completion order of tasks.Montemanni et al.[56]proposed a solution strategy based on the ACS paradigm to solve the dynamic vehicle routing problem.

In addition to the ACO,researchers also use other evolutionary algorithms and heuristic algorithms to solve the DTSP.Meng et al.[57]proposed a variable neighborhood search algorithm with direct route encoding and random initialization to solve the dynamic colored TSP,in which the weights of edges changed over time.Wang et al.[58] presented an agent-based evolutionary search algorithm that uses the principle of a collaborative endeavor learning mechanism to solve DTSP.Khouadjia et al.[59] presented an adaptive particle swarm for solving the vehicle routing problem with dynamic requests,which may significantly decrease travel distances and is adaptive to a dynamic environment.Strąk et al.[60]presented a discrete particle swarm optimization(DPSO)algorithm with heterogeneous (non-uniform) parameter values for solving the DTSP.Sabar et al.[61] proposed an effective population-based approach that combined a local search algorithm with various evolutionary operators (crossover and mutation) in an adaptive manner to address the dynamic vehicle routing problem.A heuristic method and a predictive technique were combined to solve DTSP in[62],which built a GA that feeds on Newton’s motion equation to show how route optimization can be improved when targets are constantly moving.

According to the survey,previous studies on DTSP mainly focused on improving algorithms and developing local search operators to address the difficulties of real-time response to environmental changes and route optimization caused by the addition of dynamic factors.Unlike the previous studies,this paper focuses not only on the improvement of the algorithm but also on the development of the DTSP model and the optimization framework.

2.2 Problem Definition

The TSP is one of the most popular and well-researched NP-hard combinatorial optimization problems.In most real-world applications,the TSP exists in a dynamic environment rather than a static one.The TSP in a dynamic environment,called DTSP,is more challenging and uncertain.As the environment changes over time,the optimal solution of DTSP will change,and the problem’s search space(called the landscape)will also change,which brings challenges.To solve DTSP efficiently,the algorithm is required to detect the change of optimization problem promptly,respond to the change quickly,and find a new optimal solution efficiently.However,it is difficult for classic evolutionary algorithms to satisfy these conditions,which leads to challenges in solving DTSP.

This paper designs a new DTSP model,where the environmental change is the change of the node position,and the environment is regularly changed.This DTSP model considers two conditions in real-world applications.First,the position of a node will change due to road or traffic problems.This change is temporal,and only one node will change in each period (a period corresponds to an environmental change).Second,since traffic jams are usually regular,the environmental changes can also be regular.Therefore,some new environments can be similar to certain previous environments.

Mathematically,the model of DTSP in this paper can be formulated as follows: Given a list of nodes,a salesman must traverse each node once and only once to build a Hamilton path.First,suppose the road map is an undirected graphG=(V,E),whereV={0,1,...,N−1} is the set of nodes(denoted by the node indices),andNis the number of nodes.The node with serial number 0 indicates the beginning and ending node of the traveling salesman;other nodes in setVindicate the nodes that the salesman must visit.E={eij|i,j∈V,i/=j}is the edge set,indicating the road/edge between the nodesiandj.Each edgeeijhas a weight value,denoted asdij,which measures the distance between the two nodesiandj.In the solution of the DTSP,the decision variablesxijandyiare defined as follows:

The objective of DTSP is to find the shortest Hamilton path when the location of a node changes over time.The Hamilton path can be formulated as permutationΦ(denoted by the node indices).Then,DTSP finds the shortest length in all feasible permutations.More precisely,this paper formulates it as follows:

The constraints for DTSP are formulated as follows:

Constraint I:All nodes are strictly visited only once.

Constraint II:Each edge of the solution has only one starting node connected to it.

wherei=0,1,...,N−1.

Constraint III:Each edge of the solution has only one destination node connected to it.

wherei=0,1,...,N−1.

Constraint IV:Solutions constituting incomplete routes are eliminated.

whereSis the branch elimination constraint.That is,the incomplete route is eliminated.

In this DTSP model,the environmental change is the change of node position.When the location of a node changes in a new environment,the path must be re-optimized.Note that,in this DTSP model,the starting node is fixed,and its position remains unchanged.When a new environment starts,the position of a node will change.The location change rule is as follows:

wherep’(x’,y’)is the position updated after a change,p0(x0,y0)is the position of the original location of the changing node,andΔpis the amplitude of position change,which is set to a random integer change(add or subtract)in the range of[5,10].

The objective of DTSP is to find the optimal Hamilton path of the salesman to minimize the total distance after the environmental change.This type of DTSP widely exists in daily life.For example,in logistics distribution,the target location may change in some real-world scenarios.After the position of a node is changed,the optimal solution in the previous environment may not be the optimal solution in the current environment.Thus,the algorithm is required to quickly find an optimal path after the change of node position.Therefore,research on this type of problem is of great practical significance.The problem will become more complex by changing some variables,such as increasing the number of salesmen or the number of nodes with changed locations.

3 Scheme Library-Based ACO with 2-Opt Algorithm

3.1 Scheme Library-Based Offline Optimization and Online Application Strategy

Different from existing studies,in the proposed DTSP model,the environmental changes can be regular.That is,the new environment can be similar to some previous environments.To enhance the performance according to this property,this paper proposes an interesting and effective idea of offline optimization and online application strategy,which stores the optimized solutions in the scheme library and reuses the solutions after the environmental change.When users encounter similar environmental changes,they can directly call the corresponding scheme stored in the scheme library for online application.If a change in the scheme library requires no optimization scheme,the proposed ACO with 2-opt algorithm is called for optimization and placing the optimized solution into the scheme library.Herein,this paper chooses ACO as the optimization algorithm.Many researchers use ACO because of its flexibility and strong adaptability in solving TSP and DTSP.Furthermore,this paper integrates the 2-opt strategy into ACO,dramatically improving local search ability[63,64].

The offline optimization and online application strategy flowchart is shown in Fig.2.The left frame shows the process of offline optimization,while the right frame shows the online application.First,a mode scheme library is created to store the already optimized solutions(i.e.,mode schemes)in the previous environments.Note that the size of the mode scheme library is unlimited,so all the solutions corresponding to all modes are stored in the mode scheme library.Each mode represents an environment under the influence of different dynamic factors.Second,once the environmental change is detected,the online application process is carried out.That is,after the environmental change is detected if the new environment is similar to a certain previous environment,the previous solution(i.e.,mode) that corresponds to a similar environment is directly used in this new environment.If the solution that corresponds to the new environment is found,the process of utilizing mode scheme library is finished,i.e.,finish the mode.Otherwise,the offline optimization strategy is carried out,where the mode optimization algorithm(i.e.,ACO with 2-opt)is adopted to find the optimal solution for the current environment,and the optimal solution is added to the mode scheme library.The above operations are repeated until the termination conditions are met.

Figure 2:The overall framework for offline optimization and online application strategy

3.2 ACO

ACO[65]is a stochastic search algorithm to simulate the foraging process of real ants in nature.ACO is a swarm intelligence algorithm with the advantages of robustness,global search,parallel distribution calculation,and easy combination with other methods.In ACO,the principle of finding the shortest path to the food source is described as follows:each ant releases pheromones on the way to the food source.Over time,the pheromones on the longer path will not accumulate because of the long passing of time,while those on the shorter path will easily accumulate because of the short passing of time.Therefore,more pheromones can be left per unit of time on the shorter path.Other ants choose the path with the strongest pheromone by judging the concentration of“pheromone”,which causes them to focus more on the shorter path.Through this continuous positive feedback,the entire ant colony will eventually choose the shortest path to find food.Due to its good results,ACO is widely used to solve all combinatorial optimization problems,especially TSP[66,67].This paper adopts the elitist ant system(EAS)to solve DTSP.The two main steps in the process are path construction and pheromone update.

3.2.1 Path Construction

In the beginning,each ant selects a node as its starting node (In this paper,the starting node is set as node 0)and maintains a visited nodes sequence to store the visited nodes.In each step of path construction,the ant selects the next node to visit.Specifically,if the current node of antkisi,the ant chooses the next nodejin its path from the remaining nodes based on the roulette wheel selection,with the selection probability of each node computed as follows:

whereJk(i)represents the set of nodes that can be reached directly from nodei,and in order to avoid selecting repetitive nodes,a tabu table is set up in the algorithm to store the nodes visited by ants.Therefore,the set of nodesJk(i)also should not be in the sequence of the visited nodes.η(i,j)is heuristic information calculated byη(i,j)=1/dij.τ(i,j)represents the amount of pheromone on edgeeij.αand;are two user-defined parameters.αis the parameter of pheromone importance,βis the expectation factor.

3.2.2 Pheromone Updating

In ACO,the pheromone is updated after all ants complete the travel operation.As shown in Eq.(9),pheromones between nodes are the most important factor affecting the probability ofpk.The pheromone update includes three parts.First,after each ant finishes its travel,the pheromones on all paths evaporate,which prevents the pheromone from accumulating with the increase in iteration times.Specifically,this paper multiplies the pheromones on all edges by a constant less than 1(i.e.,1−ρ).Second,all ants release pheromones to the edge of their paths according to the path length.Thirdly,the EAS strengthens the pheromone update on the optimal path.That is,after the pheromone update,the ant with the best pathRbadds additional pheromones to each edge of the pathRb.These additional pheromones can make the ants search faster and converge more correctly.Based on these three parts,the update of the pheromone quantityτ(i,j)on edge between nodesiandjis shown as the following formula:

wheremis the number of ants.The greater the number of ants is,the more accurate the optimal solution will be,but many repeated solutions will be generated.As the algorithm approaches the optimal value,the positive feedback of information will be reduced,and a considerable amount of repeated work will consume resources and increase the time complexity.ρis the evaporation rate of pheromone.Whenρis too small,there are too many pheromones left on each path,resulting in an invalid path continuing to be searched,affecting the algorithm’s convergence speed.Whenρis too large,although an invalid path can be excluded from the search,it cannot guarantee that the effective path will also be abandoned,affecting the search for the optimal value.Δτk(i,j)is the amount of pheromone released by thekth ant on the edge it passes,which is equal to the reciprocal path length of antkin this round.Lkrepresents the path length,which is the sum of the lengths of all edges inRk.Parameterwis the edge weight value,andLbis the optimal path length since the algorithm’s beginning.

3.3 Two-Optimization(2-opt)

The 2-opt operation,first proposed by Croes et al.[68],is a widespread and straightforward local search method that has been widely used to cooperate with evolutionary algorithms to improve solution quality[69].Herein,this paper combines the 2-opt operation with ACO to enhance efficiency.In each generation,the 2-opt strategy is applied to the edges in each solution (i.e.,each ant) to generate new solutions by reserving the directions of several consecutive edges.First,the 2-opt strategy randomly selects two different edges and deletes these two edges.After the deleting operation,two edge sequences are obtained.Then,the edge sequence between these two edges is reversed,and this reversed edge sequence is re-connected to the other edge sequence to obtain a new solution.Finally,the better one(i.e.,the solution with a shorter path)between the new solution and the original solution is reserved.Specifically,the route in Fig.3 serves as an example.The original solution is “1–2–3–7–6–5–4–8–1”.Suppose edges 3–7 and 4–8 are the selected edges in 2–opt.In the original solution,the edge sequence between edge 3–7 and edge 4–8 is“7–6–5–4”,and the other edge sequence is“8–1–2–3”.After the reverse operation in the 2–opt strategy,the edge sequence between edge 3–7 and edge 4–8 is changed to “4–5–6–7”.Re-connecting edge sequences “8–1–2–3”and “4–5–6–7”,a new solution“1–2–3–4–5–6–7–8–1”is generated.Obviously,the path length of the new solution is shorter than that of the original solution.Thus the original solution is replaced by the new solution.

Algorithm 1:The pseudocode of offline optimization by ACO with 2-opt Input:Maximum number of iterations T,Number of nodes N.Begin 1:For each edge Do 2:Initialize pheromone τ0;3:End For;4:count=0;5:While(count

Based on the above description of ACO and 2-opt strategy,the pseudocode of the proposed ACO with 2-opt algorithm is given in Algorithm 1.At the beginning of the algorithm,initialize the ants and the pheromone on each edge.The starting node(i.e.,the first node of each path)of each ant is set as node 0.Then,the roulette selection method selects the next visiting node for each ant according to the probability calculated by Eq.(9).After that,to help the algorithm efficiently find the optimal solution,the 2-opt strategy is performed on each ant to improve the quality of the constructed path.The fitness evaluation is carried out in line 13,which calculates the path length constructed by each ant.Then,for each edge,the pheromone quantity is updated according to Eq.(10).Finally,if the termination condition of the algorithm is met(i.e.,reach the maximum number of iterations 5000),the algorithm is finished,and the best result is retained.

Figure 3:An example of the 2-opt operation

4 Experimental Studies

4.1 Experimental Setting

To evaluate the performance of ACO with 2-opt in the special DTSP,this paper set the number of modes(i.e.,number of environmental changes)to eight.The DTSP benchmark instance kroA200 is adopted as the benchmark DTSP from the classic TSP library(TSPLIB)[70].In this benchmark,there are 200 cities(i.e.,nodes).And set the starting node as the node with serial number 0,and the eight modes correspond to the location changes of eight nodes.This paper assumes that the indexes of these changed nodes are 100,140,120,160,40,80,20,and 60.Besides,to further verify the robustness of the ACO with 2-opt,this paper also tests it on a large-scale data set nrw1379[70].In this benchmark,there are 1379 cities(i.e.,nodes).The parameters of the ACO algorithm are set to typical values,as shown in Table 1.The experimental results are obtained from ten independent execution of each algorithm and select one set of results to draw roadmaps.Finally,it deserves to be noted that all algorithms are implemented in the C++ language and executed on a PC with Intel(R) Core (TM) i5-10210U 1.60 GHz CPUs,16 GB memory,and a 64-bit Ubuntu 12.04 LTS system.

Table 1: Parameter settings

4.2 Experimental Results

In the experiments,this paper defines eight types of environmental changes and denotes the eight periods corresponding to eight different environments ast1–t8.In each period,the position of a predefined node is changed.Specifically,the eight indexes of the changed node corresponding to the eight periodst1–t8are 100,140,120,160,40,80,20,and 60 in the instance kroA200.For example,when the period reachest5,the location of node 40 will change.Besides,the eight indexes of the changed node corresponding to the eight periodst1–t8are 900,150,450,300,1050,600,750,and 1200 in the instance nrw1379.These types of environmental changes cyclically occur until the algorithm meets the termination condition.

Firstly,to show the effectiveness and efficiency of the ACO with 2-opt in solving the DTSP proposed in this paper,this paper conduct the comparison between the proposed ACO with 2-opt,ACO,and GA on the DTSP model.The results with respect to the best value and the mean value on kroA200 and nrw1379 are shown in Tables 2 and 3,respectively.In the table,Change-Index represents the index of the node whose position is changed in the corresponding environment.The best value represents the minimum total path length in the 10 independent executions,and the mean value represents the average path length over the 10 executions.In addition,in this table,the best results between the algorithms with respect to the best value are highlighted in boldface,while the best results in terms of the mean value are shaded.As we can see,considering the best value and mean value,the performance of the ACO with 2-opt is superior to that of the compared algorithms in solving DTSP.In most cases with different indexes of changed nodes,the proposed ACO with 2-opt shows the best performance than the compared algorithms.Therefore,our proposed ACO with 2-opt effectively and efficiently solves DTSP.

Table 2: Comparison results between GA,ACO,and ACO with 2-opt on kroA200

Table 3: Comparison results between GA,ACO,and ACO with 2-opt on nrw1379

Secondly,to show the effect of the local search operation 2-opt,this paper conducts a comparison between the proposed ACO with 2-opt and the ACO without 2-opt,and the results of the comparison are also shown in Tables 2 and 3.From Tables 2 and 3,the following conclusions can be drawn: 1)Concerning the results with respect to the best value in the 10 independent runs,the ACO with 2-opt achieves the best performance on all the 8 modes in kroA200 and 5 modes in nrw1379,while the ACO without 2-opt achieves the best performance on no modes in kroA200 and 3 modes in nrw1379.2)From the mean value over the 10 independent runs,ACO with 2-opt achieves the best results on all the 8 modes in both kroA200 and nrw1379,while ACO without 2-opt does not obtain the best results on any modes.3)With the different indexes of changed nodes,regardless of the perspective of the best value or the perspective of the mean value,the superiority of ACO with 2-opt over ACO without 2-opt becomes increasingly notable.Therefore,the 2-opt is necessary for ACO with 2-opt algorithm.

The route optimization diagrams of the eight modes on kroA200 and nrw1379 are shown in Figs.4 and 5,respectively.The pentagram red dot indicates the starting node,which is labeled node 0.The round green dot indicates the node whose location has changed.(a) shows the original distribution of nodes in the kroA200 and nrw1379 test sets.The figures from (b) to (i) provide 8 modes in the scheme library corresponding to the change in node location in each period,which is the result of 5000 iterations.Figs.4 and 5 show the roadmap for online applications.

Figure 4: Node distribution position map and route scheme maps of instance kroA200.(a) Node distribution position map.(b) Route scheme map in environment t1.(c) t2.(d) t3.(e) t4.(f) t5.(g) t6.(h)t7.(i)t8

Figure 5: (Continued)

Figure 5: Node distribution position map and route scheme maps of instance nrw1379.(a) Node distribution position map.(b) Route scheme map in environment t1.(c) t2.(d) t3.(e) t4.(f) t5.(g) t6.(h)t7.(i)t8

5 Conclusion

In this paper,a new DTSP model is proposed and solved,focusing on the change in node positions over time.In this type of DTSP,a node will regularly change its location due to traffic or other factors during a certain period.This change occurs randomly.Second,this paper proposes an idea of offline optimization and online application.Several solutions are optimized in advance and stored in the scheme library in offline optimization.In the online application,when the environment has changed,and some roads are not feasible,this paper uses the solutions in the scheme library to provide optimal routes to the new similar environment.Third,to effectively solve DTSP via the development of the algorithm,this paper proposes an ACO with a 2-opt strategy.Because ACO is shown to be one of the most effective algorithms in solving TSP,ACO is adopted as the basic optimization method in the special DTSP model and carried out experiments to show its effectiveness of ACO.Our goal is that when the node positions change during this period,the algorithm can provide a new route in advance and minimize the total path length.The experimental results on the DTSP show that this strategy greatly improves the efficiency of the ACO.

Based on this paper,additional versions of DTSP can be extended for future work,and the solution will be more complex.Determining methods to balance the algorithm’s effectiveness and efficiency is a worthy research topic.

Funding Statement:This work was supported in part by the National Research Foundation of Korea(NRF-2021H1D3A2A01082705).

Conflicts of Interest:The authors declare that they have no conflicts of interest to report regarding the present study.