Optimizing UAV Path Planning in Maritime Emergency Transportation: A Novel Multi-Strategy White Shark Optimizer

: Maritime UAV path planning is a key link in realizing the intelligence of maritime emergency transportation, providing key support for fast and flexible maritime accident disposal and emergency material supply. However, most of the current UAV path planning methods are designed for land environments and lack the ability to cope with complex marine environments. In order to achieve effective path planning for UAV in marine environments, this paper proposes a Directional Drive-Rotation Invariant Quadratic Interpolation White Shark Optimization algorithm (DD-RQIWSO). First, the directional guidance of speed is realized through a directional update strategy based on the fitness value ordering, which improves the speed of individuals approaching the optimal solution. Second, a rotation-invariant update mechanism based on hyperspheres is added to overcome the tracking pause phenomenon in WSO. In addition, the quadratic interpolation strategy is added to enhance the utilization of local information by the algorithm. Then, a wind simulation environment based on the Lamb–Oseen vortex model was constructed to better simulate the real scenario. Finally, DD-RQIWSO was subjected to a series of tests in 2D and 3D scenarios, respectively. The results show that DD-RQIWSO is able to realize path planning under wind environments more accurately and stably.


Introduction
In contemporary times, the application scope of unmanned aerial vehicles (UAVs), which are a kind of controllable vehicle that performs various tasks without human assistance [1], is progressively diversifying, exhibiting significant potential and advantages across various domains such as transportation [2] and construction [3].With continuous technological advancements, UAVs have emerged as versatile tools for civilian and commercial applications [4].The marine environment is complex and varied, and compared to the study of the land environment, the marine scene has a rich and diverse range of research scenarios [5][6][7][8], such as marine transportation [9], marine monitoring [10] and other fields [11,12].Among them, the research on marine transportation has become one of the hot spots of marine research.In maritime environments, there may arise situations necessitating urgent material supply, for which a device capable of swift response and delivery is required.Compared to traditional lifeboats and life rafts, UAVs offer lower time and economic costs and broader transportation coverage, and thus are more conducive to the transportation of materials in maritime emergencies.
Path planning is a critical issue within the field of automation [13], and for UAVs, solving their path planning problems holds considerable value in their applications.Path planning is inherently a combinatorial optimization problem, tasked with generating a geometric path without any specified time constraints.The objective of path planning is to devise a feasible route that ensures safe arrival at the destination based on known environmental information.Its definition is succinct: "find a collision-free motion between an initial (start) and a final configuration (goal) within a specified environment" [13].
Research shows that the path planning problem is an NP-hard problem [14], which means this problem entails large search spaces, making it exceedingly difficult to find the optimum.Solutions to this problem often necessitate exhaustive exploration of numerous possibilities, without any efficient method guaranteeing the discovery of the optimal solution within polynomial time.Consequently, while it may be straightforward to verify whether a given solution is correct, finding a solution itself is highly challenging [15].
Various algorithms, such as A* [16], RRT [17] and PRM [18], are proposed to solve the path planning problem and obtain a feasible solution.As the complexity of the problem to be solved increases, many such improved algorithms have been proposed and applied.For instance, the LDA* algorithm is proposed to finish the path planning mission in a game by Bayili et al. [19] and Moon et al. use a modified RRT algorithm to accomplish the highspeed navigation for mobile robots [20].However, traditional path planning algorithms typically depend on pre-acquiring and memorizing the global environment graph.These cause an inherent disadvantage of this kind of algorithm.It is time-consuming to use such algorithms for path planning in intricate environments.
Swarm intelligence algorithms mimic swarm behaviors observed in nature to develop intelligent algorithms.Guided by these behaviors, they strategically employ random operators to optimize the trial-and-error process, replicating the exploration of solution spaces based on swarm wisdom.Different from common meta-heuristic algorithms, individuals in swarm intelligence algorithms exhibit independent intelligence and behavior, while enhanced information exchange among the population bolsters the optimization capability.These characteristics empower swarm intelligence algorithms to effectively address complex optimization problems.For example, El-Kenawy et al. propose the hybrid gray wolf optimizer (GWO) and particle swarm optimization (PSO) algorithms to select the best set of features [21].Mei et al. used an improved meta-heuristic algorithm to enhance the accuracy of target localization in underwater IoT tasks [22].Mohamed Abdel-Basset modifies the whale optimization algorithm (WOA) to tackle the issue of multi-level thresholding color image segmentation [23].Given the excellent properties of swarm intelligence algorithms, there are numerous studies that apply swarm intelligence algorithms to path planning [24,25].The utilization of a modified particle swarm optimization in the path planning of robots has been investigated by Li et al. [26], demonstrating its effectiveness in addressing the path-planning problem.Zhang et al. applied particle swarm optimization to multi-objective path planning, showcasing its potential in the path-planning problem.Zhu proposed an enhanced meta-heuristic algorithm to address the unmanned combat aerial vehicle (UCAV) path planning [27], and Niu et al. solved three-dimensional UCAV path planning in maritime environment with a modified artificial ecosystem optimizer [28], illustrating the feasibility of applying swarm intelligence algorithms to UAV path planning.
Path planning stands as a critical technology ensuring the safety of UAVs in maritime emergency transportation.In maritime environments, path planning for unmanned devices necessitates consideration of marine environmental factors, as well as fuel consumption and obstacle avoidance.For instance, during autonomous underwater vehicle (AUV) path planning in the seafloor environment, factors such as ocean currents and seabed topography are taken into account [29,30].While UAVs do not require accounting for ocean currents akin to AUV, the impact of marine winds on UAVs cannot be disregarded.
Although numerous swarm intelligence algorithms have been applied to path planning, as described in the "No Free Lunch (NFL)" theorem [31], no single algorithm can tackle all optimization issues.Consequently, exploring novel algorithms to tackle path planning challenges remains a valuable avenue of research, particularly given the increasing complexity of application scenarios and problems.In recent years, several advanced swarm intelligence algorithms have been proposed, including the mayfly algorithm (MA) [32], reptile search algorithm (RSA) [33], beluga whale optimization (BWO) [34] and white shark optimizer (WSO) [35], etc.Among these, WSO stands out as an exemplary swarm intelligence algorithm, which effectively balances the exploration and exploitation processes, and also has excellent performance in some high-dimensional complex test problems, which can effectively detect the search space and converge to the optimal solution to some extent [35].The existing literature has shown that the WSO algorithm can be applied to the path planning problem and has a great potential [36].What is more, the WSO algorithm has successfully addressed numerous different types of problems such as optimal power flow problems [37,38], cloud computing [39] and the performance of the design parameters of proton exchange membrane fuel cells (PEMFCs) [40].When integrated with other strategies, WSO demonstrates capability in resolving more intricate problems.For instance, Ravishankar et al. applied WSO combined with deep learning to solve the problem of UAVs communication and scene classification [41], and Houssein et al. use a modified WSO to solve engineering design and combinatorial problems [42].
The core concepts and inspiration behind the WSO are derived from the behavior of sharks to track prey.This creature possesses remarkable sensory abilities, encompassing their extraordinary auditory, visual and olfactory capabilities.The WSO's design incorporates a promising blend of exploratory and exploitative search strategies within its update mechanism, achieved through stochastic updates and alterations of solutions.This behavior is modeled in conjunction with the appropriately defined conditions of white shark locomotion and olfactory intensity to foster exploratory behavior in the initial iterations of WSO and exploitative search mechanisms in subsequent iterations.
However, the complexity of problems in the maritime environment, exacerbated by the influence of complex sea winds, imposes higher demands on optimization algorithms, necessitating an enhancement in the search capabilities of WSO populations.There are still some places where WSO can be improved.For instance, in guiding the velocity using individual recorded best values, WSO relies entirely on random individuals, which may not fully exploit the information from individual records, particularly for individuals with less favorable results that warrant stronger optimization behavior.During the fish schooling stage, if the white shark's visual abilities are insufficient to track the optimal fish, it remains stationary, resulting in missed opportunities to exploit the solution space.
In order to better address path planning problems for UAV maritime emergency transportation, this paper proposes an improved version of the WSO algorithm, named Directional Driven-Rotational Invariant Quadratic Interpolation White Shark Optimizer (DD-RQIWSO), which addresses the aforementioned issues.To accurately simulate the maritime environment, we augment the UAV environment model [43,44] used by Selcuk Aslan and Y. Niu with factors such as sea winds, thus rendering the environment model more representative of real transportation scenarios.The contributions of this paper can be summarized as follows: • Through the directed update strategy based on sorted cost, the velocity update process of WSO is modified to prevent the degradation phenomenon induced by random individuals, thereby achieving the rapid search capability of directed guidance algorithms;  The next part of this paper is structured as follows: The problem formulation of UAV path planning is demonstrated in Section 2. The basic WSO algorithm is displayed in Section 3. The description of the proposed DD-RQIWSO will be shown in Section 4. To inspect the performance of the modified WSO, a series of experimental results are shown in Section 5. Section 6 provides a conclusion of this work.

Problem Formulation
Path planning for maritime emergency transportation endeavors to derive an optimal or near-optimal flight route, mitigating risks posed by the intricate maritime environment, including strong electromagnetic interference and the presence of restricted areas of navigation or other areas the UAV cannot fly.Entrance into these areas by a UAV can lead to loss of communication or damage, thus necessitating the designation of no-fly zones within the environment.Fuel consumption emerges as a critical factor influencing UAV flight, primarily dictated by factors such as flight distance and flight smoothness.Wind at sea also impacts fuel consumption and harbors the potential to compromise UAV safety.Consequently, all threat areas and restricted areas of navigation are delineated as no-fly zones.UAVs are instructed to maintain a minimum altitude above sea level, adhere to minimum distance and corner constraints and leverage wind conditions to optimize flight efficiency and safety.

Modeling of UAV Path Planning
To address the path planning problem, a Cartesian coordinate system can be established and in determining the starting point (S) and the target point (T) of the UAVs in the space, the path of the UAV flight is determined by a specific number of key points in the space [44].Define ST as the line connecting S point and T point.To reduce computational complexity and the number of decision variables for the algorithm, coordinate transformation is applied to this space.Assuming n key points are uniformly distributed along ST, segmenting ST into n + 1 intervals, and the x'-axis is fixed to the projection of the Oxy-plane of the ST through the coordinate transformation, the key points are considered to be uniformly distributed in the transformed x'-axis in x component, then the coordinates of the key points in the direction of the x'-axis can be determined by the starting and target points as well as by the number of key points.After this transformation, the optimization algorithm only needs to locate the y' and z' components of n key points within the constraints to determine the key points' positions.Connecting all key points generates a path; this procedure is shown in Figure 1.The path planning problem in 2D space can be regarded as the result of projection of 3D space to the Oxy plane.
The purpose of coordinate transformation is to align ST with the Oxz plane, placing S on the z'-axis, which significantly reduces computational complexity.The transformation process is described mathematically as follows: where θ represents the rotation angle in the rotation matrix, (x, y, x) denotes the coordinates in the original space and (x ′ , y ′ , z ′ ) represents the coordinates in the transformed space; ∥ST∥ is the distance of S and T. For path planning on a two-dimensional plane, the above model is projected onto the Oxy plane, followed by the same steps for path planning in three-dimensional space.
where θ represents the rotation angle in the rotation matrix, ( , , )  x y x denotes the coor- dinates in the original space and ( ', ', ') x y z represents the coordinates in the transformed space; ST ‖ ‖ is the distance of S and T. For path planning on a two-dimensional plane, the above model is projected onto the Oxy plane, followed by the same steps for path planning in three-dimensional space.

Cost Function
To guide the algorithm in selecting the optimal path, we have devised a cost function to quantitatively assess the quality of the current path.The quality of the paths discovered by the algorithm can be evaluated by the cost function, and the path with lower cost will be chosen by the algorithm.To comprehensively evaluate path quality, we consider four aspects of path cost.Firstly, we need to measure the threat posed to the flight path ( k t ), which quantifies the impact of threat areas on UAV flight.This component of cost incentivizes the algorithm to generate paths that avoid threat areas as much as possible.Secondly, flight distance ( d ) is also a crucial consideration, as it strongly correlates with UAV fuel consumption.Shorter distances imply lower fuel consumption, which is not only environmentally desirable but also improves UAV efficiency.Additionally, sharp corners increase fuel consumption and diminish UAV endurance.Thus, generated paths should aim for smoothness ( s ), quantified by the cost of corners in the flight path.Lastly, the influence of wind ( v ) on UAV flight cannot be overlooked in maritime environment.Aligning UAV flights with the direction of the wind reduces damage and fuel consumption while enhancing endurance.The overall cost function is a linear combination of these four cost components, represented mathematically as follows: (2) Step1

Cost Function
To guide the algorithm in selecting the optimal path, we have devised a cost function to quantitatively assess the quality of the current path.The quality of the paths discovered by the algorithm can be evaluated by the cost function, and the path with lower cost will be chosen by the algorithm.To comprehensively evaluate path quality, we consider four aspects of path cost.Firstly, we need to measure the threat posed to the flight path (t k ), which quantifies the impact of threat areas on UAV flight.This component of cost incentivizes the algorithm to generate paths that avoid threat areas as much as possible.Secondly, flight distance (d) is also a crucial consideration, as it strongly correlates with UAV fuel consumption.Shorter distances imply lower fuel consumption, which is not only environmentally desirable but also improves UAV efficiency.Additionally, sharp corners increase fuel consumption and diminish UAV endurance.Thus, generated paths should aim for smoothness (s), quantified by the cost of corners in the flight path.Lastly, the influence of wind (v) on UAV flight cannot be overlooked in maritime environment.Aligning UAV flights with the direction of the wind reduces damage and fuel consumption while enhancing endurance.The overall cost function is a linear combination of these four cost components, represented mathematically as follows: where w 1 , w 2 , w 3 and w 4 represent the weights assigned to each factor in the composite cost, and C t , C d , C s and C v are the respective cost functions for threat, distance, smoothness and wind.

Threat Cost
The threat cost function (C t ) evaluates the threat posed to the UAV during its flight, which is based on the distance between five particular points in each segment and the center of the threat area.It is only calculated when the UAV enters a threat area; otherwise, it remains zero.When one of the segments enters the threat area, it is not enough to evaluate the threat level by only one point, and based on the former study [44], a more accurate threat level can be obtained by selecting five points for each segment of the path that enters the threat area to assess the threat level.The severity of threat (t k ) in each threat area is determined according to the threat area.The threat cost function is formulated as follows: where L i represents the length of each segment, d denotes the distance between five particular points in the segment and the center of the threat area, t k is used to denote the severity of the kth threat area, while M t represents the total number of threat areas.
Considering Figure 2, the crucial model in calculating the threat level is a segment of the path determined by the current key point to the preceding one, denoted as L i .A segment will be taken account only if this segment falls into the threat area.The total threat cost function is obtained by adding the threat degree of each segment, as shown in Equation ( 4).
The threat cost function ( t C ) evaluates the threat posed to the UAV during its flight, which is based on the distance between five particular points in each segment and the center of the threat area.It is only calculated when the UAV enters a threat area; otherwise, it remains zero.When one of the segments enters the threat area, it is not enough to evaluate the threat level by only one point, and based on the former study [44], a more accurate threat level can be obtained by selecting five points for each segment of the path that enters the threat area to assess the threat level.The severity of threat ( k t ) in each threat area is determined according to the threat area.The threat cost function is formulated as follows: where i L represents the length of each segment, d denotes the distance between five particular points in the segment and the center of the threat area, k t is used to denote the severity of the kth threat area, while t M represents the total number of threat areas.Con- sidering Figure 2, the crucial model in calculating the threat level is a segment of the path determined by the current key point to the preceding one, denoted as i L .A segment will be taken account only if this segment falls into the threat area.The total threat cost function is obtained by adding the threat degree of each segment, as shown in Equation (4).

Path Distance
The distance function d C represents the cost of UAV flight, computed by summing the Euclidean distances between adjacent key points in the flight path, as described below: Figure 2. Threat level calculation schematic.

Path Distance
The distance function C d represents the cost of UAV flight, computed by summing the Euclidean distances between adjacent key points in the flight path, as described below: where (x i ,y i ,z i ) represents the spatial coordinates of the current key point, and represents the spatial coordinates of the previous key point.After obtaining the length of each segment, they are summed to obtain the total distance cost.

Path Smooth Cost
The smoothness cost function C s evaluates the smoothness of the flight path, considering both the turning angle and climbing angle.It is modeled as Equation (7).
(1) Turning angle Vigorous cornering can lead to increased fuel consumption, so the turning angle of each segment does not exceed the maximum angle ∅ max .p j p j+1 denotes the direction vector projected on the Oxy plane of the jth path segment from the jth key point to the j + 1th key point and p j+1 p j+2 denotes the direction vector projected on the Oxy plane from the j + 1th key point to the j + 2-th key point of the jth segment.
∅ j = arctan ∥p j p j+1 ×p j+1 p j+2 ∥ p j p j+1 •p j+1 p j+2 (2) Climbing angle The climbing angle is subject to a threshold Ψ max , ensuring the UAV's turning angle does not exceed this threshold during flight to avoid excessive fuel consumption.The climbing angle is obtained from the third-dimensional coordinates and is represented by z j and z j+1 in Equation (9).p j p j+1 denotes the direction vector of the j-th path segment from the j-th key point to the j + 1th key point.

Wind Cost
The wind cost function F v assesses the impact of wind on the UAV's flight path, leveraging recorded wind intensity data.Inspired by the application of Lamb-based gas dynamics [45], this study utilizes the Lamb-Oseen vortex model to simulate vortex gas flow scenarios in an ocean scenario.The math formulation of Lamb-Oseen vortex is modeled as Equation (10). Figure 3 shows the wind environment, and includes the two-dimensional and three-dimensional wind field.We perform slicing operations on the wind along the z-axis and calculate the cost of wind in each slicing plane.The calculation method [46] is illustrated in Figure 4, where ( ) X t represents the projection of the path direction vector on the plane and ( ) V r repre- sents the vector of wind.By measuring the angle between ( ) X t and ( ) V r , we can project the wind vector onto the path direction and calculate the vertical and horizontal components of the wind using Equation (12).The formula projects the vector intensity at a point in the forward direction, specifically as ( )* V r cosθ .When θ is less than 90, the result is negative, indicating a downwind direction, and if not, indicating against the breeze.Additionally, the formula accounts for the direction perpendicular to the forward direction, which is represented as ( )* V r sinθ .This direction aims to evaluate the energy consump- We perform slicing operations on the wind along the z-axis and calculate the cost of wind in each slicing plane.The calculation method [46] is illustrated in Figure 4, where X(t) represents the projection of the path direction vector on the plane and V(r) represents the vector of wind.By measuring the angle between X(t) and V(r), we can project the wind vector onto the path direction and calculate the vertical and horizontal components of the wind using Equation (12).The formula projects the vector intensity at a point in the forward direction, specifically as V(r) * cosθ.When θ is less than 90, the result is negative, indicating a downwind direction, and if not, indicating against the breeze.Additionally, the formula accounts for the direction perpendicular to the forward direction, which is represented as V(r) * sinθ.This direction aims to evaluate the energy consumption bias caused by the wind, which requires energy-consuming corrections.To reduce computational complexity, we conduct calculations at five uniformly distributed locations between adjacent key points.The values obtained at all sampling points along the total path are aggregated as the result.However, as negative values may arise from this process, we normalize the results using Equation ( 13) to map the result to an appropriate range, thereby yielding the final cost function.In Equation ( 13), a is a parameter that regulates the range of the interval, b is a parameter representing the slope of this function.For the two-dimensional case, the same operations are performed on the Oxy plane to obtain two dimensional C v .11)

White Shark Optimizer
The white shark optimizer (WSO) [35], inspired by the predatory behavior of sharks in nature, is a swarm intelligence optimization algorithm invented by Malik Braiket et al.The optimization algorithm can be divided into two main stages.The first phase involves approaching the optimal prey, where each white shark autonomously seeks out potential prey within the search space or further explores promising prey locations based on detected signals.The second phase entails approaching the optimal white shark, resembling schooling behavior.

Initialization of WSO
WSO is a population-based optimization algorithm, where each shark corresponds to a solution, and the number of sharks in the population equals the number of candidate solutions.The position of a shark in the search space represents a set of decision-variable values that solve the optimization problem.The population of WSO is initialized by randomly generating a set of solutions in the solution space.All solutions of the algorithm can be represented by an * n d , where n represents the number of sharks in the population, d represents the dimensionality, i.e., the number of decision variables in a solution, and the data structure is represented as follows: Each decision variable is initialized by the following equation.

White Shark Optimizer
The white shark optimizer (WSO) [35], inspired by the predatory behavior of sharks in nature, is a swarm intelligence optimization algorithm invented by Malik Braiket et al.The optimization algorithm can be divided into two main stages.The first phase involves approaching the optimal prey, where each white shark autonomously seeks out potential prey within the search space or further explores promising prey locations based on detected signals.The second phase entails approaching the optimal white shark, resembling schooling behavior.

Initialization of WSO
WSO is a population-based optimization algorithm, where each shark corresponds to a solution, and the number of sharks in the population equals the number of candidate solutions.The position of a shark in the search space represents a set of decision-variable values that solve the optimization problem.The population of WSO is initialized by randomly generating a set of solutions in the solution space.All solutions of the algorithm can be represented by an n * d, where n represents the number of sharks in the population, d represents the dimensionality, i.e., the number of decision variables in a solution, and the data structure is represented as follows: Each decision variable is initialized by the following equation.
where ω i j represents the decision variable value of the jth dimension in the ith solution, u j represents the upper bound of the jth decision variable, l j represents the lower bound of the jth decision variable and r is a random number uniformly distributed in the range [0, 1].

Movement Speed towards Prey
The speed at which white sharks move towards prey is updated based on the globally optimal prey position detected by the white shark population and the locally optimal prey position detected by a randomly selected white shark in the population.The abstraction of the velocity of the white shark swimming towards its prey is shown in Equation (16).
where c 1 and c 2 are random numbers in the interval [0,1].v i k denotes the current velocity and µ is defined as shown in Equation (18).w gbest k is the globally optimal position obtained at the kth iteration.w v i k best denotes the optimal prey position randomly selected, determined by the Equation (17).
µ can control the probing step size, and adjusting τ in µ can control its ability to escape from local optima.According to relevant studies, τ is set to 4.125 [35].

Movement towards Optimal Prey
White sharks can freely explore the search space or approach optimal prey locations based on known information.This process is mathematically modeled as follows: where w i k represents the position vector of the ith shark in the kth iteration and u and l are the upper and lower boundary vectors of the decision variables, respectively.While w o is the logical operator defined by Equation ( 26), ⊕ means the different or, and ¬ stands for the bitwise not operation.The role of the logical values is defined by Equations ( 24)- (26).The effect of w o in Equation ( 21) is to reset the data when the range of the decision variable occurs out-of-bounds.
In swimming to the optimal prey, the frequency of undulating motion f is determined by Equation (23).f min and f max represent the bound of undulation frequency of shark movement.In the context of this paper, f min and f max are 0.07 and 0.75 [35], respectively.mv is the main parameter used to balance exploitation and exploration behaviors.

Movement towards the Best White Shark
During the hunting process, white sharks cloud swim towards or beyond the optimal white shark.The process is mathematically modeled as follows: w i k+1 represents the relative position of the i-th shark when tracking the optimal shark.r 1 , r 2 and r 3 represent random numbers in the range [0,1].The sgn(r 2 − 0.5) controls the direction of shark movement, taking values of −1 or 1, indicating whether the current shark explores between the optimal shark or leaps over it.The exploration range does not exceed twice the distance between two sharks.Then, D w represents the distance from the i-th shark to the optimal shark, where rand denotes a random number on [0,1].
The strength of the auditory perception of white sharks determines the ability of sharks to follow the best shark.If they cannot track the best shark, they will remain where they are.s s refers to the strength of senses and it decides if the shark will follow the best shark.s s is defined by Equation ( 29), where a 2 balances the exploitation and exploration behaviors when executing the fish school strategy and is usually set as a constant 0.0005 [35].

Fish School Behavior
The next-generation shark position is obtained through the relative position when tracking the optimal shark and the position of the i-th shark.It synthesizes the relationship between the optimal shark and the current shark to gather at the position most likely to discover high-quality prey.This process can be mathematically described as follows: where rand is a random number on the interval [0, 1], w i k and w i k represent the near optimal position and position of i-th shark, respectively.Equation (30) indicates that sharks can adjust their position based on the position of the optimal shark closest to the prey.Eventually, the shark will be located in a position close to the optimal prey.Fish school behavior is defined based on the collective action of sharks during hunting, which can provide more possibilities for detecting optimal solutions.

Proposed DD-RQIWSO Algorithm for Path Planning
Due to the complexity of route planning problems in maritime transportation, three novel strategies are incorporated into the original WSO to form the proposed DD-RQIWSO algorithm, which belongs to the category of swarm intelligence algorithms.This paper employs the proposed DD-RQIWSO algorithm to solve the shortest path problem in maritime emergency transportation.Sections 4.1-4.3introduce the three strategies, respectively.Section 4.4 presents the overall framework of the DD-RQIWSO algorithm for solving route planning problems.Section 4.5 analyzes the main parameters of the algorithm and Section 4.6 provides an analysis of the time complexity of DD-RQIWSO.

Directional Update Strategy
Degradation refers to the phenomenon where, during velocity update, the locally optimal value obtained by randomly selecting population individuals causes poorer individuals to fail to interact with superior individuals, resulting in stagnation and reduced exploration efficiency.To address this issue, a human learning mechanism is introduced.This mechanism is inspired by the behavior of mutual learning in human society, where individuals with weaker learning abilities tend to seek advice from superior individuals to obtain a better understanding, thereby enhancing the overall learning ability of the population.In this learning behavior, individuals with weaker learning abilities not only learn from the best individuals but also learn from inferior individuals [47].
Based on this concept, within the white shark swarm, there is a phenomenon of mutual learning.They can learn from their peers who hold advantageous positions and adjust their forward velocity by observing the exceptional individuals.Referring to the conclusion of article [47], we can categorize individuals into elite and inferior groups based on a specific population ratio δ and set δ = 5%.In simple terms, if the population size is 100, the first 5 and the last 5 particles will be chosen as the elite particle group and the underperforming particle group, respectively.The screening process is as follows: Firstly, all individuals are ranked according to their performance.Then the top δ of white shark individuals will be selected for the elite white shark group, and the bottom three white shark individuals will be assigned to the underperforming individuals.The specific updating strategy for the velocity involves updating the speed of the underperforming individuals randomly towards the mean value of the three white shark individuals' positions in the elite group with the best performance.The velocity updating formula is as follows: where c 3 and c 4 are random numbers in the interval [0,1], p 1 and p 2 are the parameters generated by Equations ( 19) and ( 20), P g k denotes the global optimum of the kth generation and mP k is the average of three randomly selected white sharks in the top 5% optimal white sharks, and if there are fewer than three white sharks in the top ε of the population, the average of the top three white sharks will be selected as mP k .

Rotational Invariant Strategy
In the fish school strategy, if the shark cannot follow the best shark, it will stay in its position.This was made into a tracking pause.This behavior will waste the chance of exploitation.In fish schooling, there is information of prey that can be used, which is acquired by itself or neighbor sharks.And this information can be applied to the rotational invariant method [48] to improve the exploitation of white sharks.The strategy of rotation invariance is based on the hyperspheres in the search space, the crucial step of this strategy is calculating the center of the hypersphere.And we modify the original strategy to increase the probability of search.The principle of modified strategy to calculate the hypersphere is shown in Figure 5.

Directional Update Strategy
Degradation refers to the phenomenon where, during velocity update, the locally optimal value obtained by randomly selecting population individuals causes poorer individuals to fail to interact with superior individuals, resulting in stagnation and reduced exploration efficiency.To address this issue, a human learning mechanism is introduced.This mechanism is inspired by the behavior of mutual learning in human society, where individuals with weaker learning abilities tend to seek advice from superior individuals to obtain a better understanding, thereby enhancing the overall learning ability of the population.In this learning behavior, individuals with weaker learning abilities not only learn from the best individuals but also learn from inferior individuals [47].
Based on this concept, within the white shark swarm, there is a phenomenon of mutual learning.They can learn from their peers who hold advantageous positions and adjust their forward velocity by observing the exceptional individuals.Referring to the conclusion of article [47], we can categorize individuals into elite and inferior groups based on a specific population ratio δ and set δ = 5%.In simple terms, if the population size is 100, the first 5 and the last 5 particles will be chosen as the elite particle group and the underperforming particle group, respectively.The screening process is as follows: Firstly, all individuals are ranked according to their performance.Then the top δ of white shark individuals will be selected for the elite white shark group, and the bottom three white shark individuals will be assigned to the underperforming individuals.The specific updating strategy for the velocity involves updating the speed of the underperforming individuals randomly towards the mean value of the three white shark individuals' positions in the elite group with the best performance.The velocity updating formula is as follows: optimal white sharks, and if there are fewer than three white sharks in the top ε of the population, the average of the top three white sharks will be selected as k mP .

Rotational Invariant Strategy
In the fish school strategy, if the shark cannot follow the best shark, it will stay in its position.This was made into a tracking pause.This behavior will waste the chance of exploitation.In fish schooling, there is information of prey that can be used, which is acquired by itself or neighbor sharks.And this information can be applied to the rotational invariant method [48] to improve the exploitation of white sharks.The strategy of rotation invariance is based on the hyperspheres in the search space, the crucial step of this strategy is calculating the center of the hypersphere.And we modify the original strategy to increase the probability of search.The principle of modified strategy to calculate the hypersphere is shown in Figure 5.The center of the hypersphere is determined by three vectors, including P t nw , P t i and the current shark position X t i .Different from the original rotational invariant strategy, we use P t nw to replace the global optimal position.P t nw is the mean of the best prey location recorded by neighbor sharks, determined by Equation (32), P t i is the shark record of the best prey location it meets.p t i is determined by Equation (33), where c 5 denotes a fixed value of 0.1, and r 1 denotes a random number of [0,1].It implies a random distance in the certain direction determined by current and optimal position recorded by each shark; p t g is determined by Equation (34), where c 6 denotes a number with a fixed value of 0.1, and r 2 denotes a random number at [0, 1].It implies a random distance in a certain direction determined by current and global optimal position.Unlike p t i , p t g they obtain a possibility via P so i , which is determined by Equation (35), to remain in the current position.a is a random number on the interval [0, 1].The center coordinates of the hypersphere can be determined by Equation (36).The radius of the hypersphere is determined by the norm of the vector determined by the hypersphere center coordinates and the current shark coordinates.The velocity vector is acquired according to Equation (36).The H i in Equation (37) denotes the random acquisition of a point on the hypersphere.Once we obtain the velocity vector of the ith value shark, the position of the ith white shark can be updated according to the second branch of Equation (21).

Quadratic Interpolation (QI) Operator
After the exploration and exploitation by WSO, the population can be further optimized, and quadratic interpolation (QI) modules are added.The QI operator is a method that enhances the ability of individuals to interact, further exploiting the population's ability to mine local information.Based on the information of neighboring individuals, interpolation points are generated to deepen the exploration of known local information.When conducting quadratic interpolation on excellent individuals, it can more comprehensively explore the positions near excellent individuals, thereby searching for potentially better positions.When conducting quadratic interpolation on inferior individuals, it can drive them closer to excellent individuals and explore more valuable search ranges.The mathematical core of quadratic interpolation is to construct a parabolic fit of the objective function based on three known points, and approximate the extreme points of the objective function through the extreme points of the parabola, thereby quickly finding possible optimal values [49].Figure 6 shows the core idea of the QI operator.
positions.When conducting quadratic interpolation on inferior individuals, it can drive them closer to excellent individuals and explore more valuable search ranges.The mathematical core of quadratic interpolation is to construct a parabolic fit of the objective function based on three known points, and approximate the extreme points of the objective function through the extreme points of the parabola, thereby quickly finding possible optimal values [49].Figure 6 shows the core idea of the QI operator.The specific details are given as follows.
where i is increasing from 1 to n−2.The quadratic interpolation of the selected agent generation is obtained according to Equation (38), when i = n−1, n, x + are chosen to obtain x if f x f qix x qix otherwise where 1 ( ) ( ) ( ) respect the cost function values of three agents.
After executing QI operator, the population 3 S will be formed.

Overall Framework for Path Planning Based on DD-RQIWSO
In the DD-RQIWSO algorithm, the position of an individual represents a path from the starting point to the destination, where the number of key points in the path is considered as the dimension of the path planning problem.During the solving process of DD-RQIWSO, the position information of each white shark individual represents the The specific details are given as follows.S t+1 1 denotes the population after the WSO algorithm updating.The agent in S t+1 1 are sorted according to the cost function from lowest to highest, and we credited the sorted population as S t+1 In the S t+1 2 population, the search agents are selected in order ), where i is increasing from 1 to n−2.The quadratic interpolation of the selected agent generation is obtained according to Equation (38), when i = n−1, n, x t+1 n−1 , x t+1 n , x t+1 where f (x t+1 i ), f (x t+1 i+1 ) and f x t+1 i+2 respect the cost function values of three agents.After executing QI operator, the population S 3 will be formed.

Overall Framework for Path Planning Based on DD-RQIWSO
In the DD-RQIWSO algorithm, the position of an individual represents a path from the starting point to the destination, where the number of key points in the path is considered as the dimension of the path planning problem.During the solving process of DD-RQIWSO, the position information of each white shark individual represents the coordinate components of the key points, and the dimension of the white shark individual is a variable related to the dimension of the path planning problem.In the case of a three-dimensional path planning problem, the dimension of the shark individual, denoted by dim, is twice the number of key points.In the position of the individual, 1 to dim/2 and 1 + dim/2 to dim represent the y-axis and z-axis coordinates of the key points in the route-planning problem, respectively; in the two-dimensional path-planning problem, the dimension of the shark individual is the same as the number of key points, representing the y-axis coordinates in the path; the x-axis coordinates are determined by the starting point, target point and the number of key points, which is illustrated in Section 2.1.A cost function is utilized to evaluate the quality of paths, guiding the algorithm in searching for the optimal solution.By searching for the position of prey in the search space, the algorithm aims to find the optimal solution, which is the shortest path.The process of DD-RQIWSO is shown in Figure 7.
, is twice the number of key points.In the position of the individual, 1 to / 2 dim and 1 /2 dim + to dim represent the y-axis and z-axis coordinates of the key points in the route-planning problem, respectively; in the two-dimensional path-planning problem, the dimension of the shark individual is the same as the number of key points, representing the y-axis coordinates in the path; the x-axis coordinates are determined by the starting point, target point and the number of key points, which is illustrated in Section 2.1.A cost function is utilized to evaluate the quality of paths, guiding the algorithm in searching for the optimal solution.By searching for the position of prey in the search space, the algorithm aims to find the optimal solution, which is the shortest path.The process of DD-RQIWSO is shown in Figure 7.The process of finding the optimal path using the DD-RQIWSO algorithm is as follows: The process of finding the optimal path using the DD-RQIWSO algorithm is as follows: Step 1: Obtain information about the path planning problem, such as the number of populations and the dimension of population, initialize the parameters of DD-RQIWSO and preliminarily evaluate the paths found according to the cost function.
Step 2: Start Iteration.Update the velocity matrix according to the velocity update strategy in the WSO algorithm, and directionally update the velocity for individuals with poor performance.Update the individual positions (paths) based on this velocity matrix and the current position.
Step 3: Execute the original fish swarm algorithm or modified rotation-invariant strategy to update positions, i.e., paths for UAVs, based on the tracking ability of the white shark individuals.Evaluate the found paths and determine the detected optimal path for this iteration.
Step 4: Record the best value of cost function and the solution for path planning.Perform quadratic interpolation to interactively exchange information on detected paths, aiming to dig out potential information, which is used for the next iteration.If the iteration is equal to the max iteration, end the algorithm and return the best value and solution, or repeat from Step 2.

Parameters Analysis of DD-RQIWSO
The major parameters and their impacts are listed as follows: f max and f min determine the frequency of the shark's motion, which controls update step of position.
µ controls the effect of velocity on the position of sharks so that it can regulate the exploitation of the sharks.The value µ relies on τ, and fine-tuning τ decreases the probability of being trapped in local optima.
mv balances global exploration and local exploitation.Weather explore or exploit during an iteration relies on the value of mv.Its value is determined by a 0 , a 1 .The relationship between the two types of behaviors and the parameters is as follows: the exploration capability will be greater and the exploitation will be lower as the a 1 increases.Similarly, the exploration capacity will increase and the exploration capacity will decrease as the a 0 declines.p 1 and p 2 are determined by p max and p min , which related to the iterations.They affect the velocity of the white sharks and cause the search step size to decrease with the number of iterations.Additionally, they regulate the impact of the optimal position of the white shark population on their current whereabouts.
δ determines the number of white sharks that need to undergo learning in the directional update strategy.
s s depends on a 2 and iteration.s s denotes the ability of following the best shark, which determines the strategy to exploitation in the fish schooling.With the iteration increased, it will tend to search around global optimal solutions rather than the optimal of individuals.

Time Complexity Analysis
For the time complexity problem of DD-RQIWSO, it is evaluated from the following aspects, including the number of sharks (n), the dimensionality of the given problem's solution vector (d), the number of iterations (K) and the scale of the cost function (c).The specific formula for calculating the time complexity is as follows: The components of the time complexity of Equation ( 40 Since 1 ≪ Kcn, 1 ≪ Knd, nd ≪ Kcn and nd ≪ Knd, Equation ( 41) reduces to Equation (42).
This indicates that the proposed algorithm's time complexity is polynomial.Thus, the algorithm can be considered as a time-efficient optimization algorithm.From Equation ( 42), it can be seen that the algorithm's time complexity is mainly influenced by the dimension of the solution vector, the evaluation of the cost function, the number of search agents and the number of iterations, all of which depend on the nature of the problem being addressed.

Experiment and Results
To comprehensively evaluate the performance of the DD-RQIWSO algorithm, a series of experiments were designed for validation.Under the same environment, the proposed algorithm was compared with four advanced algorithms, including spider wasp optimizer (SWO) [50], reptile search algorithm (RSA) [33], autonomous groups particles swarm optimization (AGPSO) [51] and white shark optimizer (WSO) [35].SWO and RSA are newly proposed swarm intelligence algorithms, and their results represent the optimization capability of the latest algorithms in this field.WSO is the original algorithm of the proposed algorithm in this paper; its results can be compared with the improved algorithm to determine the effectiveness of the improved strategy.Algorithm results based only on the original algorithm cannot show the results of the improved algorithm well, so the results of the improved algorithm by adding classical algorithms facilitate comparison with the improved algorithm proposed in this paper.The comparison algorithm is optimized in the same way as the DD-RQIWSO algorithm, as shown in Section 4.4.
All simulation experiments were conducted on a personal computer equipped with an AMD Ryzen 7 6800 H with Radeon Graphics, a 3.20 GHz processor and 16 GB of RAM, source from Lenovo (Beijing, China) and the experiments were constructed in MATLAB R2021b (9.11.0.1769968).

Experimental Setup
Experiments were conducted using a personal computer.To eliminate interference from random errors in the experimental results, each algorithm was run independently 10 times.The results showed excellent performance for each algorithm when the population size was set to 150 in the environment described in this study.
The main parameters of each algorithm are shown in Table 1.Table 2 describes the information about the threat area, while Table 3 provides information about the wind field in two and three dimensions, respectively.The z in the three-dimensional case means that the wind field is uniformly distributed along the z-axis.And this experiment only receives the horizontal wind.Additionally, the result of path denotes the best value of the algorithm in 10 independent runs.The w 1 , w 2 , w 3 and w 4 in cost function is set to 0.4, 0.3, 0.1 and 0.2, respectively.And a,b in Equation ( 13) is set to 600 and 400 according to experiment results.(15,34), 15 20 12 9 9 9 9 9 9 10 7 12 5 5 5 5 5 5 (25,19), (47,41), (50,61), (60, 10), (75, 65), (70, 53), (45,20), (75, 74) Case 5 (25,15,40), (25,80,30), (39,40,50)  In Case 1, seven threat areas were set up to test algorithm performance.Table 4 shows the statistical features of the results from 10 independent runs of each algorithm, including the best, worst, average and variance values.As shown in Table 4, the variance of DD-RQIWSO is 1-2 orders of magnitude smaller than the other algorithms.Although the best value of DD-RQIWSO is not superior to the best value of AGPSO in the 15 dimension, its other statistical results, including mean, worst and variance, is superior to AGPSO and other algorithms.The average convergence characteristic curves of the algorithms are shown in Figure 8.According to the convergence curves, DD-RQIWSO exhibits superior convergence characteristics compared to the other four algorithms.The WSO has inferior convergence characteristics, which obtain slower optimization accuracy and fall into local optima prematurely.The accuracy of AGPSO in 10 dimension is slightly faster than the proposed algorithm, but it falls into the local optima in the end.The optimization accuracy of SWO is slow and the final value of cost function is the worst among all algorithm we test.
inal WSO algorithm, this path of DD-RQIWSO is shorter, smoother and bypassing the threat regions.The AGPSO, also known as the improved algorithm, does not perform as well as DD-RQIWSO.In the 10 dimension, the paths of AGPSO and DD-RQIWSO are similar, but in the 15 dimension, AGPSO takes the farther path, and passes through the threat regions.RSA does not possess the ability to plan paths that consider the environment.SWO takes a rough route and long distance in this case.Figure 9 shows the 2D flight path results of each algorithm.Compared with the original WSO algorithm, this path of DD-RQIWSO is shorter, smoother and bypassing the threat regions.The AGPSO, also known as the improved algorithm, does not perform as well as DD-RQIWSO.In the 10 dimension, the paths of AGPSO and DD-RQIWSO are similar, but in the 15 dimension, AGPSO takes the farther path, and passes through the threat regions.RSA does not possess the ability to plan paths that consider the environment.SWO takes a rough route and long distance in this case.(2) Analysis of the experimental results of Case 2 In the second case, the number of threat regions is more than Case 1.The statistical results of 10 independent runs of Case 2 are shown in Table 5.As can be observed from the table, the DD-RQIWSO algorithm provides a good quality path planning for UAVs.The optimal solutions it obtains are better than those obtained from other compared algorithms in terms of the best, worst and mean value.The average convergence curves of these algorithms are presented in Figure 10.It can be seen from the figure that DD-RQIWSO is superior to other algorithms in both final convergence results and convergence speed at 15 dimensions.The convergence accuracy of AGPSO is only slightly faster than the proposed algorithm in 10 dimension, and all of the solutions of AGPSO finally fall into local optima.The performance of SWO is the worst; the convergence accuracy is slowest among all the compared algorithms.The WSO is worse than DD-RQIWSO from convergence accuracy and final solutions in any dimensions.(2) Analysis of the experimental results of Case 2 In the second case, the number of threat regions is more than Case 1.The statistical results of 10 independent runs of Case 2 are shown in Table 5.As can be observed from the table, the DD-RQIWSO algorithm provides a good quality path planning for UAVs.The optimal solutions it obtains are better than those obtained from other compared algorithms in terms of the best, worst and mean value.The average convergence curves of these algorithms are presented in Figure 10.It can be seen from the figure that DD-RQIWSO is superior to other algorithms in both final convergence results and convergence speed at 15 dimensions.The convergence accuracy of AGPSO is only slightly faster than the proposed algorithm in 10 dimension, and all of the solutions of AGPSO finally fall into local optima.The performance of SWO is the worst; the convergence accuracy is slowest among all the compared algorithms.The WSO is worse than DD-RQIWSO from convergence accuracy and final solutions in any dimensions.The path results are shown in Figure 11.The DD-RQIWSO algorithm obtains the optimal path with respect to the other compared algorithms.Not only does the path of DD-RQIWSO successfully bypass the threat regions, but also obtains relative minimum distance.Meanwhile, this path takes into account the effects of wind and offers fairly good path smoothness.The paths generated by the original WSO algorithm yield a poor performance in this case, which not only take further distances, but also fly against the wind.AGPSO claims similar results with DD-RQIWSO in the 15 dimension, but falls into local optima and even exists unnecessarily far from the target in the 10 dimension.SWO and RSA get low ability to detect the search space and yield bad paths.(3) Analysis of the experimental results of Case 3 The experiment results for Case 3 are shown in Table 6.These statistical outcomes show that DD-RQIWSO outperforms other compared algorithms.All of its statistical results are superior to the compared algorithms.The DD-RQIWSO obtains the optimal cost function value and has optimal stability, which is proved by these statistical results.The The path results are shown in Figure 11.The DD-RQIWSO algorithm obtains the optimal path with respect to the other compared algorithms.Not only does the path of DD-RQIWSO successfully bypass the threat regions, but also obtains relative minimum distance.Meanwhile, this path takes into account the effects of wind and offers fairly good path smoothness.The paths generated by the original WSO algorithm yield a poor performance in this case, which not only take further distances, but also fly against the wind.AGPSO claims similar results with DD-RQIWSO in the 15 dimension, but falls into local optima and even exists unnecessarily far from the target in the 10 dimension.SWO and RSA get low ability to detect the search space and yield bad paths.The path results are shown in Figure 11.The DD-RQIWSO algorithm obtains the optimal path with respect to the other compared algorithms.Not only does the path of DD-RQIWSO successfully bypass the threat regions, but also obtains relative minimum distance.Meanwhile, this path takes into account the effects of wind and offers fairly good path smoothness.The paths generated by the original WSO algorithm yield a poor performance in this case, which not only take further distances, but also fly against the wind.AGPSO claims similar results with DD-RQIWSO in the 15 dimension, but falls into local optima and even exists unnecessarily far from the target in the 10 dimension.SWO and RSA get low ability to detect the search space and yield bad paths.(3) Analysis of the experimental results of Case 3 The experiment results for Case 3 are shown in Table 6.These statistical outcomes show that DD-RQIWSO outperforms other compared algorithms.All of its statistical results are superior to the compared algorithms.The DD-RQIWSO obtains the optimal cost function value and has optimal stability, which is proved by these statistical results.The curves describing the features of convergence yield by algorithms are given in Figure 12. (3) Analysis of the experimental results of Case 3 The experiment results for Case 3 are shown in Table 6.These statistical outcomes show that DD-RQIWSO outperforms other compared algorithms.All of its statistical results are superior to the compared algorithms.The DD-RQIWSO obtains the optimal cost function value and has optimal stability, which is proved by these statistical results.The curves describing the features of convergence yield by algorithms are given in Figure 12.According to these convergence curves, DD-RQIWSO outperforms WSO and SWO algorithms.In some situations, RSA and AGPSO convergence outperforms DD-RQIWSO in certain aspects, as analyzed below.The RSA algorithm has good convergence accuracy in the early iterations, but its final convergence value is worse than that of the proposed algorithm and fails to escape from local optima.The convergence accuracy of AGPSO in 10 dimension is faster than DD-RQIWSO, but it does not skip out of the local optimum until the end.SWO shows the worst performance.WSO is worse than AGPSO, but they have similar convergence curves.As shown in Figure 13, the path of DD-RQIWSO algorithm obtains the optimal path with respect to the other compared algorithms.Compared to the other algorithms, the paths of DD-RQIWSO are smoother.WSO has acceptable results in the 10-dimension case.But once the dimensions are elevated, it loses the ability to take the environment into account.The result of AGPSO is similar to DD-RQIWSO in the 10 dimension, but it takes the farther path, and passes through the threat regions more seriously than DD-RQIWSO in the 15 dimension.SWO follows a rough route and long distance in the 10 dimension, and essentially loses the ability to plan paths that consider the environment in the 15 dimension.And RSA does not possess the ability to plan paths that consider the environment in this case.

Value of fitness
Value of fitness As shown in Figure 13, the path of DD-RQIWSO algorithm obtains the optimal path with respect to the other compared algorithms.Compared to the other algorithms, the paths of DD-RQIWSO are smoother.WSO has acceptable results in the 10-dimension case.But once the dimensions are elevated, it loses the ability to take the environment into account.The result of AGPSO is similar to DD-RQIWSO in the 10 dimension, but it takes the farther path, and passes through the threat regions more seriously than DD-RQIWSO in the 15 dimension.SWO follows a rough route and long distance in the 10 dimension, and essentially loses the ability to plan paths that consider the environment in the 15 dimension.And RSA does not possess the ability to plan paths that consider the environment in this case.In the fourth test, a complex three-dimensional scenario was constructed to te performance of the proposed algorithm and other state-of-the-art algorithms.T shows the numerical features of all algorithms' statistical results.As we can see fro table, DD-RQIWSO presents strong stability.Its value of cost function maintains excellent level.The best value of AGPSO is lower than DD-RQIWSO in the 10 dime but its results are too scattered.In contrast, RSA presents good stability, but the va cost function is too high compared with DD-RQIWSO.From a general point of vie DD-RQIWSO shows stable maintenance of a better range of results.In the fourth test, a complex three-dimensional scenario was constructed to test the performance of the proposed algorithm and other state-of-the-art algorithms.Table 7 shows the numerical features of all algorithms' statistical results.As we can see from the table, DD-RQIWSO presents strong stability.Its value of cost function maintains in an excellent level.The best value of AGPSO is lower than DD-RQIWSO in the 10 dimension, but its results are too scattered.In contrast, RSA presents good stability, but the value of cost function is too high compared with DD-RQIWSO.From a general point of view, the DD-RQIWSO shows stable maintenance of a better range of results.Figure 14 exhibits the convergence curves of results of the algorithms.The convergence characteristics of the DD-RQIWSO algorithm excel in comparison to the other algorithms, which can be proved by Figure 14.The convergence accuracy of AGPSO algorithm is little faster than DD-RQIWSO before 15 generations; it also slows down and falls into local optima after that.There is no doubt that the proposed algorithm obtains the optima solution at the fastest accuracy.The flight paths of the UAVs generated by each algorithm in a 3D environment are shown in Figure 15.As shown in the figure, the path of DD-RQIWSO algorithm obtains the optimal path with respect to the other compared algorithms.The flight path of DD-RQIWSO algorithm follows a smooth path.It avoids all threats as it takes advantage of wind.The path generated by WSO becomes tortuous and follows an unwarranted route.The flight path of RSA algorithm provides higher flight attitude in the 15 dimension and increases flight distance in the 10 dimension.SWO's path twists the route and goes through the threat region usually.The path of AGPSO follows long routes when dimension increases, and loses the ability to consider the effect of the environment.
long routes when dimension increases, and loses the ability to consider the effect of the environment.(2) Analysis of the experimental results of Case 5 The environment of this case is more oriented towards the effect of wind on UAVs, excluding the effect of obstacles superimposed on the wind field.Table 8 presents the statistical features of the cost functions of each algorithm in the final experimental scenario.As shown in the table, DD-RQIWSO outperforms other algorithms in most statistical features, only the best result of AGPSO slightly lower than the proposed algorithm.This indicates that the proposed algorithm exhibits excellent optimization performance and stability in this scenario.(2) Analysis of the experimental results of Case 5 The environment of this case is more oriented towards the effect of wind on UAVs, excluding the effect of obstacles superimposed on the wind field.Table 8 presents the statistical features of the cost functions of each algorithm in the final experimental scenario.As shown in the table, DD-RQIWSO outperforms other algorithms in most statistical features, only the best result of AGPSO slightly lower than the proposed algorithm.This indicates that the proposed algorithm exhibits excellent optimization performance and stability in this scenario.(2) Analysis of the experimental results of Case 5 The environment of this case is more oriented towards the effect of wind on UAVs, excluding the effect of obstacles superimposed on the wind field.Table 8 presents the statistical features of the cost functions of each algorithm in the final experimental scenario.As shown in the table, DD-RQIWSO outperforms other algorithms in most statistical features, only the best result of AGPSO slightly lower than the proposed algorithm.This indicates that the proposed algorithm exhibits excellent optimization performance and stability in this scenario.Figure 16 depicts the average convergence curves generated by each algorithm.From the figure, it can be observed that the proposed algorithm has good convergence characteristics.Although the optimization accuracy of the RSA algorithm in 15 dimensions is faster at about 20 generations, it is finally overtaken by the DD-RQIWSO algorithm.As for AGPSO, it presents excellent optimization accuracy in low dimension, but not superior to DD-RQIWSO.As we can observe, SWO and WSO algorithms fall into local optima prematurely, and the accuracy of SWO is pretty bad. Figure 17 illustrates the minimum-cost UAV route generated by algorithms.As can be seen from these figures, the flight paths of DD-RQIWSO demonstrate more efficient obstacle avoidance capabilities.The flight paths of SWO even run through the threat regions in 15 dimensions, and the result of WSO is no longer smooth and through the threat region.
Figure 16 depicts the average convergence curves generated by each algorithm.From the figure, it can be observed that the proposed algorithm has good convergence characteristics.Although the optimization accuracy of the RSA algorithm in 15 dimensions is faster at about 20 generations, it is finally overtaken by the DD-RQIWSO algorithm.As for AGPSO, it presents excellent optimization accuracy in low dimension, but not superior to DD-RQIWSO.As we can observe, SWO and WSO algorithms fall into local optima prematurely, and the accuracy of SWO is pretty bad. Figure 17 illustrates the minimumcost UAV route generated by algorithms.As can be seen from these figures, the flight paths of DD-RQIWSO demonstrate more efficient obstacle avoidance capabilities.The flight paths of SWO even run through the threat regions in 15 dimensions, and the result of WSO is no longer smooth and through the threat region.

Time Analysis
When evaluating the cost of the wind direction, the cost is calculated one time for each sampling point, and the number of sampling points is only related to the number of key points of the path in the algorithm, so it can be found that the path planning can be finished in linear time no matter whether it considers the influence of the wind or not, i.e., the algorithm's time complexity is still O(n).
The running time of if adding the wind field are shown to evaluate the actual time overhead brought by considering the wind field factor, from the results, the algorithm does not lead to the explosive growth of computing time due to the wind factor, and its computation time is acceptable.Under the condition of obtaining reliable paths, the running time required by the proposed algorithm in this paper is also in line with the range  From the figure, it can be observed that the proposed algorithm has good convergence characteristics.Although the optimization accuracy of the RSA algorithm in 15 dimensions is faster at about 20 generations, it is finally overtaken by the DD-RQIWSO algorithm.As for AGPSO, it presents excellent optimization accuracy in low dimension, but not superior to DD-RQIWSO.As we can observe, SWO and WSO algorithms fall into local optima prematurely, and the accuracy of SWO is pretty bad. Figure 17 illustrates the minimumcost UAV route generated by algorithms.As can be seen from these figures, the flight paths of DD-RQIWSO demonstrate more efficient obstacle avoidance capabilities.The flight paths of SWO even run through the threat regions in 15 dimensions, and the result of WSO is no longer smooth and through the threat region.

Time Analysis
When evaluating the cost of the wind direction, the cost is calculated one time for each sampling point, and the number of sampling points is only related to the number of key points of the path in the algorithm, so it can be found that the path planning can be finished in linear time no matter whether it considers the influence of the wind or not, i.e., the algorithm's time complexity is still O(n).
The running time of if adding the wind field are shown to evaluate the actual time overhead brought by considering the wind field factor, from the results, the algorithm does not lead to the explosive growth of computing time due to the wind factor, and its computation time is acceptable.Under the condition of obtaining reliable paths, the running time required by the proposed algorithm in this paper is also in line with the range

Time Analysis
When evaluating the cost of the wind direction, the cost is calculated one time for each sampling point, and the number of sampling points is only related to the number of key points of the path in the algorithm, so it can be found that the path planning can be finished in linear time no matter whether it considers the influence of the wind or not, i.e., the algorithm's time complexity is still O(n).
The running time of if adding the wind field are shown to evaluate the actual time overhead brought by considering the wind field factor, from the results, the algorithm does not lead to the explosive growth of computing time due to the wind factor, and its computation time is acceptable.Under the condition of obtaining reliable paths, the running time required by the proposed algorithm in this paper is also in line with the range of running time of the algorithms in existing research [36].The results are shown in Tables 9 and 10.

Discussion
This study investigated the path planning problem of UAVs under emergency transportation in the marine environment.Optimization algorithms based on swarm intelligence have demonstrated excellent optimization capabilities that can deal with a variety of complex mathematical models and find the optimal solution as long as a suitable cost function is constructed.In this paper, a model of maritime emergency transportation environment is constructed, and the WSO algorithm is further improved so that it yields a good performance in path planning in maritime emergency transportation environment.As validated by experimental results, the proposed algorithm can provide suitable flight routes for UAVs, so that the UAVs can avoid obstacles with the shortest paths and utilize the wind environment as much as possible to reduce energy consumption.Although the experimental results in Section 5.2 show that DD-RQIWSO has a good performance, there are still limitations in the research.In this paper, the wind field under the mathematical model is relatively simple and cannot fully simulate the complex wind direction in nature.Also, the method proposed in this paper cannot handle real-time tasks.Proven by the experiment, as one of problems in the application of UAVs to maritime emergency transportation, the path-planning problem of UAVs can be addressed by the proposed algorithm.

Conclusions
Path planning with complex constraints for UAVs is a hard optimization problem.This paper proposes a new swarm intelligence algorithm to solve this problem.Based on the WSO algorithm, three strategies are incorporated so that it can work more effectively.These strategies are as follows: Firstly, a more elaborate velocity update mechanism is introduced to the algorithm.For the poorly performing individuals, we make them learn from the elites and update their speed based on the position of the elites, thus improving the exploitation ability of the algorithm.Secondly, a modified rotation invariant strategy complements the fish schooling strategy to enhance algorithmic exploitation when they lose the position of the best shark.Finally, a QI operator conducts further fine-tuning of the existing solutions detected by the algorithm at each iteration.It interacts with the information between individuals for existing feasible solutions, so that the high-performing individuals can perform more detailed exploitation, and the low-performing individuals can interact directly with the high-performing individuals to move closer to the optimum.To describe the maritime emergency transportation environment, a maritime environment model is constructed.It considers the effect of obstacles and wind.Through a series of comparative experiments, the research in this paper can effectively solve the path-planning problem of UAVs.
In addition, the study could still be explored further.Firstly, the results in this paper are only for wind fields with a single flow direction.How to utilize more complex wind

Figure 1 .
Figure 1.Schematic diagram of the path generation principle.

:connect the start point and endFigure 1 .
Figure 1.Schematic diagram of the path generation principle.

Figure 3 .
Figure 3.The schematic representation of the wind environment.

Figure 3 .
Figure 3.The schematic representation of the wind environment.

Figure 4 .
Figure 4.The schematic diagram of wind speed cost function.

Figure 4 .
Figure 4.The schematic diagram of wind speed cost function.

where 3 c and 4 c
are random numbers in the interval [0,1], 1 p and 2 p are the parameters generated by Equations (19) and (20), g k P denotes the global optimum of the kth generation and k mP is the average of three randomly selected white sharks in the top 5%

Figure 7 .
Figure 7.The flow chart of proposed DD-RQIWSO algorithm for path planning.

Figure 7 .
Figure 7.The flow chart of proposed DD-RQIWSO algorithm for path planning.

Figure 8 .
Figure 8.The mean accuracy convergence curves for Case 1.

Figure 8 .
Figure 8.The mean accuracy convergence curves for Case 1.

Figure 9 .
Figure 9. Flight paths for UAV in Case 1.

Figure 10 .
Figure 10.The mean accuracy convergence curves for Case 2.

Figure 10 .
Figure 10.The mean accuracy convergence curves for Case 2.

J 30 Figure 10 .
Figure 10.The mean accuracy convergence curves for Case 2.

Figure 12 .
Figure 12.The mean accuracy convergence curves for Case 3.

Figure 12 .
Figure 12.The mean accuracy convergence curves for Case 3.

Figure 13 .
Figure 13.Flight paths for UAVs in Case 3. 5.2.2.Three-Dimensional Scenario for UAV Path Planning (1) Analysis of the experimental results of Case 4

Figure 14 exhibits
Figure 14 exhibits the convergence curves of results of the algorithms.The co gence characteristics of the DD-RQIWSO algorithm excel in comparison to the othe rithms, which can be proved by Figure 14.The convergence accuracy of AGPSO algo is little faster than DD-RQIWSO before 15 generations; it also slows down and fal local optima after that.There is no doubt that the proposed algorithm obtains the o solution at the fastest accuracy.The flight paths of the UAVs generated by each algo in a 3D environment are shown in Figure 15.As shown in the figure, the path o RQIWSO algorithm obtains the optimal path with respect to the other compared rithms.The flight path of DD-RQIWSO algorithm follows a smooth path.It avo threats as it takes advantage of wind.The path generated by WSO becomes tortuou follows an unwarranted route.The flight path of RSA algorithm provides higher attitude in the 15 dimension and increases flight distance in the 10 dimension.SWO' twists the route and goes through the threat region usually.The path of AGPSO fo

Figure 13 .
Figure 13.Flight paths for UAVs in Case 3. 5.2.2.Three-Dimensional Scenario for UAV Path Planning (1) Analysis of the experimental results of Case 4

Figure 14 .
Figure 14.The mean accuracy convergence curves for Case 4.

Figure 14 .
Figure 14.The mean accuracy convergence curves for Case 4.

Figure 14 .
Figure 14.The mean accuracy convergence curves for Case 4.

Figure 16 .
Figure 16.The mean accuracy convergence curves for Case 5.

Figure 16 .
Figure 16.The mean accuracy convergence curves for Case 5.

Figure 16
Figure16depicts the average convergence curves generated by each algorithm.From the figure, it can be observed that the proposed algorithm has good convergence characteristics.Although the optimization accuracy of the RSA algorithm in 15 dimensions is faster at about 20 generations, it is finally overtaken by the DD-RQIWSO algorithm.As for AGPSO, it presents excellent optimization accuracy in low dimension, but not superior to DD-RQIWSO.As we can observe, SWO and WSO algorithms fall into local optima prematurely, and the accuracy of SWO is pretty bad.Figure17illustrates the minimumcost UAV route generated by algorithms.As can be seen from these figures, the flight paths of DD-RQIWSO demonstrate more efficient obstacle avoidance capabilities.The flight paths of SWO even run through the threat regions in 15 dimensions, and the result of WSO is no longer smooth and through the threat region.

Figure 16 .
Figure 16.The mean accuracy convergence curves for Case 5.

•
To simulate real oceanic scenarios, 2D and 3D wind simulation environments based on the Lamb-Oseen vortex model are constructed.A series of tests are conducted on DD-RQIWSO to demonstrate the superiority of the algorithm.

Table 1 .
The parameters of algorithms used for experiment.

Table 2 .
The information of threat area.

Table 3 .
The information of wind.
(1) Analysis of the experimental results of Case 1

Table 4 .
Experimental results of the value of the cost function of all algorithms for Case 1.

Table 5 .
Experimental results of the value of the cost function of all algorithms for Case 2.

Table 5 .
Experimental results of the value of the cost function of all algorithms for Case 2.

Table 6 .
Experimental results of the value of the cost function of all algorithms for test Case 3.

Table 6 .
Experimental results of the value of the cost function of all algorithms for test Case 3.

Table 7 .
Experimental results of the value of the cost function of all algorithms for test Case

Table 7 .
Experimental results of the value of the cost function of all algorithms for test Case 4.

Table 8 .
Experimental results of the value of the cost function of all algorithms for test Case 5.

Table 8 .
Experimental results of the value of the cost function of all algorithms for test Case 5.

Table 8 .
Experimental results of the value of the cost function of all algorithms for test Case 5.

Table 9 .
Comparison of running time of test algorithm for Case 1 in two dimensions.

Table 10 .
Comparison of running time of test algorithm for Case 4 in three dimensions.