A Convex Optimization Approach to Multi-Robot Task Allocation and Path Planning

In real-world applications, multiple robots need to be dynamically deployed to their appropriate locations as teams while the distance cost between robots and goals is minimized, which is known to be an NP-hard problem. In this paper, a new framework of team-based multi-robot task allocation and path planning is developed for robot exploration missions through a convex optimization-based distance optimal model. A new distance optimal model is proposed to minimize the traveled distance between robots and their goals. The proposed framework fuses task decomposition, allocation, local sub-task allocation, and path planning. To begin, multiple robots are firstly divided and clustered into a variety of teams considering interrelation and dependencies of robots, and task decomposition. Secondly, the teams with various arbitrary shape enclosing intercorrelative robots are approximated and relaxed into circles, which are mathematically formulated to convex optimization problems to minimize the distance between teams, as well as between a robot and their goals. Once the robot teams are deployed into their appropriate locations, the robot locations are further refined by a graph-based Delaunay triangulation method. Thirdly, in the team, a self-organizing map-based neural network (SOMNN) paradigm is developed to complete the dynamical sub-task allocation and path planning, in which the robots are dynamically assigned to their nearby goals locally. Simulation and comparison studies demonstrate the proposed hybrid multi-robot task allocation and path planning framework is effective and efficient.

Sung et al. [21] addressed a distributed approach to integrate multi-robot deployment and multi-target tracking. A local algorithm is adopted to achieving performance close to the optimal algorithms with limited communication with assistance of distributed approach. For a heterogeneous multi-robot system, where tasks form disjoint groups and where there are restrictions on the number of tasks, a robot may accomplish (both within the overall mission and within each task group), Luo et al. [34] provided a provably-good distributed task allocation methods. Their goal of task allocation is to maximize (minimize) the total payout (cost) of the robots, whereby each robot receives a payment (or incurs a cost) for completing a job. Basil et al. [35] proposed a modular robot to be a morphologically flexible, autonomous kinematic machines with hundreds or even millions of modules that work together to exhibit intelligent behavior. The self-reconfiguration process, which seeks to identify a series of reconfiguration activities to convert robots from an initial form to a target one, may be enhanced by clustering the modules in modular robots. Luo et al. [24] extended distributed algorithms by taking deadline into account in multi-robot deployment. Purohit et al. [26] presented a spanning tree method associated with self-localizing capability in the graph. However, robot teams are not yet successfully clustered and assigned to multiple tasks effectively. To efficiently and securely explore and recon a given region with a large number of robots, Li et al. [36] introduced an enhanced genetic algorithm (IGA) to tackle the job assignment issue of a multi-robot system. Searching the numerous identical sections of the specified region is a subtask that must be completed in order to solve a challenge. Qin et al. [37] formally described the challenge of completing dynamic tasks with several robots by modeling their interactions as a series of state transitions, or behavior trees. Through the use of a unique priority system, a framework-associated distributed algorithms for inter-robot communication, negotiation, and agreement protocols was provided. Bai et al. [38] investigated the multi-robot task allocation issue, in which a team of geographically-dispersed robots must effectively move a number of packages from their originating locations to their respective destinations within a certain amount of time. A market-based approach is suggested by Rossi et al. [29] for simultaneous task subdivision and allocation in heterogeneous multi-robot systems. García et al. [30] implemented a behavior-based architecture with many layers allowing the market-based method to achieve varying degrees of coordination. However, as the number of robots in the team or the complexity of the problem increases, market-based approaches suffer from scalability and dynamics issues, which tend to hinder these processes, especially when this happens in real time.
Neural network (NN) methods have been broadly applied to multi-robot path planning and task allocation. Luo and Yang [33] developed a neural dynamics model to assign multiple robots to environmental exploration collaboratively. Multiple robots cooperate to achieve a common sweeping goal effectively. However, energy consumption of multi-robot system has not been considered in this paper. Later, Luo et al. [32] extended their research by addressing the computational complexity and energy efficiency of multi-robot system with navigation [39]. The energy consumption of multiple tasks for an arbitrary number of robots is considered, in which a bio-inspired neural networks model for multi-robot navigation applied to cleaning robots is developed. In this model, multiple robots are assigned to complete terrain coverage task cooperatively, extendable to unknown exploration environments [39]. Distance cost optimization is considered in the recent research. For instance, Lee et al. [31] suggested a master-slave based multi-robot deployment with time and distance minimization consideration; however, the distance is not globally optimized. Some researchers combined a couple of models to take advantage of various benefits. For instance, Michael et al. [22] effectively integrated distributed algorithm and marketbased method for multi-robot deployment. Motes et al. [40] concatenated path-finding method and conflict-based search method to multi-robot deployment and navigation with inter-team conflict avoidance.
The centralization or decentralization of task allocation methods is another essential characteristic. Using the well-known Kiva warehouse robot as an example [41], both task allocation and path planning are conducted centrally; however, this may not be feasible if the multi-robot system operates in an environment that lacks robust sensing and computational capabilities. Clustering is a potential middle alternative between centralized and decentralized methods that aims to balance performance and computational load [42]. Martin et al. [43] proposed an algorithm to group the players into balanced clusters, applied randomized methods to large problems to relieve the computational load, and assessed feasibility in a large scenario and contrasted with a genetic approach.
In general, market/auction-based approaches can be solved decentrally [44]. However, the problem structure must be straightforward enough so that each agent can act as a bidder and bid on tasks; this can be challenging when some tasks require the cooperation of multiple agents. Optimization-based methods permit more complex problem structures but can be challenging to solve decentralized. Bo et al. [45] proposed a stochastic programming framework, which optimizes the decomposition, allocation, and scheduling of tasks for a group of agents. The framework enables teams of mobile robots in different locations to perform different tasks. However, the scalability and dynamics issues arise as the number of robots in the team or the complexity of the problem increases. Therefore, a framework based on optimization is proposed in this paper for decentralized task allocation that is appropriate for complex problem models.

Proposed Framework and Original Contributions
In this paper, a two-stage team-based decentralized deployment framework through convex optimization model is developed. It aims for multiple robots scattered in arbitrary space to minimize distance cost among robots, and between robots and goals so as to reach all the task locations effectively shown in Figure 1.

Clustered Robot Teams
Interconnected Teams Targets Targets Target Target Figure 1. Illustration of an exploration environment with robot teams (swarms) (adapted from Eklavya 2019 [46]). In this multi-robot deployment problem, a known environment is given with N target locations, in which some locations are located within the workspace (such as warehouses), but other locations are in the edges of the workspace. There are varying numbers of robots in each team.
In the first stage, multiple robots in workspace are classified and clustered into a variety of teams enclosing correlative robots as task decomposition. Robot teams in arbitrary shape are represented as circles, in order to formulate it into a convex optimization model.
The relative locations of teams are obtained through this convex optimization model. Then, the locations of circles are refined in a refinement stage via a Delaunay triangulation method to clean up the overlaps of teams. In the second stage, within the team, a self-organizing map (SOM) neural networks (NN) paradigm is considered to complete the dynamical sub-task allocation and path planning by dynamically assigning them to their nearby goals locally. The proposed framework couples task decomposition, deployment, local sub-task allocation and path planning to support cases where the optimal solution depends on robot interrelation, dependencies, and availability, as well as inter-team conflict avoidance.
The contributions of this paper are summarized as follows: (1) A two-stage convex-optimization-based framework is proposed for decentralized multi-robot task allocation and task decomposition. (2) The first stage of the proposed convex-optimization-based framework is designed to determine the relative locations of the robot teams. By obtaining data regarding the robots and the targets, the robots are categorized and aggregated into multiple teams with respect to the total distance cost. (3) The second stage of the proposed convex optimization-based framework aims to locally assign teams of robots to final goals. The local self-organizing map based neural network (SOMNN) method is developed to subtask allocations and robot path planning. (4) A Delaunay triangulation (DT) method is employed to refine the team locations and, thus, connect the two stages.
The overall workflow of the proposed multi-robot task allocation framework is illustrated in Figure 2.

Local Subtask Allocation
Path Planning

Refinement of Team Locations
The Positions of Robots The Positions of Targets

Convex Optimization
Self-organizing map based Neural Networks (SOMNN)

Multi-robot Deployment
Multi-robot Task Allocation The rest of this paper is organized as follows. In Section 2, the problem statement and formulation is presented. In Section 3, the distance optimal-based convex optimization model for multi-robot deployment is proposed. Section 4 presents the SOMNN subtask allocation and path planning approach. Numerical experiments, simulations, and comparison studies are presented in Section 5. Several important properties of the presented framework are summarized in Section 6.

Problem Statement and Formulation
In this multi-robot deployment problem, a known environment is given with N target locations, in which some locations are located within the workspace (such as warehouses), but other locations are in the edges of the workspace (see Figure 1). There are m robots to be deployed that are classified into g swarms (groups). In every swarm, there are k robots (k is a variable). Robots are initially located within different irregular shapes as swarms (see Figure 1). Robots moves as a swarm while maintaining a minimized total distance cost among robots shown in Figure 3. Interrelated robots are classified into the same swarm to be deployed in a nearby terrain. Some robots are enclosed in one swarm in collaboration with the robots in other swarms. Robots with relatively high connections are arranged close to one another for connectivity. The distance between pairs of robots with high connection is to be minimized. Likewise, the distance between robots and target locations is to be minimized.

Robot Team 1 Robot Team 2
Robot Team 3 Figure 3. Illustration of multi-robot deployment with robot teams. Robots move as a swarm while minimizing the total distance cost between them. The classification of interrelated robots into the relevant swarm for deployment on adjacent terrain. Some robots are contained within one swarm in conjunction with robots from other swarms.
The relative locations of the robot swarms in the known environment are provided by an attractor-repeller convex optimization algorithm. The robots enclosed in swarms are deployed into their relative locations in the workspace. Given the relative locations of the robots, a planar graph and relative location matrix are obtained by a Delaunay triangulation approach, to enforce no overlap between any two circles, used for the next step. The robots contained in their swarms are locally assigned to their locations. The robots in a group are required to have the shortest possible distance from the fixed target location on the edge of the working environment.

Convex Optimization Model
The proposed framework couples task decomposition, deployment, local subtask allocation and path planning to support scenarios where the optimal solution depends on robot interrelation, dependencies, and availability, as well as inter-team conflict avoidance. We introduce an efficient technique that addresses the deployment problem of a team of heterogeneous robots. For multiple scattered tasks in arbitrary space, the objective to be solved is to minimize distance for robots to reach all the task locations.

Convex Optimization Algorithms
Convex optimization algorithms to minimize the total distance cost are described in this section. There are several definitions of the proposed algorithms.

Definition 1. Team of robots.
The deployment of robot swarms is described as follows: • Each swarm of robots is labeled 1, 2, ..., M, represented as a circle with radius r i , i = 1, 2, ..., M. The radius r i is determined by the number of robots in this swarm. • The location of each swarm 1, 2, ..., M is given by the coordinates of its center depicted as (x i , y i ). • The non-negative cost per unit distance between swarms i and j is denoted by c ij , which is equivalent to the weight between swarms.
• The distance measured from center to center of swarms i and j by Euclidean distance Therefore, the multi-robot deployment problem may be mathematically formulated as weighted distance minimization model among swarms as follows.
The circles containing robots are deployed as close as possible, but they should not overlap. The objective function ∑ 1≤i<j≤M c ij d ij attempts to make distance d ij as low as possible, which attracts pairs of circles i and j towards each other so as to function as an attractor. The constraints, , push any pair of circles away from each other with no overlapping [47]. Swarms are approximated by circles whose radii are proportional to the amount of encircled robots. The constraint to prevent circles from overlapping has the mathematical form: where (x i , y i ) and (x j , y j ) denote the coordinates of the centers of two circles i and j, whereas r i and r j represent their corresponding radii, respectively. The distance between circles is minimized, while they have no overlaps as constraints. This problem may be formulated as a non-linear optimization model with its superiority of convexity, inspired by a dynamic spring system [48] as follows. min is penalized in the total energy function to generate a repulsive force. d ij and r i , r j of circles are defined above. If d ij < r i + r j , then circles i and j overlap. d ij ≥ r i + r j prevents circles i and j from overlapping each other. In order to formulate convex optimization model, in which circles enclosing robots are formulated to have no overlaps, repulsive force is introduced. In addition to this attractive force, we consider move this max function by introducing repulsive force. Afterwards, robots as swarms are deployed into their appropriate potions. Target distance concept employed in this model was initially introduced by Anjos and Vannelli [49]. Let each swarm of robots i be represented by a circle with radius r i , where r i is proportional to √ a i , the square root of the area of circle i, measured by the amount of robots encircled.
Following [49], the target distance for each pair of circles i, j is defined as t ij := α(r i + r j ) 2 , where α > 0 is a parameter. To prevent circles from overlapping, the target distance is enforced by introducing a penalty term f ( The objective function is thus given by The repeller term (i.e., the penalty function) enforced by holding a positive value, functions as a repulsive force, if r i + r j > d ij , to hinder the circles from overlapping.
The attractor in the objective function used to apply an attractive force to the two circles, makes the two swarms of robots (circles) move closer together (see Figure 4). The repeller term disappears or becomes a negative value implying that there is no any overlap between circles, if r i + r j ≤ d ij . An attractive force is enforced to the two circles through the attractor in the objective function enforces, if D ij ≥ t ij . In this case, there is no any overlap between circles and the repeller term is zero or slightly negative. Conversely, the repeller term is positive, which tends to positive infinity as D ij tends to zero so as to prevent the circles from overlapping fully, if D ij < t ij . There is no force between circles i and j exactly if D 2 ij = t ij /c ij . The generalized target distance T ij is defined to contribute to this optimization model.

Attractive Force
Mobile robots

Repulsive Force
Mobile robots The model aims to ensure that D ij t ij = 1 at optimality, so choosing α < 1 sets a target value t ij that allows some overlap of the respective circles, which means that the nonoverlap requirement is relaxed. In practice, by properly adjusting the value of α we achieve a reasonable separation between all pairs of circles. Let M and P denote the set of mobile teams (circles) and the set of targets (goals), respectively. Target distances are applied only for pairs of mobile robot teams (circles). The complete attractor-repeller (AR) model is s.t.
where (x i , y i ) are the coordinates of the centre of circle i as previously defined; w F , h F are the width and height of the workspace; and w low F , w up F , h low F , and h up F are the lower and upper bounds on the width and height, respectively. The first two sets of constraints require that all the circles be entirely contained in the workspace, and the remaining two pairs of inequalities bound the width and height of the workspace of robots. In particular, for certain robot environment, we set w low F = w up F =w F and h low F = h up F =h F , wherew F and h F are the width and height of the workspace to be explored by robots.

Definition 2. Generalized target distance
where ε > 0 is sufficiently small to ensure that D ij ≈ In real-world applications, the distances D ij between the circles should be inversely proportional to c ij representing the weights on the wire-length, and should be proportional to the relative size of the teams through the value of t ij . Hence, a generalized target distance, T ij , is defined such that D ij ≈ T ij at optimality. Using T ij , a convex version of the AR model may be described in the following section with the following term.
It is clear that this problem is convex, and that by construction F ij attains its minimum value whenever the locations of circles i and j satisfy D ij ≤ T ij . This includes the case where D ij = 0, i.e., both circles completely overlap. The idea is to add to the objective function a term of the form − ln D ij /T ij for each pair i, j of circles. Hence, the model solved in the first stage of our method is s.t.
where β is a parameter selected empirically. K is chosen to reflect the weights of all the pairs of mobile circles (teams) in the objective function by K = ∑ i<j c ij .
The topological relationships between terms are obtained in this first stage. Without the term −βK ln D ij /T ij in Equation (8), this problem is convex. By solving it, the solution of the first stage provides relative locations within the workspace for all the robot teams (circles) represented by circles. The relative locations of the robot swarms in the known environment are provided by an attractor-repeller convex optimization algorithm shown in Figure 5. The team-based convex optimization algorithm for robot deployment is summarized in Algorithm 1.

Algorithm 1: Team-based convex optimization algorithm for robot deployment
Input: Initial configuration (q s ), Goal configuration (q g ), and Required cost c ij , d ij Output: Path (or sequence of nodes from q s to q g ), Robots' final locations (L g oal) Begin 1: Classify m robots into k teams T i , i ∈ [1, k]; 2: Approximate teams T i to k circles at center of T j (x j , y j ), j ∈ [1, k]; 3: Solve the convex optimization model (Equation (8)) having C j (x j , y j ), j ∈ [1, k]; 4: Obtain relative locations of teams. New circle locationsC j (x j ,ỹ j ), j ∈ [1, k]; 5: Apply SOMNN to subtask allocation to reach g goals G(x l , y l ), l ∈ [1, g]; End

Refinement of Team Locations
The circles are allowed to overlap in the convex optimization stage and the precise locations of team have not been determined. The solution of the first stage provides relative locations for all the circles enclosing robots. In this paper, we consider a relative location matrix to encode relative locations. Using this technique the non-overlap constraints that are originally disjunctive, non-linear, and non-convex can be linearized and easily enforced in the second stage model. We consider the Delaunay triangulation (DT) method for two main reasons, (i) it spreads out circles thus teams in the workspace and (ii) it transforms the relative location graph into a planar graph. In this framework, robot deployment resolved by the convex optimization with relative locations in Figure 5 is refined by the proposed DT method shown in Figure 6.

SOM Neural Networks for Sub-Task Allocation and Path Planning
In this stage, a swarm of robots are assigned locally as sub-tasks to their goals through self-organizing map based neural networks (SOMNN) method in static or dynamic environments. The principal technical superiority of this SOMNN model is well established, given that the robot trajectory planning is fused with the sub-task allocation in every team and their goals. Once the relative locations of teams with robots are defined, the robots move to their goals in the dynamical environments. In our research, there are k teams, each of which has a swarm of robots. There are α robots in one team T 1 (α) in a workspace, which are assigned to η goals with pre-defined locations, such as recharging pile for autonomous electric vehicles. In this paper, SOM-based neural network model is made up of two layers of neurons (nodes). The first layer configured as the input layer consists of two neurons (u i , v i ) representing coordinates of the ith goal G i (u i , v i ). The second layer as the output layer contains α × η neurons, denoted as r 1 1 , r 2 1 , . . . , r η 1 , r 1 2 , r 2 2 , . . . , r η 2 , r 1 α , r 2 α , . . . , r η α , representing the locations of the α robots and their trajectories. All locations of goals form input dataset. Each neuron in the output layer is fully connected to the neurons in the input layer [50] (see Figure 7). A weight vector connecting the two input nodes to output nodes is defined as rη α = wη α (x), wη α (y) , whereα = 1, 2, · · · , α;η = 1, 2, · · · , η. The weight vectors of the neurons for each robot are initialized based on the coordinates of the initial robot location. Therefore, α robots move to the η goals while following planned path, due to this SOM neural network algorithm. A winner Nη α is determined in light of the following criterion.
where P is a parameter determining the equitable distribution of sub-task of workload for robots. |·| represents the Euclidean distance. The parameter P is expressed as where α is the trajectory length of theα th robot (α = 1, 2, · · · , α).Ṽ A is the average trajectory length of the robots, given by the following Equation (16).
The winner implies the neuron in the group of the output neurons with a lower workload of that location, and the neuron with the minimum distance toward the input data. The SOM NN is updated by modifying its weigh vectors rη α = wη α (x), wη α (y) (α = 1, 2, · · · , α;η = 1, 2, · · · , η), by the following rule (17), until the weight vectors remain unchanged.
where ϑ is the learning rate. µ is a little constant, usually smaller than 0.5. ψ is the minimum distance between any two neurons of the goal locations. f (dη˙α) is the neighborhood function, defined as where Λ is a small constant denoting the range of neighborhood, normally less than 0.4. dη α is the distance measured between the neuron and the winner neuron, Nη α , from the group of the output neurons. The gain constant ψ with an initial value of 10 is given by where ψ is the gain changing rate usually determined empirically. Usually, the range of ψ is between 0.001 and 0.05. Smaller ψ leads to shorter total trajectory and longer computational time. t is the number of iterations. The implementation of the proposed multi-robot deployment method is described in Algorithm 2.

Simulated Experiments and Comparison Studies
In this section, simulation and comparison studies are conducted to validate our proposed framework.

Numerical Experiments by Convex Optimization
Numerical experiments are carried out to validate the convex optimization model. In the first set of experiments, we focus on the multi-robot deployment problem through the convex optimization model with equal number of robots and goals, summarized in Table 1. It aims to experimentally verify the correctness of the results obtained from the proposed algorithm. In this experiment, multiple robots are enclosed in their teams before they are deployed to their goals. It assumes that number of robots is equal to the goals, except in Team 2, 50 robots will be assigned to two goals. The total distance the robots traveled is calculated. The first set of experimental results are illustrated in Figure 8.  In the second set of experiments, multi-robot deployment through the proposed convex optimization model is obtained with different numbers of robots and goals. The results are summarized in Table 2. The second set of experimental results are illustrated in Figure 9.  For lack of space, we present our numerical experiments concerning some maps and scenarios, but our results are representative for a broad range of environments. In the next section, we will consider two maps obtained from the publicly available papers.

Evaluation Using Standard Environments
In this section, complex robot allocation experiments in standard environments with multiple constraints are conducted to further validate the robustness and effectiveness of our proposed convex optimization framework. Standard environments, such as apte, xerox, hp, ami33, and ami49, provided by Microelectronics Center of North Carolina (MCNC) are used in our analyses.
In these scenarios, robots are clustered into predetermined teams. For instance, in the apte environment, 287 robots are distributed into nine teams (Table 3). They are required to perform complex task allocation under 97 constraints. Constraints are limitations or rules that need to be respected by the optimization framework. Several constraints are listed in Table 4. The first constraint is to ensure that nine robots must reach goal location 37. These nine robots must be one from each team T 1 , T 2 , . . . , T 9 , respectively, as outlined in Table 4. The second constraint is to ensure that eight robots reach goal location 55, and each of these eight robots must be one from each team T 1 , T 2 , . . . , T 8 , respectively. Similarly, the third constraint is that two robots must reach goal location 17, and these two robots must be from groups T 1 and T 2 . Some robots in the teams are required to reach specific goals. In the apte standard environment, 97 such constraints must be respected. The initial apte environment setting of robots and goals is shown in Figure 10. In this stage, the robots are clustered into nine teams that are assigned to goals located on the boundary. In such scenario, our framework identified an optimized solution on an average in 0.69 s where the average length of all robots path is 425.09 m.  Similarly, in the xerox standard environment with 203 constraints, 698 robots were distributed into 10 teams and were tasked to reach two goals. On average, the optimized solution in this scenario was discovered in 1.12 s where the average path length of all robots is 411 m. Likewise, our framework was able to identify optimal solutions in hp, ami33, and ami49 environments in an average of 1.17, 14.16, and 9.96 s, respectively. As presented in Table 3, the average path lengths for all environments are 154.84, 65.31, and 699 m, respectively. From Table 3, it can also be observed that though number of constraints are more in xerox environment compared to apte, the average length discovered by our framework is smaller in the xerox environment than that of apte. Similar observations can be made from the results for other environments. This is attributed to the fact that not all constraints in different environments are similar. Therefore, the execution time taken by the model and the path length required by the robots to reach, respectively, goals vary significantly based on the environments.

Simulations and Comparison Studies
Two test environmental maps are obtained using a SLAM (Simultaneous Localization and Mapping) algorithm through a publicly available dataset [26]. We used these two building maps to test our framework by assuming the correlative robots using nodes as targets. In this building test scenario, we demonstrate our framework is applicable for the seven-robot deployment. As there are seven nodes corresponding to seven targets, seven robots will be assigned to these seven rooms illustrated in Figure 11a. In this scenario, the deployment aims to deploy at least one robot in each of nodes (rooms). Robots are assigned to seven rooms with minimized travel distance cost, which is formulated to the developed convex optimization model. As this test case is straightforward with only seven targets, the relative locations of solution is obtained from the convex optimization model before they are refined and spread out. Finally, these seven robots are deployed to seven rooms with total distance minimized.
A more complicated indoor room environment is depicted in Figure 11b. In this test scenario, 22 rooms are supposed to be deployed by 22 robots, in which one goal is reached by at least one robot. Our model initially selects Nodes 7, 9, 10, and 13 as the main teams. Based on the interconnection depicted in Figure 11b, a team consists of several robots to be deployed. In this case, Node 7 consists of Robots r 1 , r 2 , r 3 , r 4 ,r 5 , r 6 , r 11 , and r 9 . Node 10 contains Robots r 20 , r 21 , and r 22 . Node 13 contains Robots r 9 , r 14 , and r 15 . Node 9 contains Robots r 12 , r 16 , r 17 , and r 19 . Therefore, our developed convex optimization model creates relative locations of Nodes (Teams) 7, 9, 10, and 13. The total distance cost of among these nodes/teams is minimized, where the 22-robot swarms are assigned their final goals in this building. After applying for the refinement of teams, we obtain their locations of nodes. In each team, it consists of a few robots, such as Node 7 consists of Robots r 1 , r 2 , r 3 , r 4 , r 5 , r 6 , and r 11 . The next stage, these seven robots driven by the SMNN model are navigated to seven rooms. It is clear that node as team containing robots could be the team member of another node. For instance, Node 9 is a member but it is contained in Node 7. This is beneficial of the developed convex optimization able to solve this sort of optimization problem.
In this test scenario of simulation, these 22 robots are clustered into four teams in task decomposition according to their interconnection of robots and correlation of teams. There are four teams, N 7 with 10 robots; N 9 with 9 robots; N 10 with 4 robots; and N 13 with 3 robots. First stage with our convex optimization model creates relative locations of four teams, in which the total distance cost is minimized shown in Figure 11c. Once the relative locations of these four teams (circles) are determined, a DT method ia applied to spread out them to their appropriate locations shown in Figure 11d. The correlation of teams are shown in Figure 11d. For instance, N 7 is assumed to connect with N 9 and N 10 . Robots within their teams are located in vicinity of their targets (rooms). The SOMNN model is used to assign members of robots to their targets (rooms). Therefore, the robots are deployed to their final targets-rooms illustrated in Figure 11b.
Our framework is compared with Hungarian algorithm [51]. Assume that m robots are assigned to k tasks located at certain goals. The interrelation of robots are assumed. Our framework clusters these robots into teams before they are assigned to goals. Hungarian algorithm directly assigns robots to their tasks and goals. We simulate 6, 11, 30, 33, and 49 robots to be deployed to same number of goals. The results show that, in terms of total path length, our proposed model is 10.14%, 14.26%, 8.69%, 3.96%, and 18.22% lower than the Hungarian algorithm for 6, 11, 30, 33, and 49 targets, respectively. The distance robots traveled to goals as a measure is compared with Hungarian algorithm and illustrated in Figure 12.

Conclusions
A new framework of team-based multi-robot deployment is proposed through convex optimization model. Multiple robots are deployed to their appropriate locations while the total distance cost is minimized, by the convex optimization mission and robot path planning. The SOMNN paradigm is used to dynamically fulfill the sub-task allocation task. The proposed model couples task decomposition, deployment, local subtask allocation and path planning to support cases where the optimal solution depends on robot interrelation, dependencies, and availability, as well as inter-team conflict avoidance. Simulation and comparison studies validate the effectiveness of our proposed framework. This framework will be implemented on actual robots in the near future work.