An Effective Variable Transformation Strategy in Multitasking Evolutionary Algorithms

Multitasking evolutionary algorithm (MTEA), which solves multiple optimization tasks simultaneously in a single run, has received considerable attention in the community of evolutionary computation, and several algorithms have been proposed in the literature. Unfortunately, knowledge transfer between constituent tasks may cause negative effect on algorithm performance, especially when the optimal solutions of all tasks are in different locations of the unified search space. To address this issue, an effective variable transformation strategy and the corresponding inverse transformation are proposed in multitasking optimization scenario. After using variable transformation strategy, the estimated optimal solutions of all tasks are both near the center point of the unified search space. More importantly, this strategy can enhance the task similarity, and then the effectiveness of knowledge transfer will probably be positive in this case, which can help us to improve the algorithm performance. Keeping this in mind, a multitasking evolutionary algorithm (named MTDE-VT) is realized as an instance by embedding the proposed variable transformation strategy into multitasking differential evolution. In MTDE-VT, the individuals in the original population are first transformed into new locations by the variable transformation strategy. Once the offspring is generated in the transformed unified search space, it must be transformed back to the original unified search space. +e statistical analysis of experimental results on some multitasking optimization benchmark problems illustrates the superiority of the proposed MTDE-VT algorithm in terms of solution accuracy and robustness. Furthermore, the basic principle and the good parameter combination are also provided based on massive simulated data.


Introduction
Optimization problems in science and engineering solved by the evolutionary algorithm (EA) can generally be classified into two groups: single-objective optimization and multiobjective optimization [1][2][3][4]. As a third category of optimization problems, multitasking optimization (MTO) was recently introduced, for solving multiple optimization tasks concurrently [5]. Within this scenario, multiple search spaces corresponding to different tasks, each possessing a unique function landscape, are used.
It is well known that real-world problems seldom exist in isolation and are usually mixed with each other. e knowledge extracted from the past learning experience can be constructively applied to solve more complex or unseen tasks. In contrast to traditional EA, multifactorial evolutionary algorithm (MFEA) has been proposed for MTO problems [5]. As an emerging paradigm in evolutionary computation, MFEA is capable to deal with multiple selfcontained tasks simultaneously by using a single run. Due to knowledge transfer and sharing between tasks, it has demonstrated the capability to outperform its single-task counterpart in terms of both convergence speed and solution quality.
It should be noted here that the good performance of MFEA is greatly affected by the similarity between tasks [6][7][8]. In the literature, there are some similarity measures between multitasks, such as maximum mean discrepancy (MMD), Sperman rank correlation coefficient (SRCC), and crosstask fitness distance correlation (CTFDC) [9]. Intuitively, when the task similarity is high, positive transfer is most likely to occur and helps improve the algorithm performance. On the contrary, when the task similarity is low, negative transfer between tasks may happen and then may cause the population to evolve in the wrong direction or to fall into local optimum.
For MTO problems, different constituent tasks have their own characteristics. Generally speaking, the optimal solutions of all tasks tend to be in different locations of the unified search space. Within the range between those optimal solutions of different tasks, the trend of those objective functions may be in different directions. As a result, the effectiveness of knowledge transfer and sharing in evolutionary multitasking may degrade or even negative in this case. In this paper, our motivation is to map the optimal solution to the center point of the unified search space so that the growth trends of all tasks are similar. Few works have been conducted in the literature for variable transformation in evolutionary multitasking. To the best of our knowledge, Ding et al. have considered the variable transformation in [10], which will be reviewed and distinguished in Section 3.3. is paper thus presents another attempt to fill this gap.
e main contributions of this work can be summarized as follows: (1) An effective variable transformation strategy and the corresponding inverse transformation are proposed in multitasking optimization scenario. (2) is paper proposes a new multitasking evolutionary algorithm, namely, multitasking differential evolution with variable transformation strategy (MTDE-VT). In MTDE-VT, the individuals in the original population are first transformed into new locations by the variable transformation strategy. Once the offspring is generated in the transformed unified search space, it must be transformed back to the original unified search space. (3) Comprehensive experimental study on some MTO problems is carried out to verify the efficacy of the proposed MTDE-VT, and the basic principle and the good parameter combination are also provided based on massive simulated data. e remainder of this paper is organized as follows: in Section 2, a brief description of MFEA, differential evolution (DE), and related works on MTEA is given. Section 3 begins with a detailed introduction of multitasking differential evolution. e motivation and mathematical definition of variable transformation strategy and the corresponding inverse transformation are also presented in this section. e proposed MTDE-VT algorithm for MTO problems is then described in detail. Section 4 discusses the experimental studies conducted to verify the efficacy of our proposed MTDE-VT in the multitasking optimization scenario. Finally, Section 5 concludes this article with several remarks and a discussion on potential future works.

Multifactorial Evolutionary Algorithm.
As the first realization of MTEA, MFEA shrewdly combines the genetic algorithm (GA) into a MTEA paradigm [5]. To seamlessly transfer the valuable knowledge in multitasking, all candidate solutions are encoded into a unified search space by a random key scheme. Accordingly, while tackling one of the optimization tasks, a subset of variables are extracted from the corresponding chromosome and decoded into a task-specific solution space. en, during population evolution of MFEA, assortative mating and vertical cultural transmission are both utilized to generate offspring, thereby leading to population diversification and implicit knowledge in multitasking optimization scenario. e notion of assortative mating guarantees that the individuals from the same task have high probability to mate and generate an offspring. Following vertical cultural transmission in MFEA, the phenotype of offspring is directly inherited by its parents. We again recommend [5] for more details on these procedures.

Related Works on Multitasking Evolutionary Algorithms.
Since the first invention of MFEA in 2016, due to its simple implementation and strong search capability, intensive research efforts have been devoted to this area. For the sake of clarity, these works can be divided into three categories: algorithm framework, algorithm improvement, and typical application [11].
A general framework, evolution of biocoenosis through symbiosis (EBS), was proposed for EAs to deal with manytasking problems [12]. is framework has two main features, i.e., the adaptive control of knowledge transfer among different tasks and the selection of candidates for evaluation from concatenate offspring. Another multipopulation evolution framework (MPEF) was first established for MTO, wherein each optimization task possesses an independent population and has its random mating probability for exploiting the useful knowledge of other tasks [13]. Recently, MTO with dynamic resource allocation (MTO-DRA) was proposed [14]. It can allocate and control computational resources to each task adaptively according to the requirements. Hou et al. [15] proposed an evolutionary transfer reinforcement learning framework for multiagent intelligent system, which can adapt to the dynamic environment.
In multitasking optimization scenario, the major function of across-population reproduction is knowledge transfer between different subpopulations in order to accelerate the search process. In addition to routine operations [13,[16][17][18], an interesting reproduction approach is called explicit genetic transfer across tasks [19]. By configuring the input and output layers to represent two task domains, the hidden representation provides a possibility for conducting knowledge transfer across task domains. However, this approach will fail on combinatorial optimization problems. To handle this issue, a weighted l 1 -norm-regularized formulation was derived to capture the sparse mapping across vehicle routing problems (VRPs), and a solution-based knowledge transfer was also proposed by Feng et al. [20]. A novel genetic transform strategy has been proposed very recently [21]. Based on the two mapping vectors, two parent individuals can be mapped to the vicinity of the other solutions. More recently, a new MFEA with adaptive knowledge transfer called MFEA-AKT was presented to achieve an efficient and robust performance [22]. It is able to adaptively configure the appropriate crossover operator for knowledge transfer across tasks based on the information collected along the evolutionary search process. A novel multiobjective MTEA algorithm (called MOMFEA-SADE), the winner of the Competition on Evolutionary Multitask Optimization within IEEE 2019 Congress on Evolutionary Computation, was proposed in [23]. Particularly, two transforming matrices are used to transform the search space of the population and reduce the probability of negative knowledge transfer between tasks.
Another potential research direction is parameter control or adaptation, especially for random mating probability (rmp). In classic MFEA, it is used to balance the diversity and convergence capability of the obtained solutions and is set as a constant of 0.3 [5]. If a task can be improved more times by the offspring from other tasks, the probability of knowledge transfer should be increased; otherwise, this rate will be decreased [12]. A simple random searching method was introduced to adjust this parameter [14]. e current rmp is stored in the candidate list when at least one of the K best solutions is updated by a better solution. Otherwise, the parameter is adaptively determined. It is very recently reported that an online rmp estimation technique was proposed in order to theoretically minimize the negative interactions between distinct optimization tasks [24]. Specifically, the extent of transfer parameter matrix is learned and adapted online based on the optimal blending of probabilistic models in a purely data-driven manner. We note that this approach can be applied to the domain of multiobjective optimization [25] and permutation-based discrete optimization [26] as well.
In the literature, there exist a lot of works to apply multitasking evolutionary algorithms to tackle real-world problems, such as complex supply chain network management [27], composites manufacturing problem [28], pollution-routing problem [29], operational indices optimization of beneficiation process [30], HIV protease cleavage sites prediction [31], hyperspectral image unmixing [32,33], and time series prediction problem [34].

Differential Evolution.
As a branch of stochastic EA, differential evolution, originally proposed by Price and Storn in 1995 [35], has been proven to be a simple yet effective, robust, and reliable global optimizer. Like other EAs, it starts with a population of N p vectors representing the candidate solutions, where N p indicates the population size. Let us assume that X i,g � [x i,1,g , x i,2,g , . . ., x i,D,g ,] is the ith candidate solution vector in generation g, where i � 1, 2, . . ., N p , D is the problem's dimension, and g is the generation index. For the original DE, its working depends on the manipulation and efficiency of three operators: mutation, crossover, and selection, which are described as follows.
e mutation operation enables DE to explore the search space and maintain diversity. In the literature, five mutation strategies have been commonly used [36], detail description of which are given as follows: . , v i,D,g ] denotes the mutant vector generated for each individual X i,g , r1, r2, r3, r4, and r5 are the random and mutually exclusive integers chosen from the set {1, 2, . . ., N p }, F is the scaling factor which controls the amplitude of the difference vector, and X best,g gives the solution with the best fitness found so far. e aim of the crossover operator is to build trial vectors by recombining the current vector and the mutant one. e family of DE algorithms employs two crossover schemes: exponential and binomial crossovers. e binomial crossover is utilized in this paper and is briefly discussed below. In the binomial crossover, the trial vector U i,g � (u i,1,g , u i,2,g , . . . , u i,D,g ) is defined as follows: where CR ∈ [0, 1] is the predefined crossover rate, rand is a uniform random number within [0, 1], and rand j ∈ {1, 2, . . ., D} is a randomly selected index which is used to ensure that at least one dimension of the trial vector is changed. After the crossover, a greedy selection mechanism is used to select the better one between the parent vector X i,g and the trial vector U i,g according to their fitness values f(·). If, and only if, the trial vector is better or equal than the parent vector, then it replaces the current vector, otherwise the current vector survives while the trial one is eliminated, as described below: ⎧ ⎨ ⎩ (7)

Proposed Algorithms: MTDE and MTDE-VT
3.1. MTDE. As mentioned above, MTEA is a new and general framework for solving MTO problems where diverse population-based search mechanisms are employed. In particular, MTEA is conducted with the DE algorithm, namely, MTDE in this paper. Its basic structure is depicted in Algorithm 1.

Complexity
In the previous work [11], from theoretical analysis and simulation experiment, MFEA and its variations were reviewed carefully in view of multipopulation evolution model. erefore, it should be especially explained that MTDE is also conducted as the multipopulation evolution model, wherein each population is represented independently and evolved for the selected task only.
In the original MFEA, the skill factor of each individual should be computed and updated every generation. On the contrary, the obvious feature of multipopulation evolution strategy is that the new candidate solution generated by mutation and crossover is assigned to a fixed subpopulation. One of the advantages of this approach is to analyze the contribution of each population. e main difference between DE and MTDE are mainly in mutation strategy. In the original DE/rand/1 explained in Section 2.3, three parents are chosen randomly from the same population (task) and then a mutant vector V i,g is generated by using Equation (1). On the contrary, in MTDE, a random value rand is first generated in the range of [0, 1]. If rand is less than the predefined mating probability value rmp, across-population mutation is executed, where three parents are chosen randomly from different populations (tasks), as shown in lines 2-7 of Algorithm 2. Otherwise, the original mutation is done as shown in lines 9-10 of Algorithm 2.
Note that, in order to avoid population drift, we use a new mutation approach across subpopulations P k and P r (k ≠ r) in MTDE as follows: where V k i,g , X k r1,g , X k r2,g , and X k r3,g are the different vectors in subpopulations P k and X r r1,g , X r r2,g , and X r r3,g are the vectors in subpopulations P r .

Motivation.
In order to illustrate the motivation of variable transformation visually, some problem instances and conceptual map are depicted in Figure 1 in multitasking optimization scenario. Specifically, in the top of Figure 1, two multitask problems are shown, in which the blue solid and the red dotted line represent task 1 and task 2, respectively. Next, in the middle of Figure 1, only the exact information of optimal solution and boundary value are reserved, and the other details of each task are ignored deliberately. In this way, a simple conceptual map of multitasking optimization problems can be built easily. Note that different tasks with the same optimum will map into the similar conceptual map, as shown in Figure 1. Lastly, for each task, the key transformations (the variable transformation strategy and the corresponding inverse Input: K tasks (T 1 , T 2 , . . ., T K ) Parameter: population size (N p ), crossover rate (CR), scaling factor (F), and random mating probability (rmp) Output: K best solutions (S 1 , S 2 , . . ., S K )C Step 1: initialization (1) For each task T k do (2) Generate an initial population P k randomly (3) Evaluate all individuals in population on P k based on task T k (4) Find the best solution S k for task T k (5) End for Step 2: reproduction and selection (6) While stopping conditions are not satisfied do (7) For each task T k do (8) For each individual X i k in population P k do Step 2.1: mutation (9) If rand < rmp then (10) across-population mutation. Refer to Algorithm 2. (11) Else (12) original mutation. Refer to Algorithm 2. (13) End if Step 2.2: crossover (14) Crossover. Refer to Algorithm 2. Step 2.3: selection (15) Selection. Refer to Algorithm 2. (16) End for (17) Update the best solution S k for task T k (18) End for (19) End while Step 3: output the results (20) Return K best solutions (S 1 , S 2 , . . ., S K ) ALGORITHM 1:Multitasking differential evolution. 4 Complexity transformation) in the bottom of Figure 1 are described in Section 3.3. For MTO problems, different constituent tasks have their own characteristics. Generally speaking, the optimal solutions of all tasks tend to be in different locations of the unified search space, as shown in the left column in Figure 1. In this case, the offspring can be generated across the entire search space by assortative mating (e.g., across-population crossover). In this way, the population diversity remains at high level during the entire optimization process. Note, however, that excessive diversity may seriously slow down the algorithm convergence.
ese MTO problem instances can be further abstracted into a simple conceptual map as shown in the middle row of Figure 1. It has been found that within the range between these optimal solutions of multitasks, the trends of those objective functions are in different direction. As illustrated in Figure 1, the optimal solutions of two tasks, indicated by blue solid and red dashed lines, are located at 0.2 and 0.8, respectively. It is observed that from 0.2 to 0.8, it is getting larger and larger for task 1, while the observed trend of task 2 is in the opposite direction. In this particular case, the similarity between two tasks is considered to be the lowest. erefore, the effectiveness of knowledge transfer in evolutionary multitasking may degrade or even negative in this case. As a result, the multitasking evolutionary algorithm (e.g., MFEA) does not work well for these MTO problems due to inefficient knowledge transfer.

Variable Transformation Strategy.
To address the above limitation of all MTEA algorithms for MTO problems whose optimums are located in different positions in the unified search space, this paper proposes an effective variable transformation strategy in the multitasking optimization algorithm. e detail of variable transformation strategy is summarized in Algorithm 3.
Input: target vector in the current generation (X k i,g ) Parameter: population size (N p ), crossover rate (CR), scaling factor (F), and random mating probability (rmp) Output: target vector in the next generation (X k i,g+1 ) Step 1: mutation / * Generate a mutant vector V k i,g for each target vector X k i,g * / (1) If rand < rmp then / * across-population mutation * / (2) Select six vectors X k r1,g , X k r2,g , X k r3,g ,X r r1,g , X r r2,g , and X r r3,g randomly from two subpopulations P k and P r where i ≠ r1 ≠ r2 ≠ r3 and k ≠ r / * original mutation * / (9) Select three vectors X k r1,g ,X k r2,g , and X k r3,g randomly from the current subpopulation Step 2: crossover / * Generate a trial vector U k i,g for each target vector

End if (19) End for
Step 3: selection / * Select a better target vector in generation g 6 Complexity e variable transformation strategy (see ① in the bottom of Figure 1) and the corresponding inverse transformation (see ② in the bottom of Figure 1) are defined as equations (9) and (10), respectively: where cp � (0.5, 0.5, . . ., 0.5) is the center point of the unified search space, p is the solution in the original unified search space, and op is the corresponding solution in the transformed unified search space. Furthermore, m is the estimated optimal solution, which can be calculated as the mean value of the top μ * N p best solutions in the current generation according to Equation (11). Note that four key variables (p, op, m, and cp in Equations (9) and (10)) are marked in the bottom of Figure 1: For example, the population size N p � 10, and the percent of top best solutions μ � 40% in one case. us, the top 4 (�40% × 10) best solutions in the current generation can be used to determine the estimated optimal solution m. Let us suppose p best As illustrated in the right column of Figure 1, after using variable transformation strategy in multitasking scenario, the estimated optimal solutions of all tasks are both near the center point of the unified search space, and the individuals in the left/right region of each task will be mapped (or scaled) into the left/right region. More importantly, according to the right conceptual map in Figure 1, in both the left and right regions of the unified search space, it is getting smaller and smaller for all tasks. erefore, the task similarity is very high, and then the effectiveness of knowledge transfer will probably be positive and can help us to improve the convergence rate in this case.
More recently, another attempt for variable transformation was conducted by Ding et al. in [10]. In particular, each individual in the population is translated to a new location according to the following equation: where p i and op i (i � 1, 2, . . ., N p ) are the ith solution and the corresponding transformed solution, respectively, in the unified search space, N p is the population size, and the translated value d k is estimated based on the promising solutions of the kth task. e main purpose of this approach proposed by Ding et al. is that the transformed optimums of all tasks are located at the same position in the unified search space. Note that the translated direction and distance are both fixed for all individuals. Unfortunately, it is easy for the individuals to jump out of the legal range, and then manual efforts (e.g., random selection within a predefined range) are required to ensure the legality of candidate solutions.
us, this will destroy the original population distribution characteristics.
On the contrary, our approach in this paper is capable of dealing with these issues. To be specific, the variable transformation strategy and the corresponding inverse transformation are both linear transformation as shown in equations (9) and (10). eir output is limited strictly in the whole unified search space [0, 1]. As a result, the relative quality of all individuals will remain unchanged after using variable transformation strategy. It should be noted that the Input: individual (p) in generation g Parameter: start-stop generation (φ), generation interval (θ), percent of top best solutions (μ), and maximum number of generations (Max gen ) Output: individual (op) in the transformed unified search space (1) If g > φ * Max gen (or g < φ * Max gen ) then (2) If mod(g, φ * Max gen ) � 0 then (3) Calculate the estimated optimal solution (m) according to Equation (11)  (4) Calculate the corresponding solution (op) in the transformed unified search space according to Equation (9 Complexity idea of calculating the estimated optimal solutions, defined by equation (11) in this paper, is borrowed from [10].

MTDE-VT.
In order to show the effectiveness of variable transformation strategy, we propose herein a new multitasking evolutionary algorithm, namely, MTDE-VT in this paper. e difference between the original MTDE and MTDE-VT is mainly in the process of offspring reproduction. At each generation, the individuals in the original population are first transformed into new locations by the proposed variable transformation strategy according to equation (9) and then undergo across-population mutation as a conventional genetic operator. Once the mutant vector is generated in the transformed unified search space, the offspring must be transformed back to the original unified search space according to equation (10). e detail of acrosspopulation mutation with variable transformation strategy is displayed in Algorithm 4.
Note that, the variable transformation strategy and the corresponding inverse transformation can only be applied to across-population mutation. When the original mutation is selected for execution based on random mating probability, the variable transformation strategy does not need to be executed in MTDE.

Experimental Setup. Similar to the pioneering work by
Gupta et al. [7], the same benchmark problem set is employed to assess the performance of MTDE-VT in this paper. e utilized test suite includes nine MTO problems, and each problem contains two distinct minimization tasks. Based on the degree of intersection of the global optima, these problems can be divided into three categories: complete intersection (CI), partial intersection (PI), and no intersection (NI). Moreover, based on the similarity between the fitness landscapes, they can also be categorized into three groups: high similarity (HS), medium similarity (MS), and low similarity (LS). More details about these benchmark problems can refer to the technical report [37] and the homepage of Prof. Yew-Song Ong (http://www.ntu.edu.sg/ home/asysong/).
As mentioned above, the idea and definition of the estimated optimal solutions are borrowed from [10]. erefore, three key parameters of variable transformation strategy (φ, θ, and μ in Algorithm 3) are set as the recommended values in [10]. For fairness of comparison, all parameter settings for the proposed MTDE-VT and other algorithms are kept identical to [37] and listed as follows: It is noted that in order to minimize the effect of stochastic nature on each measured metric, the reported result is the average over 50 trials.
Lastly, the empirical studies presented in this paper are conducted under Windows XP OS using the computer with a 2.2 GHz Intel Corei3 processor and 4 GB RAM.

Experimental Series 1: Comparison of MTDE and MTDE-VT.
e experimental results of MTDE and MTDE-VT are provided in Table 1, and the best entry for each task is highlighted in boldface. Furthermore, the Wilcoxon rank sum test is also adopted at a significance level of 0.05 [38].
From the results in Table 1, we can see that MTDE-VT performs significantly better than the original MTDE on 8 out of 18 tasks and has equivalent performance on other 10 tasks. is result indicates that the variable transformation strategy is able to maintain the high-quality solution during the population evolution and knowledge transformation between tasks.
Interestingly, the best optimal solution obtained by MTDE-VT is better than the original MFEA on 16 out of 18 tasks, while the mean optimal solution by MTDE-VT is better than the MFEA on 14 tasks. Although the average value is a good index of the overall performance of stochastic algorithms, the best value obtained is a more valuable indicator. e reason is that MTDE-VT can always get a satisfactory solution by trying more times. In our opinion, the best optimal solution obtained by MTDE-VT benefits from the fast convergence speed is compared with MFEA, regardless of the final performance level. In fact, the proposed variable transformation strategy has great exploitation ability. When the estimated optimal solution is near the global optimal solution, MTDE-VT leads to fast convergence to the global best solution. On the contrary, when it is near the local optimal solution, MTDE-VT may easily trap into local optimal solution or even cannot solve successfully the problem.

Experimental Series 2: Universality for Evolutionary
Computing. One of the most prominent features of variable transformation strategy is its universality for all evolutionary algorithms. e single prerequisite to variable transformation strategy is the differential optimal solutions of all tasks to be solved simultaneously. Obviously, this hypothesis of the proposed strategy and its corresponding computing process has nothing to do with the benchmark evolutionary algorithm. In other words, the variable transformation strategy works well for any swarm and evolutionary algorithms.
For simplicity, in order to reveal its universality of variable transformation strategy, several common variants of DE are tested in this section. For example, MTDE (F � 0.7, CR � 0.3) represents the basic MTDE/rand/1 algorithm with new parameters (F � 0.7, CR � 0.3), MTDE(best/1) is the new 8 Complexity MTDE/best/1 algorithm under default parameters (F � 0.5 and CR � 0.9), and so on. Comparatively speaking, two different vector strategies, such as DE/rand/2 algorithm, have the ability to improve the diversity by producing more trial vectors, and the DE/best/1 algorithm speeds up convergence due to its powerful exploitation ability. eir details are given in Section 2.3. e experimental results of several variants of MTDE and MTDE-VT algorithms are summarized in Table 2, and the Wilcoxon rank sum test is also adopted at a significance level of 0.05. First of all, we observe that, in most cases, the performance of the MTDE (or MTDE-VT) algorithm with diverse parameters or mutation strategies is distinctly different. For example, when solving task 1 of problem 1 (CI + HS), the obtained solution by using the MTDE/rand/1 algorithm under F � 0.5, CR � 0.9 (the first row in Table 1) is near the true optimal solution, while the mean solution obtained by using the MTDE/rand/2 algorithm (the first row in Table 2) is really bad.
is fact from a side indicates that these variants of MTDE (or MTDE-VT) can be considered as independent evolutionary algorithms.
By comparing different MTDE and the corresponding MTDE-VT algorithms, the effect of variable transformation strategy on different evolutionary algorithms can be observed and measured. Impressively, the MTDE/best/1 algorithm is obviously improved by variable transformation strategy. Except for the task 1 of problem 3 (CI + LS), the MTDE_VT/best/1 algorithm performs significantly better than the corresponding MTDE on the remaining 17 tasks.
However, the outcome of the MTDE-VT/rand/2 algorithm is not very ideal, and compared with MTDE/rand/2, it even goes bad for task 1 of problem 4 (PI + HS). We hold that the variable transformation strategy reveals a very positive effect on evolutionary algorithm with powerful exploitation. e possible reason may be that, for these MTO problems, MTDE/rand/2 under the given parameters can improve the population diversity but speed down the convergence to some extent.
Fortunately, the result of the MTDE-VT (F � 0.7, CR � 0.3) algorithm can further examine the rationality of the above explanation. Generally, a large value of CR speeds up convergence, and a larger F increases the probability of Input: target vector (X k i,g ) Output: mutant vector (V k i,g ) (1) Select six vectors X k r1,g , X k r2,g , X k r3,g , X r r1,g , X r r2,g and X r r3,g randomly from two subpopulations P k and P r where i ≠ r1 ≠ r2 ≠ r3 and k ≠ r (2) Compute the estimated optimal solution m in subpopulation P k (3) Compute the translated vectors OX k r1,g , OX k r2,g , OX k r3,g , OX r r1,g , OX r r2,g and OX r r3,g according to Equation (9) (4) If rand < 0.5 then (9) Compute the mutant vectors V k i,g according to Equation (8) ALGORITHM 4: Across-population mutation with variable translation strategy.  erefore, the MTDE (F � 0.7, CR � 0.3) algorithm has a good exploration ability, and the corresponding MTDE-VT (F � 0.7, CR � 0.3) algorithm does not reveal much more signification affection on the performance improvement. Specifically, from Table 2, it just performs significantly better than MTDE on 5 out of 18 tasks.

Experimental Series 3: Parameter Sensitivity Analysis.
As illustrated in Algorithm 3, the variable transformation strategy consists of three control parameters (φ, θ, and μ).
Here, we analyze their effects on the performance of MTDE-VT. To this end, we test MTDE-VT with various parameter settings on nine MTO problems. In this paper, we set μ � 2%, 10%, 20%, 40%, and 50%; φ � 0.2_start, 0.1_start, 0, 0.8_stop, and 0.9_stop; and θ � 0.05, 0.02, 0.01, and 0.001 as the potential values. e MTDE-VT algorithm runs under all of these parameter combinations. In total, that is exactly 100 parameter combinations (or MTDE-VT algorithms) in this paper. e other parameters and differential evolution operators remain the same as those in Section 4.1.
It should be noted that, in this paper, the variable transformation strategy may not be conducted at the beginning or the end of population evolution. Specifically, g > φ * Max gen and g < φ * Max genn represent two conditions, respectively, as described by Line 1 in Algorithm 3. In this paper, the suffix "start" and "stop" are used to distinguish the two cases clearly. For example, φ � 0.2_start means that the variable transformation strategy is conducted from the 0.2 * Max gen generation to the end of population evolution. Similarly, φ � 0.8_stop means that it is conducted from the very beginning to the 0.8 * Max gen generation.
To compare the performance of MTDE-VT algorithms under diverse parameter settings, we rank the tested algorithms as their reward points. For each algorithm, the reward point is the obtained rankings with respect to each component task of each MTO problem. en, we accumulate the obtained reward points over all component tasks of all MTO problems to obtain the final score.
For instance, Tables 3 and 4 give an illustrative example of scoring procedure. Specifically, Table 3 gives the median fitness values obtained by all algorithms with different parameters.
e smaller the fitness value is, the better the performance of the algorithm under the given parameter is. en, we get the reward points of all algorithms with respect to each component task, as depicted in Table 4. According to their final scores, we can conclude that Algorithm A is the best algorithm and B is the worst one over three MTO test problems.
Generally speaking, poor individuals are rare but have a great impact on the mean fitness values. In consideration of both statistical features and fairness, we use the median fitness rather than the mean fitness, to rank and score the MTDE-VT algorithms with diverse parameter settings. e total score (reward points) of all algorithms are illustrated in Figure 2. Note that, the x-axis represents the serial number of MTDE-VT algorithms with different parameter settings. For example, the "96" in Figure 2 means that μ � 2%, φ � 0.9_stop, and θ � 0.001 as mentioned above.
As can be observed from Figure 2, MTDE-VT under the "76" parameter setting (μ � 2%, φ � 0.2_start, and θ � 0.001) gets the highest score (1276 points), which means it offers the best algorithm performance over all MTO problems. e median best fitness and the corresponding ranking are recorded in the last row in Table 1. We can see that MTDE-VT (optimal) performs significantly better than the original MTDE-VT on 12 out of 18 tasks and is worse than it on 6 tasks. e original MTDE-VT ("34" parameter setting) studied in Section 4.2 gets 1149 reward points, and ranks 57 among all test parameter settings. e detail results of the best fitness are displayed in Table 5. Taken individually, the MTDE-VT algorithm with "81" parameter setting exhibits the best performance on 10 tasks, and "76" parameter setting does well on 2 tasks. Furthermore, the algorithm performance trends of three control parameters are investigated, as shown in  e algorithm performance is measured by the mean reward points over all tasks and all parameter combinations. For example, for the mean reward points of μ (e.g., 2%), we should take into account all 20 parameter combinations of φ (namely, 0.2_strat, 0.1_start, 0, 0.8_stop, and 0.9_stop) and θ (namely, 0.05, 0.02, 0.01, and 0.001).
As mentioned above, μ represents the top percent best solutions in the current population. A smaller μ means a few but excellent individuals are chosen to calculate the  estimated optimal solution. As a result, the estimated optimal solution can lead the whole population to search for more potential areas. Certainly, in the extreme case (e.g., μ � 2% in this paper), only the best solution are used to estimate the optimal solution of the whole population, which may destroy the diversity of the population. erefore, its performance drops sharply, as shown in Figure 3. Moreover, θ is another key parameter of variable transformation strategy. A large number means few variable transformation strategies are executed, while a small number means more frequent operation. From Figure 4, it is reasonable to set a smaller value of θ (e.g., θ � 0.01). It indicates that the proposed variable transformation strategy can help the population to evolve towards the global optimal solution. On the contrary, this strategy cannot be used too frequently because it is possible to fast fall into local optimal solution. For example, when θ � 0.001, the variable transformation strategy is executed every generation. Unfortunately, its mean reward points had actually fallen relative to the optimal case (θ � 0.01 in this paper). e third parameter, φ, can decide when to start and stop the variable transformation strategy. e parameter values we studied here can be divided into three groups. e first group is 0, which means the variable transformation strategy is executed during the whole evolution process. e second one is 0.1_start and 0.2_start, which mean it is executed from 0.1 * Max gen or 0.2 * Max gen to the end, respectively. e third one is 0.8_stop and 0.9_stop, which mean it is executed from the beginning to 0.8 * Max gen or 0.9 * Max gen , respectively. We can see from Figure 5, MTDE-VT under the second parameter group illustrates the best performance compared with other two parameter groups. ere is reason to believe that it is good choice to run the variable transformation strategy after   12 Complexity some generations. In other words, the function of the proposed strategy will be more obvious at the beginning of population evolution than the ending part.
From Figures 3-5, it can be observed that the optimal parameter should be set as μ � 0.1, φ � 0.2_start, and θ � 0.01. In fact, MTDE-VT under this parameter combination ranks fifth out of 100 algorithms. Although it is not the best one, it is a satisfactory good recommendation in most cases.

Conclusions
In this paper, MTO problem with different optimal solutions has been investigated. It is found that, within the range between these optimal solutions, the trends of these objective functions are in different directions, which may lead to negative knowledge transfer. To boost the task similarity and robust evolutionary multitasking performance in solving MTO problems, a novel variable transformation strategy and the corresponding inverse transformation are proposed. After using variable transformation strategy, the estimated optimal solutions of all tasks are near the center point of the unified search space. Furthermore, all individuals in the transformed space will evolve to the center point, avoiding advancing in the opposite direction. erefore, the effectiveness of knowledge transfer will probably be positive and can help us to improve the convergence speed in this case.
To crystallize this idea, we embed the proposed variable transformation strategy into MTDE, forming a novel algorithm (called MTDE-VT algorithm). e experiment results demonstrate that variable transformation strategy is an effective approach to exploit the promising areas and it works well for all evolutionary algorithms. Furthermore, the basic principle and the good parameter combination are also provided based on massive simulated data.
For future works, we would like to further study the effectiveness of variable transformation strategy when solving more complex benchmark and practical MTO problems. Moreover, we also plan to explore the similarity measure between constituent tasks and design new strategy to boost the task similarity, which can be considered as guidance for applying variable transformation strategy into the evolutionary algorithm.

Data Availability
e data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest
e authors declare that they have no conflicts of interest.