Article

Parallel Binary Rafflesia Optimization Algorithm and Its Application in Feature Selection Problem

1 College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao 266590, China
2 Department of Information Management, Chaoyang University of Technology, Taichung 41349, Taiwan
3 Departments of Computer Information System and Computer Sciences, Faculty of Computer Science and Informatics, Amman Arab University, Amman 11953, Jordan
* Author to whom correspondence should be addressed.
Symmetry 2023, 15(5), 1073; https://doi.org/10.3390/sym15051073
Submission received: 25 April 2023 / Revised: 9 May 2023 / Accepted: 10 May 2023 / Published: 12 May 2023

Abstract

The Rafflesia Optimization Algorithm (ROA) is a new swarm intelligence optimization algorithm inspired by the biological behavior of Rafflesia. It is efficient, converges quickly, and effectively avoids falling into local optima. It has been applied to the logistics distribution center location problem, where its superiority has been demonstrated. However, ROA was designed for continuous problems, whereas many practical problems are binary. We therefore designed a binary version of ROA. Transfer functions are used to map continuous values to binary values, and the binary values symmetrically represent the meaning of the underlying physical problem. In this paper, four families of transfer functions are used to binarize ROA, and the original transfer functions are improved to raise the overall performance of the algorithm. On this basis, we further improve the algorithm by adopting a parallel strategy, which increases its convergence speed and global exploration ability. The algorithm is verified on 23 benchmark functions, where the parallel binary ROA performs better than several existing algorithms. On the application side, this paper uses UCI datasets for feature selection; the improved algorithm attains higher accuracy while selecting fewer features.

1. Introduction

With the rise of the Internet industry, the scale of data available to us has grown rapidly, which places ever higher demands on our ability to filter information. Data mining has received increasing attention in recent years [1,2]. Data acquisition must be not only comprehensive but also efficient [3]. Before formally processing data, we need to perform data pre-processing, and feature selection is an effective pre-processing method [4]. In a large dataset, many features may be unnecessary or redundant and not relevant to the final classification goal, so it is necessary to reduce the dimensionality of the data and focus on the more representative features in order to classify and process the data more efficiently [5,6]. Without prior knowledge, however, it is difficult to estimate which features are useful and which are redundant.
Davies has shown that feature selection is an NP-hard problem, which means that the optimal subset of features can only be guaranteed by exhaustive enumeration. However, the time cost of the exhaustive method is excessively high and is unacceptable in most cases. A better way of solving the problem is therefore desirable. A heuristic algorithm is a method that seeks an optimal or suboptimal solution at an acceptable cost, and heuristics have become a mainstream approach to solving complex problems [7,8,9,10,11,12]. Particle swarm optimization (PSO) is a classical heuristic algorithm developed from the swarming behavior of organisms in nature [13,14,15,16]. The whale optimization algorithm is inspired by the social behavior of humpback whales; it is more competitive than traditional algorithms and has been applied to various fields [17,18,19,20]. In recent years, heuristic algorithms have received more and more attention from scholars, and many excellent algorithms have been proposed [21,22,23,24,25,26,27]. Many further improved heuristic algorithms have also been proposed to address the shortcomings of existing ones [28,29,30,31]. ROA is an algorithm based on the living habits of Rafflesia; it converges quickly and does not easily fall into local optima [32].
Traditional optimization algorithms are generally designed for continuous problems, and many of them have been applied to a variety of such problems [33,34,35,36,37]. In reality, however, there are many discrete problems that need to be solved, such as feature selection. To date, scholars have proposed many binary algorithms, all of which have achieved good results [38,39,40,41]. The binary PSO algorithm, the first binary algorithm to be proposed, has been applied to feature selection with good results [42]. An improved binary grey wolf optimization algorithm was proposed by Hu [43]. In a binary algorithm, 1 or 0 is used to symmetrically indicate whether the corresponding feature in the dataset is used for classification. In this paper, a new parallel strategy is proposed and new transfer functions are introduced to achieve better results on the feature selection problem.
Parallel strategies have been widely used in recent years to improve heuristic algorithms and have obtained good results [44,45]. A parallel strategy divides the original population into groups that explore independently of each other and communicate every certain number of iterations through a communication strategy, which speeds up convergence and prevents premature convergence [46,47,48]. Schutte has demonstrated the effectiveness of parallel strategies through numerous experiments [49].
The binarization of algorithms is often implemented with transfer functions [50,51]. The S-shaped transfer function was applied in the binary PSO algorithm to binarize the continuous values [42]. Zhuang applied four families of transfer functions to binarize the quasi-affine transformation evolution algorithm and demonstrated the superiority of the algorithm on 23 benchmark functions [52]. This paper focuses on binarizing the ROA algorithm with transfer functions and optimizing it with a parallel strategy. The improved algorithm was tested on UCI datasets for feature selection and compared with the binary GWO and binary PSO algorithms, and the experimental results showed that it outperforms these two algorithms in some cases.
The general organization of this paper is as follows. Section 2 describes related works. Section 3 proposes new transfer functions based on mathematical analysis and proposes a parallel strategy to optimize the algorithm. Section 4 demonstrates the performance of the algorithm by testing it on benchmark functions. Section 5 presents the application of the algorithm to feature selection. Section 6 summarizes and discusses this paper.

2. Related Works

2.1. ROA

Rafflesia is a saprophytic organism. Its main axis is extremely short, and it has no leaves or underground stems. When it blooms, it emits a peculiar smell that attracts certain insects to pollinate it. Because of its unique structure, some of the attracted insects may be trapped during pollination and die. After the flowering period has passed, the petals wither and the fruit ripens. The fruit contains many tiny seeds; after it falls to the ground, the seeds are carried at random to various places in various ways to find a suitable place to germinate. Based on these characteristics, the ROA algorithm is implemented in three stages.
The stage of attracting insects involves two strategies. The first strategy replaces inferior individuals: the worst 1/3 of the individuals are replaced by new ones. Each dimension of a newly added individual is abstracted into a three-dimensional space for calculation, and the dimension model is illustrated in Figure 1. The position of individual $X_i$ is updated with Equations (1)–(3).
$$X_i^k = X_{best}^k + d \cdot \sin\beta^k \cos\gamma^k \qquad (1)$$
$$d = \sqrt{\sum_{k=1}^{D}\left(X_R^k - X_{best}^k\right)^2} \qquad (2)$$
$$X_{worst_i} = X_i \qquad (3)$$
where $d$ is the distance between $X_i$ and $X_{best}$, $X_R$ is a random individual in the population, $X_{best}$ is the best individual in the population, and $X_{worst}$ represents a poor individual in the population. $\beta^k$ is a random value in $[0, \pi/2]$, and $\gamma^k$ is a random value in $[0, \pi]$.
The second strategy updates the 2/3 of individuals with better fitness. The individual velocity update equations are derived from the insect flapping-flight model and are given in Equations (4)–(7); the individual position update is Equation (8):
$$v_1 = \sqrt{\omega_0^2 A^2 \sin^2(\omega_0 t + \theta) + B^2 \cos^2(\omega_1 t + \theta)} \qquad (4)$$
$$v_2 = v_2\,\omega_0 \cos(\omega_0 t + \theta + \phi) \qquad (5)$$
$$v = v_1 + v_2 \qquad (6)$$
$$le = C \cdot v \cdot t + \left(X_{best} - X(t)\right) \cdot (1 - C) \cdot rand \qquad (7)$$
$$X(t) = X(t) + le \qquad (8)$$
where $v_1$ and $v_2$ represent the translation speed and rotation speed, respectively, and $\omega_0$ and $\omega_1$ represent the frequency periods of flapping and lateral flapping of the wings, respectively, both with the value 0.025. $A$ is the amplitude of the wing during movement, with the value 2.5. $B$ is the lateral offset, with the value 0.1. $\phi$ represents the phase difference between translation and rotation, with the value −0.78545. The range of $\theta$ is $(0, 2\pi)$. The initial value of $v_2$ is within $(0, 2\pi)$. $C$ is a random number in $(-1, 1)$.
According to the principle of “natural selection, survival of the fittest”, the stage of swallowing insects eliminates the individual with the worst fitness every certain number of iterations, thus ensuring the quality of the solutions and improving the efficiency of the algorithm.
At the stage of spreading seeds, the position of the Rafflesia is represented by the optimal individual at the start of this stage. At the same time, the other individuals randomly search the surroundings for an environment suitable for growth. At this stage, individuals are updated as follows:
$$X(t)^k = X_{best}^k + rd \cdot \exp\left(\frac{iter}{Max\_iter} - 1\right) \cdot sign(rand - 0.5) \qquad (9)$$
where $k\ (k = 1, \ldots, D)$ is the dimension index, and $iter$ and $Max\_iter$ represent the current number of iterations and the maximum number of iterations, respectively. $sign(rand - 0.5)$ is a sign function taking the value 1 or −1. $rd$ represents the range of individual values in the population, and its expression is as follows:
$$rd = rand \cdot (ub - lb) + lb \qquad (10)$$
$ub$ is the upper limit of the search space and $lb$ is the lower limit of the search space. Algorithm 1 gives the pseudocode of ROA.
Algorithm 1: ROA
Input: f(x): fitness function; N: number of individuals; d: function dimension; lb: lower boundary; ub: upper boundary; Max_iter: maximum number of iterations
Output: global optimal value.
Initialize the related parameters of ROA
Randomly generate the positions of the insects
while iter < Max_iter do: attract insects by updating the worst 1/3 of individuals with Equations (1)–(3) and the better 2/3 with Equations (4)–(8); periodically swallow (eliminate) the worst individual; spread seeds around the best individual with Equations (9) and (10); update the global best and increase iter
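For readers who prefer code, the short Python sketch below implements the seed-spreading update of Equations (9) and (10); the function signature and variable names are illustrative assumptions rather than the authors' reference implementation.

```python
import numpy as np

def spread_seeds(X_best, lb, ub, iter_, max_iter, dim):
    """Seed-spreading update of Eqs. (9)-(10): search around the best
    individual with a step that shrinks as iterations progress.
    A minimal sketch; parameter names are illustrative."""
    rd = np.random.rand(dim) * (ub - lb) + lb          # Eq. (10): random range per dimension
    step = rd * np.exp(iter_ / max_iter - 1)           # radius shrinks towards the end of the run
    direction = np.sign(np.random.rand(dim) - 0.5)     # +1 or -1 in each dimension
    return X_best + step * direction                   # Eq. (9)
```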

2.2. Transfer Function

The transfer function is a common method used in metaheuristic algorithms to convert continuous variables into binary variables. A transfer function maps a continuous variable to [0, 1], and subsequent processing then converts it into a binary value. Using transfer functions for binarization is both reasonable and convenient.
In [43], Hu uses a transfer function to binarize the algorithm, as shown in Equation (11), where $A$ denotes a coefficient vector and $D$ denotes the vector of distances between the current individual and the superior individual.
$$F(AD) = \frac{1}{1 + e^{-10(AD - 0.5)}} \qquad (11)$$
After mapping through the transfer function in the above equation, Equation (12) is used to update the positions of individuals, where $rand$ denotes a random number uniformly distributed between 0 and 1.
$$X_i(t+1) = \begin{cases} 0, & F(AD) \le rand \\ 1, & F(AD) > rand \end{cases} \qquad (12)$$
In this paper, we use four families of transfer functions to map continuous values. The S-shaped, V-shaped, and U-shaped families of transfer functions were proposed by Mirjalili [53,54]; their expressions are given in Table 1, Table 2 and Table 3. Guo proposed a family of Z-shaped transfer functions in [55] and used them for binarization; their expressions are shown in Table 4.
The position update equation of the S-shaped transfer functions is (12), which means that a particle is more likely to become 0 when its velocity is small and 1 when its velocity is large. However, because the nature of the U-shaped, V-shaped, and Z-shaped transfer functions differs from that of the S-shaped functions, the same position update method cannot be used for them. Zhuang proposed the position update Equation (13), in which a higher individual speed gives a higher probability of switching to the complementary position.
$$X(t+1) = \begin{cases} 1 - X(t), & F(AD) > rand \\ X(t), & F(AD) \le rand \end{cases} \qquad (13)$$
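As a concrete illustration of Equations (12) and (13), the following Python sketch applies an S-shaped transfer function with the probability-style update and a V-shaped function with the complement-style update; the function names, the choice of S2 and V2, and the random-number handling are a minimal sketch under stated assumptions, not the exact implementation used in the experiments.

```python
import numpy as np

def s_shaped(x):
    # S2 transfer function: maps a continuous value to a probability in (0, 1)
    return 1.0 / (1.0 + np.exp(-x))

def v_shaped(x):
    # V2 transfer function: |tanh(x)|, bounded in [0, 1)
    return np.abs(np.tanh(x))

def update_s(x_cont, rng):
    # Eq. (12): the position becomes 1 with probability S(x), otherwise 0
    return (s_shaped(x_cont) > rng.random(x_cont.shape)).astype(int)

def update_v(x_cont, x_bin, rng):
    # Eq. (13): flip the current binary position when V(x) exceeds a random number
    flip = v_shaped(x_cont) > rng.random(x_cont.shape)
    return np.where(flip, 1 - x_bin, x_bin)

rng = np.random.default_rng(0)
x_cont = rng.normal(size=5)          # continuous positions from the ROA update
x_bin = rng.integers(0, 2, size=5)   # current binary positions
print(update_s(x_cont, rng), update_v(x_cont, x_bin, rng))
```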

3. Analysis and Proposed Parallel Binary ROA

In this section, a mathematical analysis of the search space of ROA is carried out. The initial transfer function is changed on the basis of mathematical analysis, and the algorithm is binarized. The algorithm is optimized using a parallel strategy, and two inter-group communication methods are proposed to improve the convergence speed of the algorithm and reduce the risk of premature convergence.

3.1. Mathematical Analysis

In the ROA algorithm, the location of an insect can be any point in the search space, but in the binary ROA its location can only be 0 or 1 in each dimension. Therefore, the search space of the original algorithm needs to be analyzed to obtain its specific range, so that a suitable transfer function can be constructed to binarize the algorithm.
To make the analysis easier to follow, we consider a single dimension of an individual. From (7) and (8), we know that $X_i = X_i + cvt + (X_{best} - X_i) \cdot (1 - c) \cdot rand$, where $v = v_1 + v_2$, $v_1 = \sqrt{\omega_0^2 A^2 \sin^2(\omega_0 t + \theta) + B^2 \cos^2(\omega_1 t + \theta)}$, and $v_2 = v_2\,\omega_0 \cos(\omega_0 t + \theta + \phi)$. Furthermore, in the binary algorithm the values of $X_i$ and $X_{best}$ can only be 0 or 1, so the updated value of $X_i$ falls into the four following cases.
(1) if $X_i = 0$ and $X_{best} = 0$:
$$X_i = 0 + cvt + (0 - 0) \cdot (1 - c) \cdot rand = cv$$
(2) if $X_i = 0$ and $X_{best} = 1$:
$$X_i = 0 + cvt + (1 - 0) \cdot (1 - c) \cdot rand = cv + (1 - c) \cdot rand$$
(3) if $X_i = 1$ and $X_{best} = 0$:
$$X_i = 1 + cvt + (0 - 1) \cdot (1 - c) \cdot rand = 1 + cv - (1 - c) \cdot rand$$
(4) if $X_i = 1$ and $X_{best} = 1$:
$$X_i = 1 + cvt + (1 - 1) \cdot (1 - c) \cdot rand = 1 + cv$$
Since $c$ is a random number in $(-1, 1)$ and $rand$ is a random number in $(0, 1)$, the above analysis gives $v \in (-0.156, 0.188)$, from which we conclude that $X_i \in (-1.188, 2.156)$. This conclusion is used below to improve the transfer functions.
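A quick numeric check of the four cases above, assuming $v \in (-0.156, 0.188)$, $c \in (-1, 1)$, and $rand \in (0, 1)$, reproduces the stated bounds; the sketch below simply scans the extreme values of each quantity.

```python
import itertools

# Extreme values of the quantities appearing in the four cases
v_vals = (-0.156, 0.188)
c_vals = (-1.0, 1.0)
r_vals = (0.0, 1.0)

candidates = []
for v, c, r in itertools.product(v_vals, c_vals, r_vals):
    candidates += [
        c * v,                        # case (1): X_i = 0, X_best = 0
        c * v + (1 - c) * r,          # case (2): X_i = 0, X_best = 1
        1 + c * v - (1 - c) * r,      # case (3): X_i = 1, X_best = 0
        1 + c * v,                    # case (4): X_i = 1, X_best = 1
    ]

print(min(candidates), max(candidates))   # approximately -1.188 and 2.156
```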

3.2. New Transfer Functions

The transfer function is extremely important for the binarization of the algorithm. Section 2.2 presented the four families of transfer functions used to binarize the algorithm. This section improves these transfer functions based on the mathematical analysis of the algorithm in Section 3.1.
The original transfer functions are not fully suitable for binarizing this algorithm, because the search space they were designed for does not match it. A suitable transfer function allows individuals to converge to the optimum more quickly, while an unsuitable one may cause the algorithm to converge slowly. Taking the S-shaped transfer functions as an example, we want the position of an individual to take the value 1 with higher probability when its continuous value is large during the position update, and the value 0 when it is small. From Section 3.1, the search space of the binary ROA is [−1.188, 2.156], whose center is 0.484, and the range of the $S_1$ transfer function over this search space is only [0.0850, 0.9868]. We therefore deform the transfer functions to achieve better results; the deformed S-shaped transfer functions are shown in Table 5.
Similar deformation operations are performed for the other transfer function families, so that the values obtained over the original search space are more evenly distributed in [0, 1], giving better results. The deformed transfer functions are shown in Table 6, Table 7 and Table 8. A comparison of the original and improved transfer functions is shown in Figure 2.
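To make the deformation concrete, the snippet below contrasts the original S3 with an improved S3 recentred on 0.484 (the midpoint of the search space derived above), with the scaling constant taken from Table 5; it is only an illustrative check of how the output range widens, not part of the experimental code.

```python
import numpy as np

def s3_original(x):
    # Original S3 transfer function from Table 1
    return 1.0 / (1.0 + np.exp(-x / 2.0))

def s3_improved(x):
    # Improved S3 from Table 5: recentred on 0.484 and steepened
    return 1.0 / (1.0 + np.exp(-10.0 * (x - 0.484)))

xs = np.linspace(-1.188, 2.156, 5)        # the binary ROA search space
print(np.round(s3_original(xs), 4))       # narrow, biased towards 0.5
print(np.round(s3_improved(xs), 4))       # spreads nearly over the full [0, 1]
```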

3.3. Parallel Strategy

A parallel strategy is often used to optimize algorithms. Through parallelization, we can obtain better-quality solutions and faster convergence. The concrete practice of a parallel strategy is to search for the optimal solution in groups that are independent of each other; when certain conditions are met, the groups communicate, completing the information exchange between them.
In the parallel strategy used in this algorithm, we divide the original population into a main population and two subpopulations. Subpopulation 1 searches around the better individuals of the main population, because the optimal individual may appear in their vicinity. Subpopulation 2 searches around randomly selected individuals of the main population, which prevents the population from falling into a local optimum.
The communication strategy is extremely important for the algorithm: a good communication method yields faster convergence and makes the algorithm less likely to fall into a local optimum. To achieve faster convergence and improve the quality of the solution, two intergroup communication strategies are proposed. Strategy 1 compares the best individual of each subpopulation with the best individual of the main population every certain number of iterations; if the subpopulation's individual is better, it replaces the worst individual of the main population. This strategy makes the algorithm converge faster. Strategy 2 monitors the optimal individuals of the main population; if they are not updated for a long time, the algorithm is considered to be possibly stuck in a local optimum, and some of the best individuals of the main population are regenerated. The regeneration combines different dimensions of previously good individuals to form new individuals, which then replace some of the best individuals of the main population. Figure 3 depicts the communication strategies of the parallel strategy.
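The two communication strategies can be summarised in a few lines of Python, as sketched below; the population is assumed to be stored in NumPy arrays, fitness is assumed to be minimised, and the group sizes, migration interval, and stagnation handling are illustrative assumptions rather than the exact settings of the experiments.

```python
import numpy as np

def strategy_1(main_pop, main_fit, sub_best, sub_best_fit):
    """Strategy 1: if a subpopulation's best individual beats the main
    population's best, it replaces the main population's worst individual.
    Assumes minimisation."""
    if sub_best_fit < main_fit.min():
        worst = np.argmax(main_fit)
        main_pop[worst] = sub_best
        main_fit[worst] = sub_best_fit
    return main_pop, main_fit

def strategy_2(main_pop, main_fit, elite_archive, n_refresh, rng):
    """Strategy 2: on stagnation, rebuild some of the best individuals by
    recombining dimensions of previously good individuals (elite_archive)."""
    dim = main_pop.shape[1]
    best_idx = np.argsort(main_fit)[:n_refresh]
    for i in best_idx:
        donors = rng.integers(0, len(elite_archive), size=dim)   # one donor per dimension
        main_pop[i] = elite_archive[donors, np.arange(dim)]      # mix dimensions across elites
    return main_pop
```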

4. Experiments, Results, and Analysis

In this section, we use 23 benchmark functions to examine the exploration and exploitation capabilities of the algorithm. Although these benchmark functions are designed for evaluating continuous optimization algorithms, they retain their essential properties when each dimension is restricted to binary values, so they can also be used to evaluate binary optimization. In Table 9, $f_1$–$f_7$ are unimodal benchmark functions; in Table 10, $f_8$–$f_{13}$ are common multimodal benchmark functions; and in Table 11, $f_{14}$–$f_{23}$ are low-dimensional multimodal benchmark functions. In the tables, Dim is the dimension of the function, fmin denotes the minimum value the function can attain, and Space denotes the search space of the function.
A unimodal function has no local optima, only a single global optimum, so it can be used to measure the convergence speed of the algorithm. The common multimodal functions are relatively complex, with multiple local optima, so they test whether the algorithm can escape local optima and reach the global optimum. The low-dimensional multimodal functions also have multiple local optima, and their low dimensionality makes the algorithm more prone to premature convergence, so they strictly verify the convergence behavior of the algorithm.

Experimental Results

In the experiments, we examine the impact of the improved transfer functions and of the parallel strategy on the algorithm. Due to space limitations, we chose one transfer function from each of the four families, namely $S_3$, $V_2$, $U_3$, and $Z_3$, as examples for our experiments. We also compared the improved algorithm with BPSO and BGWO [42,43].
In our experiments, we judge the algorithms by the quality of their solutions and by their stability. Each algorithm was run 20 times on each benchmark function, each run consisted of 150 iterations, and the population contained 30 individuals in total. Table 12 shows the experimental results when the algorithm uses the original transfer functions, and Table 13 shows the results when the algorithm uses the improved strategy. For readability, the experimental data are rounded to four decimal places.
As can be seen from Table 12, BROA_U3, BROA_V2, and BROA_Z3 obtain better results on the unimodal benchmark functions, which demonstrates the good convergence performance of the ROA algorithm binarized with the original V-shaped, U-shaped, and Z-shaped transfer functions. BGWO and BROA_Z3 achieve the best results on f8 and f13. BROA_U3, BROA_V2, and BROA_Z3 achieve the best performance on f9–f12, which indicates their good global exploration capability. In contrast, the ROA algorithm binarized with the S-shaped transfer function performs poorly. On the low-dimensional multimodal benchmark functions, all algorithms achieve good results.
Table 13 shows the experimental results of the algorithm after applying the improved strategy. For ease of reading, results of the improved algorithm that are better than those of the original algorithm are indicated in blue font, and results that are worse are indicated in red font. The improved BROA_S3 achieves better results on f1–f3 and f5–f13, indicating that the improved strategy is very effective for it, giving it better convergence performance and making it easier to escape local optima. The improved BROA_V2 has better results on f5, f7, f8, and f13; the improved BROA_U3 has better results on f7, f8, and f13; and the improved BROA_Z3 has better results on f5 and f7. The differences between the fitness values of the improved and original algorithms on the test functions are shown in Figure 4, where the performance improvement can be seen more clearly. These results show that the performance of the algorithm is significantly improved by the improved strategy: the parallel strategy allows it to converge to the optimum faster and effectively prevents it from falling into local optima. Among the algorithms binarized with different transfer functions, the improved strategy is most effective for the S-shaped binarization.
In order to demonstrate that the improved algorithm is significantly different from the original algorithm, the Wilcoxon rank-sum test was performed on the results before and after the improvement at the 5% significance level. The null hypothesis is that there is no significant difference between the improved and original algorithms, and it is rejected when the p-value in Table 14 is less than 0.05, indicating a significant difference. From the table, PBROA_S3 has p-values below 0.05 on f1, f2, f4, f5, and f7; PBROA_V2 on f7; PBROA_U3 on f4, f7, and f8; and PBROA_Z3 on f5 and f7. These results show that there is a significant difference between the improved and original algorithms on a number of functions, which further confirms that the improved strategy enhances the performance of the algorithm.
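For reference, a rank-sum test of this kind can be reproduced with SciPy; the two arrays below stand in for the 20 final fitness values of the original and improved algorithms on one benchmark function and are placeholders rather than the published data.

```python
import numpy as np
from scipy.stats import ranksums

rng = np.random.default_rng(1)
before = rng.normal(0.07, 0.02, size=20)   # placeholder: original algorithm, 20 runs
after = rng.normal(0.05, 0.02, size=20)    # placeholder: improved algorithm, 20 runs

stat, p_value = ranksums(before, after)
print(p_value < 0.05)   # True rejects the null hypothesis at the 5% level
```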

5. Application of Feature Selection

Because raw data contain a great deal of redundancy, processing them directly is difficult and time consuming. Feature selection is a very important data pre-processing approach: the raw data are first processed by feature selection, and the processed data become more accurate and streamlined. In this section, we apply the improved algorithm to feature selection.

5.1. Dataset

The experiments test the performance of the algorithm using 14 datasets. These datasets are all from the UCI machine learning repository and vary in size and dimensionality [56]. The specific parameters of the datasets are shown in Table 15.

5.2. Experimental Results and Analysis

KNN is a simple data classification method in data mining, based on the principle that each sample can be represented by its K nearest neighbors [57]. The distance in KNN is generally the Manhattan distance or the Euclidean distance, calculated as in Equation (14):
$$D_p(x, x^{test}) = \left(\sum_{i=1}^{n}\left|x_i - x_i^{test}\right|^p\right)^{\frac{1}{p}} \qquad (14)$$
where $p$ can be 1 or 2; when $p = 1$, the equation gives the Manhattan distance, and when $p = 2$, it gives the Euclidean distance. $x$ denotes the training data, $x^{test}$ denotes the test data, and $n$ denotes the dimensionality of the data.
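Equation (14) is the standard Minkowski distance; a direct NumPy translation is shown below, where p = 1 gives the Manhattan distance and p = 2 the Euclidean distance.

```python
import numpy as np

def minkowski_distance(x, x_test, p):
    # Eq. (14): Manhattan distance for p = 1, Euclidean distance for p = 2
    return np.sum(np.abs(x - x_test) ** p) ** (1.0 / p)

x = np.array([1.0, 2.0, 3.0])
x_test = np.array([2.0, 0.0, 3.0])
print(minkowski_distance(x, x_test, 1), minkowski_distance(x, x_test, 2))
```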
In this experiment, K-fold cross-validation is adopted: the original dataset is divided into K parts, K−1 parts are used as training data each time, and the remaining part is used as test data. After K repetitions, the final result is taken as the average of the K experiments. In the experiment, we must balance the classification accuracy against the number of selected features. Since the purpose of feature selection is to streamline the data, we want to obtain a low classification error while selecting as few features as possible. The fitness is calculated as shown in Equation (15):
$$fitness = a \cdot error + (1 - a) \cdot \frac{flag}{dim} \qquad (15)$$
where $error$ denotes the classification error and $a$ denotes the weight of the classification error in the fitness; in this experiment, $a$ takes the value 0.99. $flag$ denotes the number of selected features, and $dim$ denotes the total number of features in the dataset.
In KNN, the Distance parameter is set to Euclidean and NumNeighbors is set to 5, and the K value of the cross-validation is set to 2 in this experiment. In the experiments, the population size was set to 30, the number of iterations to 150, and each algorithm was run 20 times on each dataset. In the binary encoding used in the experiments, a value of 0 in a dimension means that the corresponding feature is not selected, and a value of 1 means that it is selected.
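Putting the pieces together, the fitness evaluation of one binary feature mask can be sketched with scikit-learn as below; the parameter values (5 neighbours, Euclidean distance, 2-fold cross-validation, a = 0.99) follow the settings described above, while the function and variable names, the zero-feature penalty, and the omitted data loading are illustrative assumptions rather than the authors' exact code.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import cross_val_score

def feature_selection_fitness(mask, X, y, a=0.99, k_folds=2):
    """Eq. (15): weighted sum of classification error and selected-feature ratio.
    `mask` is a binary vector; an all-zero mask is penalised with the worst fitness."""
    selected = np.flatnonzero(mask)
    if selected.size == 0:
        return 1.0
    knn = KNeighborsClassifier(n_neighbors=5, metric="euclidean")
    accuracy = cross_val_score(knn, X[:, selected], y, cv=k_folds).mean()
    error = 1.0 - accuracy
    return a * error + (1.0 - a) * selected.size / mask.size
```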
The data in Table 16 are the fitness values, where the best results are indicated in bold for ease of reading. We also use Figure 5 to show the experimental results, which makes the comparison clearer. From the data in the table and figure, BROA_Z3 achieves the best results on the Breast Cancer, Breast Cancer Wisconsin, and Glass datasets. For the Iris dataset, several algorithms achieve equally good results owing to its low dimensionality. BGWO performs well on the South German Credit, Flags, and Image Segmentation datasets. BROA_S3 performs well on the Dermatology, Credit Approval, Chess (kr-vs-kp), Hepatitis, Statlog (Australian Credit Approval), and Wall-Following Robot Navigation datasets. These results show the superiority of the parallel binary ROA algorithm, which outperforms the BGWO and BPSO algorithms in most cases, with BROA_S3 achieving the highest accuracy among the improved variants. Compared with the traditional algorithms, the new algorithm proposed in this paper has better convergence performance, does not easily fall into local optima, and tends to find better solutions. These results also confirm the value of the improved parallel optimization algorithm in practical applications, where the parallel strategy can explore more possibilities and obtain better results.
The data in Table 17 are the numbers of selected features, where the best results are indicated in bold for ease of reading. To make the comparison easier to follow, we also show the results in Figure 6. From the data in the table and figure, several algorithms give the best result for the Iris dataset. BPSO gives the best results for the Breast Cancer and Hepatitis datasets, and BROA_S3 for the Dermatology dataset. BROA_V2 selects fewer features on the South German Credit, Flags, Credit Approval, Image Segmentation, and Statlog (Australian Credit Approval) datasets. BROA_U3 has better results on the Wine and Wall-Following Robot Navigation datasets. BROA_Z3 selects fewer features on the Breast Cancer Wisconsin, Glass, and Chess (kr-vs-kp) datasets. From this analysis, BROA_S3 is more accurate but tends to select more features, while BROA_V2 tends to select fewer features to achieve classification.

6. Conclusions

The parallel binary Rafflesia Optimization Algorithm is able to solve discrete problems, and in this paper it is applied to the feature selection problem with good results. The transfer function is crucial in a binary algorithm; we analyze the search space of the original algorithm and propose new transfer functions based on it, thus binarizing the original algorithm. To further improve performance, a parallel strategy is used to optimize the algorithm. The strong performance of the parallel binary Rafflesia Optimization Algorithm is demonstrated on the benchmark functions. Finally, we successfully applied the algorithm to feature selection and performed classification with KNN and cross-validation. The parallel binary ROA is shown in the experiments to possess acceptable classification accuracy. In this paper, we only used KNN for the classification step of feature selection; in the future it can be combined with a neural network or other classifiers to further reduce the classification error. The binary conversion of the algorithm also increases the computation time, and future work will include reducing the running time of the algorithm.

Author Contributions

Conceptualization, J.-S.P. and S.-C.C.; Data curation, P.H.; Formal analysis, J.-S.P. and S.-C.C.; Investigation, J.-S.P. and H.-J.S.; Methodology, J.-S.P., H.-J.S., S.-C.C. and P.H.; Resources, H.-J.S. and P.H.; Software, H.-J.S. and S.-C.C.; Validation, J.-S.P., S.-C.C. and H.A.S.; Writing—original draft, H.-J.S.; Writing—review and editing, J.-S.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Lin, J.C.W.; Wu, T.Y.; Fournier-Viger, P.; Lin, G.; Zhan, J.; Voznak, M. Fast algorithms for hiding sensitive high-utility itemsets in privacy-preserving utility mining. Eng. Appl. Artif. Intell. 2016, 55, 269–284. [Google Scholar] [CrossRef]
  2. Romero, C.; Ventura, S. Data mining in education. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2013, 3, 12–27. [Google Scholar] [CrossRef]
  3. Chen, C.M.; Chen, L.; Gan, W.; Qiu, L.; Ding, W. Discovering high utility-occupancy patterns from uncertain data. Inf. Sci. 2021, 546, 1208–1229. [Google Scholar] [CrossRef]
  4. Karabulut, E.M.; Özel, S.A.; Ibrikci, T. A comparative study on the effect of feature selection on classification accuracy. Procedia Technol. 2012, 1, 323–327. [Google Scholar] [CrossRef]
  5. Kumar, V.; Minz, S. Feature selection: A literature review. SmartCR 2014, 4, 211–229. [Google Scholar] [CrossRef]
  6. Chandrashekar, G.; Sahin, F. A survey on feature selection methods. Comput. Electr. Eng. 2014, 40, 16–28. [Google Scholar] [CrossRef]
  7. Meng, Z.; Pan, J.S.; Tseng, K.K. PaDE: An enhanced Differential Evolution algorithm with novel control parameter adaptation schemes for numerical optimization. Knowl.-Based Syst. 2019, 168, 80–99. [Google Scholar] [CrossRef]
  8. Xue, X.; Chen, J.; Yao, X. Efficient user involvement in semiautomatic ontology matching. IEEE Trans. Emerg. Top. Comput. Intell. 2018, 5, 214–224. [Google Scholar] [CrossRef]
  9. Sun, C.; Jin, Y.; Cheng, R.; Ding, J.; Zeng, J. Surrogate-assisted cooperative swarm optimization of high-dimensional expensive problems. IEEE Trans. Evol. Comput. 2017, 21, 644–660. [Google Scholar] [CrossRef]
  10. Wang, H.; Rahnamayan, S.; Sun, H.; Omran, M.G. Gaussian bare-bones differential evolution. IEEE Trans. Cybern. 2013, 43, 634–647. [Google Scholar] [CrossRef]
  11. Salehi, S.; Selamat, A.; Mashinchi, M.R.; Fujita, H. The synergistic combination of particle swarm optimization and fuzzy sets to design granular classifier. Knowl.-Based Syst. 2015, 76, 200–218. [Google Scholar] [CrossRef]
  12. Wang, J.; Liu, Y.; Chen, J.; Yang, X. An Ensemble Framework to Forest Optimization Based Reduct Searching. Symmetry 2022, 14, 1277. [Google Scholar] [CrossRef]
  13. Jain, M.; Saihjpal, V.; Singh, N.; Singh, S.B. An Overview of Variants and Advancements of PSO Algorithm. Appl. Sci. 2022, 12, 8392. [Google Scholar] [CrossRef]
  14. Meng, Z.; Zhong, Y.; Mao, G.; Liang, Y. PSO-sono: A novel PSO variant for single-objective numerical optimization. Inf. Sci. 2022, 586, 176–191. [Google Scholar] [CrossRef]
  15. Pervaiz, S.; Bangyal, W.H.; Ashraf, A.; Nisar, K.; Haque, M.R.; Ibrahim, A.; Ag, A.B.; Chowdhry, B.; Rasheed, W.; Rodrigues, J.J.; et al. Comparative Research Directions of Population Initialization Techniques using PSO Algorithm. Intell. Autom. Soft Comput. 2022, 32, 1427–1444. [Google Scholar] [CrossRef]
  16. Feng, H.M. Self-generation RBFNs using evolutional PSO learning. Neurocomputing 2006, 70, 241–251. [Google Scholar] [CrossRef]
  17. Mirjalili, S.; Lewis, A. The whale optimization algorithm. Adv. Eng. Softw. 2016, 95, 51–67. [Google Scholar] [CrossRef]
  18. Guo, F.; Zhang, H.; Xu, Y.; Xiong, G.; Zeng, C. Isokinetic Rehabilitation Trajectory Planning of an Upper Extremity Exoskeleton Rehabilitation Robot Based on a Multistrategy Improved Whale Optimization Algorithm. Symmetry 2023, 15, 232. [Google Scholar] [CrossRef]
  19. Li, Y.; Li, X.; Liu, J.; Tu, X. Gaussian perturbation whale optimization algorithm based on nonlinear strategy. Int. J. Perform. Eng. 2019, 15, 1829. [Google Scholar] [CrossRef]
  20. Aljarah, I.; Faris, H.; Mirjalili, S. Optimizing connection weights in neural networks using the whale optimization algorithm. Soft Comput. 2018, 22, 1–15. [Google Scholar] [CrossRef]
  21. Wu, T.Y.; Lin, J.C.W.; Zhang, Y.; Chen, C.H. A grid-based swarm intelligence algorithm for privacy-preserving data mining. Appl. Sci. 2019, 9, 774. [Google Scholar] [CrossRef]
  22. Chu, S.C.; Tsai, P.W.; Pan, J.S. Cat swarm optimization. In Proceedings of the Pacific Rim International Conference on Artificial Intelligence, Guilin, China, 7–11 August 2006; pp. 854–858. [Google Scholar]
  23. Meng, Z.; Yang, C. Two-stage differential evolution with novel parameter control. Inf. Sci. 2022, 596, 321–342. [Google Scholar] [CrossRef]
  24. Song, P.C.; Chu, S.C.; Pan, J.S.; Yang, H. Simplified Phasmatodea population evolution algorithm for optimization. Complex Intell. Syst. 2022, 8, 2749–2767. [Google Scholar] [CrossRef]
  25. Pan, J.S.; Zhang, L.G.; Wang, R.B.; Snášel, V.; Chu, S.C. Gannet optimization algorithm: A new metaheuristic algorithm for solving engineering optimization problems. Math. Comput. Simul. 2022, 202, 343–373. [Google Scholar] [CrossRef]
  26. Nguyen, T.-T.; Dong-Nguyen, T.; Ngo, T.-G.; Nguyen, V.-T. An Optimal Thresholds for Segmenting Medical Images Using Improved Swarm Algorithm. J. Inf. Hiding Multimed. Signal Process. 2022, 13, 12–21. [Google Scholar]
  27. Xue, X.; Jiang, C. Matching sensor ontologies with multi-context similarity measure and parallel compact differential evolution algorithm. IEEE Sens. J. 2021, 21, 24570–24578. [Google Scholar] [CrossRef]
  28. Wang, G.; Guo, L.; Wang, H.; Duan, H.; Liu, L.; Li, J. Incorporating mutation scheme into krill herd algorithm for global numerical optimization. Neural Comput. Appl. 2014, 24, 853–871. [Google Scholar] [CrossRef]
  29. Wang, G.G.; Hao, G.S.; Cheng, S.; Cui, Z. An improved monarch butterfly optimization with equal partition and f/t mutation. In Proceedings of the Advances in Swarm Intelligence: 8th International Conference, ICSI 2017, Fukuoka, Japan, 27 July–1 August 2017; pp. 106–115. [Google Scholar]
  30. Deng, W.; Xu, J.; Zhao, H. An improved ant colony optimization algorithm based on hybrid strategies for scheduling problem. IEEE Access 2019, 7, 20281–20292. [Google Scholar] [CrossRef]
  31. Arasteh, B.; Seyyedabbasi, A.; Rasheed, J.; Abu-Mahfouz, A.M. Program Source-Code Re-Modularization Using a Discretized and Modified Sand Cat Swarm Optimization Algorithm. Symmetry 2023, 15, 401. [Google Scholar] [CrossRef]
  32. Pan, J.S.; Fu, Z.; Hu, C.C.; Tsai, P.W.; Chu, S.C. Rafflesia Optimization Algorithm Applied in the Logistics Distribution Centers Location Problem. J. Internet Technol. 2022, 23, 1541–1555. [Google Scholar]
  33. Trong-The Nguyen, T.D.N.; Nguyen, V.T. An Optimizing Pulse Coupled Neural Network based on Golden Eagle Optimizer for Automatic Image Segmentation. J. Inf. Hiding Multimed. Signal Process. 2022, 13, 155–164. [Google Scholar]
  34. Xue, X. A compact firefly algorithm for matching biomedical ontologies. Knowl. Inf. Syst. 2020, 62, 2855–2871. [Google Scholar] [CrossRef]
  35. Chen, X.; Cheng, L.; Liu, C.; Liu, Q.; Liu, J.; Mao, Y.; Murphy, J. A WOA-based optimization approach for task scheduling in cloud computing systems. IEEE Syst. J. 2020, 14, 3117–3128. [Google Scholar] [CrossRef]
  36. Ramachandran, M.; Ganesh, E. Energy Optimized Joint Channel Assignment and Routing using Cat Swarm Optimization (CSO) Algorithm in CRAHN. J. Green Eng. 2020, 202, 3434–3449. [Google Scholar]
  37. Gao, D.; Wang, G.G.; Pedrycz, W. Solving fuzzy job-shop scheduling problem using DE algorithm improved by a selection mechanism. IEEE Trans. Fuzzy Syst. 2020, 28, 3265–3275. [Google Scholar] [CrossRef]
  38. Mirjalili, S.; Mirjalili, S.M.; Yang, X.S. Binary bat algorithm. Neural Comput. Appl. 2014, 25, 663–681. [Google Scholar] [CrossRef]
  39. Mirjalili, S.; Hashim, S.Z.M. BMOA: Binary magnetic optimization algorithm. Int. J. Mach. Learn. Comput. 2012, 2, 204. [Google Scholar] [CrossRef]
  40. Hussien, A.G.; Oliva, D.; Houssein, E.H.; Juan, A.A.; Yu, X. Binary whale optimization algorithm for dimensionality reduction. Mathematics 2020, 8, 1821. [Google Scholar] [CrossRef]
  41. Akan, T.; Agahian, S.; Dehkharghani, R. Binbro: Binary battle royale optimizer algorithm. Expert Syst. Appl. 2022, 195, 116599. [Google Scholar]
  42. Kennedy, J.; Eberhart, R.C. A discrete binary version of the particle swarm algorithm. In Proceedings of the 1997 IEEE International Conference on Systems, Man, and Cybernetics, Computational Cybernetics and Simulation, Orlando, FL, USA, 12–15 October 1997; Volume 5, pp. 4104–4108. [Google Scholar]
  43. Hu, P.; Pan, J.S.; Chu, S.C. Improved binary grey wolf optimizer and its application for feature selection. Knowl.-Based Syst. 2020, 195, 105746. [Google Scholar] [CrossRef]
  44. Song, Y.; Wu, D.; Deng, W.; Gao, X.Z.; Li, T.; Zhang, B.; Li, Y. MPPCEDE: Multi-population parallel co-evolutionary differential evolution for parameter optimization. Energy Convers. Manag. 2021, 228, 113661. [Google Scholar] [CrossRef]
  45. Biscani, F.; Izzo, D. A parallel global multiobjective framework for optimization: Pagmo. J. Open Source Softw. 2020, 5, 2338. [Google Scholar] [CrossRef]
  46. Pan, J.S.; Sun, B.; Chu, S.C.; Zhu, M.; Shieh, C.S. A Parallel Compact Gannet Optimization Algorithm for Solving Engineering Optimization Problems. Mathematics 2023, 11, 439. [Google Scholar] [CrossRef]
  47. Ding, S.; Du, W.; Zhao, X.; Wang, L.; Jia, W. A new asynchronous reinforcement learning algorithm based on improved parallel PSO. Appl. Intell. 2019, 49, 4211–4222. [Google Scholar] [CrossRef]
  48. Khankhour, H.; Abdoun, O.; Abouchabaka, J. Parallel genetic approach for routing optimization in large ad hoc networks. Int. J. Electr. Comput. Eng. (IJECE) 2022, 12, 748–755. [Google Scholar] [CrossRef]
  49. Schutte, J.F.; Reinbolt, J.A.; Fregly, B.J.; Haftka, R.T.; George, A.D. Parallel global optimization with the particle swarm algorithm. Int. J. Numer. Methods Eng. 2004, 61, 2296–2315. [Google Scholar] [CrossRef]
  50. Rashedi, E.; Nezamabadi-Pour, H.; Saryazdi, S. BGSA: Binary gravitational search algorithm. Nat. Comput. 2010, 9, 727–745. [Google Scholar] [CrossRef]
  51. Abdel-Basset, M.; Mohamed, R.; Mirjalili, S. A binary equilibrium optimization algorithm for 0–1 knapsack problems. Comput. Ind. Eng. 2021, 151, 106946. [Google Scholar] [CrossRef]
  52. Chu, S.C.; Zhuang, Z.; Li, J.; Pan, J.S. A novel binary QUasi-affine transformation evolutionary (QUATRE) algorithm. Appl. Sci. 2021, 11, 2251. [Google Scholar] [CrossRef]
  53. Mirjalili, S.; Lewis, A. S-shaped versus V-shaped transfer functions for binary particle swarm optimization. Swarm Evol. Comput. 2013, 9, 1–14. [Google Scholar] [CrossRef]
  54. Mirjalili, S.; Zhang, H.; Mirjalili, S.; Chalup, S.; Noman, N. A novel U-shaped transfer function for binary particle swarm optimisation. In Soft Computing for Problem Solving 2019; Springer: Cham, Switzerland, 2020; pp. 241–259. [Google Scholar]
  55. Guo, S.; Wang, J.; Guo, M. Z-shaped transfer functions for binary particle swarm optimization algorithm. Comput. Intell. Neurosci. 2020, 2020, 6502807. [Google Scholar] [CrossRef] [PubMed]
  56. Asuncion, A.; Newman, D. UCI machine learning repository, 2007.
  57. Zhang, S.; Li, X.; Zong, M.; Zhu, X.; Wang, R. Efficient kNN classification with different numbers of nearest neighbors. IEEE Trans. Neural Netw. Learn. Syst. 2017, 29, 1774–1785. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The model of the calculated dimensions.
Figure 2. The original (solid lines) and improved (dotted lines) families of transfer functions.
Figure 3. Communication strategy for the parallel strategy.
Figure 4. The difference between algorithm fitness before and after applying the improved strategy.
Figure 5. The result of fitness value.
Figure 6. The number of selected features.
Table 1. S-shaped family of transfer functions.
Function Name | Formula
$S_1(x)$ | $\frac{1}{1+e^{-2x}}$
$S_2(x)$ | $\frac{1}{1+e^{-x}}$
$S_3(x)$ | $\frac{1}{1+e^{-x/2}}$
$S_4(x)$ | $\frac{1}{1+e^{-x/3}}$
Table 2. V-shaped family of transfer functions.
Function Name | Formula
$V_1(x)$ | $\left|\mathrm{erf}\left(\frac{\sqrt{\pi}}{2}x\right)\right|$
$V_2(x)$ | $\left|\tanh(x)\right|$
$V_3(x)$ | $\left|\frac{x}{\sqrt{1+x^2}}\right|$
$V_4(x)$ | $\left|\frac{2}{\pi}\arctan\left(\frac{\pi}{2}x\right)\right|$
Table 3. U-shaped family of transfer functions.
Function Name | Formula
$U_1(x)$ | $\min\left(\left|x^{1.5}\right|, 1\right)$
$U_2(x)$ | $\min\left(\left|x^{2}\right|, 1\right)$
$U_3(x)$ | $\min\left(\left|x^{3}\right|, 1\right)$
$U_4(x)$ | $\min\left(\left|x^{4}\right|, 1\right)$
Table 4. Z-shaped family of transfer functions.
Function Name | Formula
$Z_1(x)$ | $\sqrt{1-2^{x}}$
$Z_2(x)$ | $\sqrt{1-5^{x}}$
$Z_3(x)$ | $\sqrt{1-8^{x}}$
$Z_4(x)$ | $\sqrt{1-20^{x}}$
Table 5. Improved S-shaped family of transfer functions.
Function Name | Formula
$S_1(x)$ | $\frac{1}{1+e^{-2(x-0.484)}}$
$S_2(x)$ | $\frac{1}{1+e^{-5(x-0.484)}}$
$S_3(x)$ | $\frac{1}{1+e^{-10(x-0.484)}}$
$S_4(x)$ | $\frac{1}{1+e^{-15(x-0.484)}}$
Table 6. Improved V-shaped family of transfer functions.
Function Name | Formula
$V_1(x)$ | $\left|\mathrm{erf}\left(\frac{\sqrt{\pi}}{2}\cdot 4(x-0.484)\right)\right|$
$V_2(x)$ | $\left|\tanh\left(3(x-0.484)\right)\right|$
$V_3(x)$ | $\left|\frac{\sqrt{1+1.672^2}}{1.672}\cdot\frac{x-0.484}{\sqrt{1+(x-0.484)^2}}\right|$
$V_4(x)$ | $\left|\frac{1}{\frac{2}{\pi}\arctan\left(\frac{\pi}{2}\cdot 5.016\right)}\cdot\frac{2}{\pi}\arctan\left(\frac{\pi}{2}\cdot 3(x-0.484)\right)\right|$
Table 7. Improved U-shaped family of transfer functions.
Function Name | Formula
$U_1(x)$ | $\min\left(\left|\frac{(x-0.484)^{1.5}}{0.516^{1.5}}\right|, 1\right)$
$U_2(x)$ | $\min\left(\left|\frac{(x-0.484)^{2}}{0.516^{2}}\right|, 1\right)$
$U_3(x)$ | $\min\left(\left|\frac{(x-0.484)^{3}}{0.516^{3}}\right|, 1\right)$
$U_4(x)$ | $\min\left(\left|\frac{(x-0.484)^{4}}{0.516^{4}}\right|, 1\right)$
Table 8. Improved Z-shaped family of transfer functions.
Function Name | Formula
$Z_1(x)$ | $\sqrt{\frac{1-2^{\,x-0.484}}{1-2^{\,1.602}}}$
$Z_2(x)$ | $\sqrt{\frac{1-5^{\,x-0.484}}{1-5^{\,1.602}}}$
$Z_3(x)$ | $\sqrt{\frac{1-8^{\,x-0.484}}{1-8^{\,1.602}}}$
$Z_4(x)$ | $\sqrt{\frac{1-20^{\,x-0.484}}{1-20^{\,1.602}}}$
Table 9. Unimodal benchmark functions.
Name | Function | Space | Dim | fmin
Sphere | $f_1(x)=\sum_{i=1}^{n}x_i^2$ | [−100, 100] | 30 | 0
Schwefel's function 2.22 | $f_2(x)=\sum_{i=1}^{n}|x_i|+\prod_{i=1}^{n}|x_i|$ | [−10, 10] | 30 | 0
Schwefel's function 1.2 | $f_3(x)=\sum_{i=1}^{n}\left(\sum_{j=1}^{i}x_j\right)^2$ | [−100, 100] | 30 | 0
Schwefel's function 2.21 | $f_4(x)=\max_i\{|x_i|,\ 1\le i\le n\}$ | [−100, 100] | 30 | 0
Rosenbrock | $f_5(x)=\sum_{i=1}^{n-1}\left[100\left(x_{i+1}-x_i^2\right)^2+\left(x_i-1\right)^2\right]$ | [−30, 30] | 30 | 0
Step | $f_6(x)=\sum_{i=1}^{n}\left(\left[x_i+0.5\right]\right)^2$ | [−100, 100] | 30 | 0
Dejong's noisy | $f_7(x)=\sum_{i=1}^{n}i\,x_i^4+random(0,1)$ | [−1.28, 1.28] | 30 | 0
Table 10. Common multimodal benchmark functions.
Name | Function | Space | Dim | fmin
Schwefel | $f_8(x)=\sum_{i=1}^{n}-x_i\sin\left(\sqrt{|x_i|}\right)$ | [−500, 500] | 30 | −12,569
Rastrigin | $f_9(x)=\sum_{i=1}^{n}\left[x_i^2-10\cos(2\pi x_i)+10\right]$ | [−5.12, 5.12] | 30 | 0
Ackley | $f_{10}(x)=-20\exp\left(-0.2\sqrt{\frac{1}{n}\sum_{i=1}^{n}x_i^2}\right)-\exp\left(\frac{1}{n}\sum_{i=1}^{n}\cos(2\pi x_i)\right)+20+e$ | [−32, 32] | 30 | 0
Griewank | $f_{11}(x)=\frac{1}{4000}\sum_{i=1}^{n}x_i^2-\prod_{i=1}^{n}\cos\left(\frac{x_i}{\sqrt{i}}\right)+1$ | [−600, 600] | 30 | 0
Generalized penalized 1 | $f_{12}(x)=\frac{\pi}{n}\left\{10\sin^2(\pi y_1)+\sum_{i=1}^{n-1}(y_i-1)^2\left[1+10\sin^2(\pi y_{i+1})\right]+(y_n-1)^2\right\}+\sum_{i=1}^{n}u(x_i,10,100,4)$, where $y_i=1+\frac{x_i+1}{4}$ and $u(x_i,a,k,m)=\begin{cases}k(x_i-a)^m, & x_i>a\\ 0, & -a<x_i<a\\ k(-x_i-a)^m, & x_i<-a\end{cases}$ | [−50, 50] | 30 | 0
Generalized penalized 2 | $f_{13}(x)=0.1\left\{\sin^2(3\pi x_1)+\sum_{i=1}^{n}(x_i-1)^2\left[1+\sin^2(3\pi x_{i+1})\right]+(x_n-1)^2\left[1+\sin^2(2\pi x_n)\right]\right\}+\sum_{i=1}^{n}u(x_i,10,100,4)$ | [−50, 50] | 30 | 0
Table 11. Multimodal benchmark functions in low dimension.
Name | Function | Space | Dim | fmin
Fifth of Dejong | $f_{14}(x)=\left(\frac{1}{500}+\sum_{j=1}^{25}\frac{1}{j+\sum_{i=1}^{2}(x_i-a_{ij})^6}\right)^{-1}$ | [−65, 65] | 2 | 1
Kowalik | $f_{15}(x)=\sum_{i=1}^{11}\left[a_i-\frac{x_1(b_i^2+b_i x_2)}{b_i^2+b_i x_3+x_4}\right]^2$ | [−5, 5] | 4 | 0.00030
Six-hump camel back | $f_{16}(x)=4x_1^2-2.1x_1^4+\frac{1}{3}x_1^6+x_1x_2-4x_2^2+4x_2^4$ | [−5, 5] | 2 | −1.0316
Branins | $f_{17}(x)=\left(x_2-\frac{5.1}{4\pi^2}x_1^2+\frac{5}{\pi}x_1-6\right)^2+10\left(1-\frac{1}{8\pi}\right)\cos x_1+10$ | [−5, 5] | 2 | 0.398
Goldstein–Price | $f_{18}(x)=\left[1+(x_1+x_2+1)^2\left(19-14x_1+3x_1^2-14x_2+6x_1x_2+3x_2^2\right)\right]\times\left[30+(2x_1-3x_2)^2\left(18-32x_1+12x_1^2+48x_2-36x_1x_2+27x_2^2\right)\right]$ | [−2, 2] | 2 | 3
Hartman 1 | $f_{19}(x)=-\sum_{i=1}^{4}c_i\exp\left(-\sum_{j=1}^{3}a_{ij}(x_j-p_{ij})^2\right)$ | [1, 3] | 3 | −3.86
Hartman 2 | $f_{20}(x)=-\sum_{i=1}^{4}c_i\exp\left(-\sum_{j=1}^{6}a_{ij}(x_j-p_{ij})^2\right)$ | [0, 1] | 6 | −3.32
Shekel 1 | $f_{21}(x)=-\sum_{i=1}^{5}\left[(X-a_i)(X-a_i)^T+c_i\right]^{-1}$ | [0, 10] | 4 | −10.1532
Shekel 2 | $f_{22}(x)=-\sum_{i=1}^{7}\left[(X-a_i)(X-a_i)^T+c_i\right]^{-1}$ | [0, 10] | 4 | −10.4028
Shekel 3 | $f_{23}(x)=-\sum_{i=1}^{10}\left[(X-a_i)(X-a_i)^T+c_i\right]^{-1}$ | [0, 10] | 4 | −10.5363
Table 12. The statistical results of the original transfer functions (each cell: avg / std).
Function | BPSO | BGWO | BROA_S3 | BROA_V2 | BROA_U3 | BROA_Z3
f1 | 0.9333 / 0.6397 | 3.1000 / 1.2959 | 0.0667 / 0.2537 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f2 | 0.7333 / 0.5833 | 2.8000 / 1.2149 | 0.1667 / 0.3790 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f3 | 8.0333 / 6.6978 | 117.4667 / 103.4004 | 0.4667 / 1.2243 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f4 | 1.0000 / 0.0000 | 1.0000 / 0.0000 | 1.0000 / 0.0000 | 0.8000 / 0.4068 | 0.0000 / 0.0000 | 1.0000 / 0.0000
f5 | 259.8333 / 112.4192 | 16.9667 / 38.5920 | 31.0333 / 48.3432 | 30.0333 / 18.1668 | 29.0000 / 0.0000 | 34.7667 / 41.6270
f6 | 8.7000 / 1.1265 | 13.2333 / 2.1485 | 8.1667 / 1.0933 | 7.5000 / 0.0000 | 7.5000 / 0.0000 | 7.5000 / 0.0000
f7 | 4.7210 / 3.0065 | 35.4002 / 19.3117 | 0.7211 / 1.5594 | 0.0006 / 0.0006 | 0.0005 / 0.0004 | 0.0010 / 0.0011
f8 | −24.5149 / 0.3653 | −25.2441 / 0.0000 | −25.1880 / 0.2135 | −25.1319 / 0.2909 | −25.0758 / 0.3423 | −25.2441 / 0.0000
f9 | 1.0000 / 0.6433 | 2.5667 / 1.5241 | 0.2333 / 0.4302 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f10 | 0.5548 / 0.3511 | 1.3355 / 0.2614 | 0.0239 / 0.1309 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f11 | 0.0223 / 0.0130 | 0.1358 / 0.0601 | 0.0025 / 0.0066 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f12 | 1.7746 / 0.0771 | 2.0588 / 0.1769 | 1.6790 / 0.0382 | 1.6690 / 0.0000 | 1.6690 / 0.0000 | 1.6690 / 0.0000
f13 | 0.1000 / 0.0643 | 0.0000 / 0.0000 | 0.0067 / 0.0254 | 0.0200 / 0.0407 | 0.0067 / 0.0254 | 0.0000 / 0.0000
f14 | 12.6705 / 0.0000 | 12.6705 / 0.0000 | 12.6705 / 0.0000 | 12.6705 / 0.0000 | 12.6705 / 0.0000 | 12.6705 / 0.0000
f15 | 0.1484 / 0.0000 | 0.1484 / 0.0000 | 0.1484 / 0.0000 | 0.1484 / 0.0000 | 0.1484 / 0.0000 | 0.1484 / 0.0000
f16 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f17 | 27.7029 / 0.0000 | 27.7029 / 0.0000 | 27.7029 / 0.0000 | 27.7029 / 0.0000 | 27.7029 / 0.0000 | 27.7029 / 0.0000
f18 | 600.0000 / 0.0000 | 600.0000 / 0.0000 | 600.0000 / 0.0000 | 600.0000 / 0.0000 | 600.0000 / 0.0000 | 600.0000 / 0.0000
f19 | −0.3348 / 0.0000 | −0.3337 / 0.0063 | −0.3348 / 0.0000 | −0.3348 / 0.0000 | −0.3348 / 0.0000 | −0.3348 / 0.0000
f20 | −0.1657 / 0.0000 | −0.1334 / 0.0554 | −0.1657 / 0.0000 | −0.1657 / 0.0000 | −0.1657 / 0.0000 | −0.1657 / 0.0000
f21 | −5.0552 / 0.0000 | −5.0552 / 0.0000 | −5.0552 / 0.0000 | −5.0552 / 0.0000 | −5.0552 / 0.0000 | −5.0552 / 0.0000
f22 | −5.0877 / 0.0000 | −5.0877 / 0.0000 | −5.0877 / 0.0000 | −5.0877 / 0.0000 | −5.0877 / 0.0000 | −5.0877 / 0.0000
f23 | −5.1285 / 0.0000 | −5.1285 / 0.0000 | −5.1285 / 0.0000 | −5.1285 / 0.0000 | −5.1285 / 0.0000 | −5.1285 / 0.0000
Table 13. Statistical results of applying improvement strategies (each cell: avg / std).
Function | BPSO | BGWO | BROA_S3 | BROA_V2 | BROA_U3 | BROA_Z3
f1 | 0.9333 / 0.6397 | 3.1000 / 1.2959 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f2 | 0.7333 / 0.5833 | 2.8000 / 1.2149 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f3 | 8.0333 / 6.6978 | 117.4667 / 103.4004 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f4 | 1.0000 / 0.0000 | 1.0000 / 0.0000 | 1.0000 / 0.0000 | 0.8667 / 0.3457 | 0.0000 / 0.0000 | 1.0000 / 0.0000
f5 | 259.8333 / 112.4192 | 16.9667 / 38.5920 | 3.5000 / 19.1703 | 28.0333 / 5.2947 | 29.0000 / 0.0000 | 29.4667 / 24.1400
f6 | 8.7000 / 1.1265 | 13.2333 / 2.1485 | 7.5000 / 0.0000 | 7.5000 / 0.0000 | 7.5000 / 0.0000 | 7.5000 / 0.0000
f7 | 4.7210 / 3.0065 | 35.4002 / 19.3117 | 0.0004 / 0.0003 | 0.0005 / 0.0005 | 0.0003 / 0.0004 | 0.0008 / 0.0008
f8 | −24.5149 / 0.3653 | −25.2441 / 0.0000 | −25.2441 / 0.0000 | −25.2441 / 0.0000 | −25.2161 / 0.1536 | −25.2441 / 0.0000
f9 | 1.0000 / 0.6433 | 2.5667 / 1.5241 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f10 | 0.5548 / 0.3511 | 1.3355 / 0.2614 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f11 | 0.0223 / 0.0130 | 0.1358 / 0.0601 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f12 | 1.7746 / 0.0771 | 2.0588 / 0.1769 | 1.6690 / 0.0000 | 1.6690 / 0.0000 | 1.6690 / 0.0000 | 1.6690 / 0.0000
f13 | 0.1000 / 0.0643 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0033 / 0.0183 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f14 | 12.6705 / 0.0000 | 12.6705 / 0.0000 | 12.6705 / 0.0000 | 12.6705 / 0.0000 | 12.6705 / 0.0000 | 12.6705 / 0.0000
f15 | 0.1484 / 0.0000 | 0.1484 / 0.0000 | 0.1484 / 0.0000 | 0.1484 / 0.0000 | 0.1484 / 0.0000 | 0.1484 / 0.0000
f16 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000 | 0.0000 / 0.0000
f17 | 27.7029 / 0.0000 | 27.7029 / 0.0000 | 27.7029 / 0.0000 | 27.7029 / 0.0000 | 27.7029 / 0.0000 | 27.7029 / 0.0000
f18 | 600.0000 / 0.0000 | 600.0000 / 0.0000 | 600.0000 / 0.0000 | 600.0000 / 0.0000 | 600.0000 / 0.0000 | 600.0000 / 0.0000
f19 | −0.3348 / 0.0000 | −0.3337 / 0.0063 | −0.3348 / 0.0000 | −0.3348 / 0.0000 | −0.3348 / 0.0000 | −0.3348 / 0.0000
f20 | −0.1657 / 0.0000 | −0.1334 / 0.0554 | −0.1657 / 0.0000 | −0.1657 / 0.0000 | −0.1657 / 0.0000 | −0.1657 / 0.0000
f21 | −5.0552 / 0.0000 | −5.0552 / 0.0000 | −5.0552 / 0.0000 | −5.0552 / 0.0000 | −5.0552 / 0.0000 | −5.0552 / 0.0000
f22 | −5.0877 / 0.0000 | −5.0877 / 0.0000 | −5.0877 / 0.0000 | −5.0877 / 0.0000 | −5.0877 / 0.0000 | −5.0877 / 0.0000
f23 | −5.1285 / 0.0000 | −5.1285 / 0.0000 | −5.1285 / 0.0000 | −5.1285 / 0.0000 | −5.1285 / 0.0000 | −5.1285 / 0.0000
Table 14. p-values of the Wilcoxon rank-sum test.
Function | PBROA_S3 | PBROA_V2 | PBROA_U3 | PBROA_Z3
f1 | 0.0156 | 1 | 1 | 1
f2 | 0.0425 | 1 | 1 | 1
f3 | 0.2500 | 1 | 1 | 1
f4 | 0.0001 | 0.2500 | 0.0000 | 1
f5 | 0.0080 | 0.8659 | 0.7465 | 0.0021
f6 | 0.0625 | 1 | 1 | 1
f7 | 0.0013 | 0.0023 | 0.0034 | 0.0033
f8 | 1 | 0.2500 | 0.0425 | 1
f9 | 1 | 1 | 1 | 1
f10 | 0.2500 | 1 | 1 | 1
f11 | 0.1250 | 1 | 1 | 0.2500
f12 | 0.1250 | 1 | 1 | 1
f13 | 1 | 0.1250 | 0.0335 | 1
f14 | 1 | 1 | 1 | 1
f15 | 1 | 1 | 1 | 1
f16 | 1 | 1 | 1 | 1
f17 | 1 | 1 | 1 | 1
f18 | 1 | 1 | 1 | 1
f19 | 1 | 1 | 1 | 1
f20 | 1 | 1 | 1 | 1
f21 | 1 | 1 | 1 | 1
f22 | 1 | 1 | 1 | 1
f23 | 1 | 1 | 1 | 1
Table 15. Information about the testing datasets.
Dataset | Instances | Number of Features | Number of Categories | Attribute Types
Breast Cancer | 284 | 9 | 2 | Categorical
Breast Cancer Wisconsin | 699 | 9 | 6 | Integer
Glass | 214 | 9 | 6 | Real
Iris | 150 | 4 | 3 | Real
South German Credit | 1000 | 20 | 4 | Integer, real
Wine | 178 | 13 | 3 | Integer, real
Flags | 194 | 28 | 8 | Categorical, integer
Dermatology | 366 | 33 | 6 | Categorical, integer
Credit Approval | 690 | 15 | 2 | Categorical, integer, real
Chess (kr-vs-kp) | 3196 | 36 | 2 | Categorical
Hepatitis | 155 | 19 | 2 | Categorical, integer, real
Image Segmentation | 210 | 19 | 7 | Real
Statlog (Australian Credit Approval) | 690 | 14 | 2 | Categorical, integer, real
Wall-Following Robot Navigation Data | 5456 | 25 | 4 | Real
Table 16. The result of fitness value.
Dataset | BPSO | BGWO | PBROA_S3 | PBROA_V2 | PBROA_U3 | PBROA_Z3
Breast Cancer | 0.1094 | 0.1030 | 0.1020 | 0.1039 | 0.1079 | 0.1010
Breast Cancer Wisconsin | 0.0042 | 0.0046 | 0.0037 | 0.0040 | 0.0041 | 0.0036
Glass | 0.1273 | 0.1258 | 0.1279 | 0.1214 | 0.1283 | 0.1208
Iris | 0.0025 | 0.0029 | 0.0025 | 0.0025 | 0.0025 | 0.0025
South German Credit | 0.4647 | 0.4421 | 0.4544 | 0.4569 | 0.4580 | 0.4594
Wine | 0.0020 | 0.0034 | 0.0018 | 0.0019 | 0.0017 | 0.0018
Flags | 0.2239 | 0.1877 | 0.2068 | 0.2197 | 0.2201 | 0.2120
Dermatology | 0.0035 | 0.0046 | 0.0031 | 0.0032 | 0.0032 | 0.0035
Credit Approval | 0.0817 | 0.0817 | 0.0704 | 0.0784 | 0.0780 | 0.0748
Chess (kr-vs-kp) | 0.0287 | 0.0323 | 0.0225 | 0.0314 | 0.0382 | 0.0257
Hepatitis | 0.0271 | 0.0226 | 0.0098 | 0.0179 | 0.0176 | 0.0195
Image Segmentation | 0.0068 | 0.0046 | 0.0067 | 0.0098 | 0.0066 | 0.0087
Statlog (Australian Credit Approval) | 0.0668 | 0.0764 | 0.0600 | 0.0660 | 0.0663 | 0.0649
Wall-Following Robot Navigation Data | 0.0447 | 0.0500 | 0.0359 | 0.0395 | 0.0406 | 0.0408
Table 17. The number of selected features.
Dataset | BPSO | BGWO | PBROA_S3 | PBROA_V2 | PBROA_U3 | PBROA_Z3
Breast Cancer | 3.1 | 3.6 | 3.5 | 3.65 | 3.3 | 3.4
Breast Cancer Wisconsin | 3.75 | 4.15 | 3.3 | 3.6 | 3.65 | 3.25
Glass | 5.35 | 5.05 | 4.8 | 6.4 | 6.2 | 4.75
Iris | 1 | 1.15 | 1 | 1 | 1 | 1
South German Credit | 9.65 | 9 | 8.45 | 8.4 | 9.2 | 9
Wine | 2.65 | 4.45 | 2.4 | 2.45 | 2.25 | 2.3
Flags | 10.6 | 11.25 | 10 | 9.7 | 10.75 | 9.9
Dermatology | 11.45 | 15.15 | 10.15 | 10.45 | 10.45 | 11.4
Credit Approval | 5.2 | 6.35 | 5.5 | 4.55 | 5.15 | 5.15
Chess (kr-vs-kp) | 19.5 | 19.95 | 19.45 | 22.6 | 29.15 | 18.95
Hepatitis | 6.05 | 6.05 | 6.55 | 6.65 | 6.1 | 6.7
Image Segmentation | 8.5 | 8.65 | 8.2 | 7.45 | 8 | 7.6
Statlog (Australian Credit Approval) | 4.2 | 4.55 | 4.15 | 3.55 | 3.9 | 4.45
Wall-Following Robot Navigation | 4.85 | 6.65 | 4.6 | 4.7 | 4.45 | 4.6