NSGA-III-Based Deep-Learning Model for Biomedical Search Engines

With the advancements in biomedical imaging applications, it becomes more important to provide potential results for searching the biomedical imaging data. During the health emergency, tremors require efficient results at rapid speed to provide results to spatial queries using the Web. An efficient biomedical search engine can obtain the significant search intention and return additional important contents in which users have already indicated some interest.*e development of biomedical search engines is still an open area of research. Recently, many researchers have utilized various deep-learning models to improve the performance of biomedical search engines. However, the existing deep-learning-based biomedical search engines suffer from the overfitting and hyperparameter tuning problems. *erefore, in this paper, a nondominated-sorting-genetic-algorithm-III(NSGA-III-) based deep-learning model is proposed for biomedical search engines. Initially, the hyperparameters of the proposed deep-learning model are obtained using the NSGA-III.*ereafter, the proposed deep-learningmodel is trained by using the tuned parameters. Finally, the proposed model is validated on the testing dataset. Comparative analysis reveals that the proposed model outperforms the competitive biomedical search engine models.


Introduction
e advancements in biomedical imaging applications lead to the challenge of providing the significant results for searching the biomedical imaging data. During the health emergency, tremors require efficient results at rapid speed to provide results to spatial queries using the Web [1]. Search engines which allow obtaining specific medical contents along with the complementary and different details would considerably help biomedical researchers [2]. An efficient biomedical search engine can obtain the significant search intention and return additional important contents in which users have already shown some interest [3,4].
Numerous biomedical search engines have been implemented such as inverted index and Boolean retrieval [5][6][7][8]. But, index sizes are becoming exponentially large as the number of biomedical contents is increasing at a rapid rate [5].
us, ranking of biomedical contents has been achieved using their respective retrieval frequency [8]. It has been found that to compute the results when the biomedical queries have minimum similarity scores is still an open area of research [9][10][11].
Essie is a well-known biomedical search engine which is providing services to various websites at the National Library of Medicine. It is a phrase-based search engine with notion and term query expansion and probabilistic relevancy ranking. It has proven that a judicious group of exploiting document structure, phrase searching, and concept-based query expansion is a beneficial method for data retrieval in the biomedical field [12].
Bidirectional Encoder Representations from Transformers (BERTs) has shown good advancement in the field of biomedical search engines. In precision medicine, corresponding patients to appropriate investigational support or probable therapies is a difficult job which needs both biological and clinical information. To resolve it, BERTbased ranking models can provide fair comparisons [13]. A computer-based query recommendation model was designed which recommends semantically interchangeable terms based on an initial user-entered query [14,15]. e typical view of biomedical search engine is represented in Figure 1. Initially, potential features of biomedical images are obtained.
ereafter, similarity score is computed. A prediction model is then utilized to compute the image index. Finally, the obtained class wise results are returned to the users. e primary contributions of this paper are as follows: (1) An NSGA-III-based deep-learning model is proposed for biomedical search engines (2) e hyperparameters of the proposed deep-learning model are obtained using the NSGA-III (3) e proposed deep-learning model is trained by using the tuned parameters obtained from NSGA-III (4) Finally, the proposed model is validated on the testing dataset.
e remaining paper is organized as follows: e related work is presented in Section 2. Section 3 presents the proposed biomedical search engine. Comparative analysis is discussed in Section 4. Section 5 concludes the paper.

Related Work
Hsieh et al. [16] proposed a semantic similarity approach by utilizing the page counts of two biomedical contents obtained from Google AJAX web search engine. e features were extracted in co-occurrence forms by considering two provided words. Support vector machines (SVMs) were utilized for classification purpose. Mao and Tian [17] utilized TCMSearch as a semantic-based search engine for biomedical images. It has shown good results for biomedical contents. Wang et al. [3] designed an ontologygraph-based web search engine named as G-Bean for evaluating biomedical contents from the MEDLINE database. e multithreading parallel approach was utilized to obtain the document index to address. Kohlschein et al. [18] studied that ViLiP can be efficiently utilized to search the contents in PubMed. ViLiP was further improved using NLP-based semantic search engine for obtaining better drug-related contents within a query. Depending upon the linguistic annotations, significant drug names can be obtained.
Tsishkou et al. [19] designed a TTA10 approach which stores biomedical data in hierarchical fashion. Logarithmic complexity was utilized to retrieve a huge data repository. AdaBoost was utilized to integrate independent search results to obtain efficient results. Mao et al. [1] designed a prototype model of subject-oriented spatial-content-based search engine (SOSC) for critical public health hazards. It can obtain Web contents from the Internet, find the Web page database, and obtain spatial content during pandemic from these Web pages. Boulos [20] designed a GeoNames-powered PubMed search which has an ability to handle these problems. e geographic ontology can utilize potential words to obtain the significant results from the PubMed. Al Zamil and Can [21] improved the contextual retrieval and ranking performance (CRRP) with minimal input from researchers. e performance was evaluated using the retrieval procedure in terms of topical ranking, precision, and recall. Grandhe et al. [22] designed an ascendable search engine (ASE) for biomedical images. Researchers can select a region of interest iteratively to evaluate the corresponding region from the images. An efficient cluster-based engine was designed to reduce the content retrieval time. Mishra and Tripathi [23] proposed a vector-and deep-learning-(VDP-) based biomedical search engine model. e degree of similarity was computed by integrating the vector space and deep-learning model. e implementation of efficient biomedical search engines is still a challenging issue [24]. Recently, many researchers have utilized various deep-learning models to improve the performance of biomedical search engines [25]. However, the existing deep-learning-based biomedical search engines suffer from the overfitting and hyperparameter tuning problems.

Proposed Model
is section discusses the proposed biomedical search engine. Initially, the deep-learning model is discussed.
ereafter, the tuning of the deep-learning model is achieved using the NSGA-III.

Deep Convolutional Neural Network.
e deep convolutional neural network (CNN) is widely accepted as a classification problem, and many researchers have utilized it in the field of search engines. Figure 2 shows the deeplearning-model-based biomedical search engine. It utilizes numerous convolution filters to extract the potential features.
We assume a single channel which is mathematically computed as ⊂ � n 1 , n 2 , n 3 , . . . , n m . (1) Here, ⊂∈ P m×r . r shows the dimension of every input factor. m represents the number of images. During convolution operation, a filter u ∈ P kr is utilized to extract the potential features as where d ∈ P represents a bias. n b:b+k−1 is the integration of n b , . . . , n b+k−1 . R shows an activation function. e filter u approaches n 1:k , n 2: k+1 , . . . , n m−k+1: m . us, feature map can be computed as a � a 1 , a 2 , . . . , a m−k+1 .
Maxpool is applied on a to compute peak value as a � max A { }. It shows the final feature obtained using u.
where z shows the number of filters. Softmax (z R ) is utilized to evaluate the prediction probability as We assume a training set (a b , j b ), where j b ∈ 1, 2, . . . , n { } shows the similarity of the biomedical image query for search engine A b , and the prediction probability of the proposed model is j b t ∈ [0, 1] for each label t ∈ 1, 2, . . . , n { }. e computed error can be defined as Here, n represents the labels of e gradient descent is then utilized to update the deeplearning variables.

Nondominated Sorting Genetic Algorithm-III.
Nondominated sorting algorithm-III (NSGA-III) [26] is widely accepted to solve numerous engineering applications. Recently, many researchers have utilized NSGA-III to solve hyperparameter tuning issues with deep-learning models [9,11,27]. e nomenclature of NSGA-III is demonstrated in Table 1. e generation of the initial population is represented in Algorithm 1. Initially, the random population is computed.
e computed solutions are then encoded to the initial attributes of CNN.
NSGA-III-and CNN-based biomedical search engines are discussed in Algorithm 2.
e random-population-based CNN models are trained on the chunk of biomedical dataset. Fitness of the computed CNN models is then evaluated. Solutions are then divided into dominated and nondominated groups. Crossover and mutation operators are further employed to compute the children. Nondominated sorting () is implemented to sort the nondominated solutions. Based upon the termination criteria, the tuned parameters of CNN models are returned.

Performance Analysis
e proposed biomedical search engine is implemented on MATLAB 2019a with the help of deep-learning and image processing toolboxes. e proposed and the existing models are tested on the biomedical search engine dataset. e proposed model is compared with the competitive models such as TCMSearch [17], SVM [16], G-Bean [3], TTA10 [19], ViLiP [18], SOSC [1], GeoNames [20], CRRP [21], ASE [22], and VDP [23]. To compute the performance of the NSGA-III-based CNN model, median and variation values (i.e., median ±IQR × 1.5) are computed. 70% of biomedical dataset is used for building the model. 15% of the dataset is used for validation purpose. Remaining 15% is used for testing purpose. e training and validation loss analysis of the NSGA-III-based CNN model are represented in Figure 3. It clearly shows that the loss difference between training and validation is significantly lesser; therefore, the NSGA-III-based CNN model is least affected from the overfitting issue. Additionally, the loss approaches towards −60 and convergence during the 92 nd epoch. us, the proposed model is trained efficiently on the biomedical images.
Training and testing analysis of the NSGA-III-based CNN model are depicted in Tables 2 and 3. Specificity, area under curve (AUC), sensitivity, f-measure, and accuracy metrics have been utilized to evaluate the performance of the NSGA-III-based CNN model over competitive models such as TCMSearch [17], SVM [16], G-Bean [3], TTA10 [19], ViLiP [18], SOSC [1], GeoNames [20], CRRP [21], ASE [22], and VDP [23]. It has been observed that the proposed model outperforms the competitive models. e bold indicates the highest performance of biomedical search engines. Comparative analysis reveals that the proposed model outperforms the competitive biomedical search engine models in terms of specificity, AUC, sensitivity, f-measure, and accuracy by 1.5828%, 1.8372, 1.8328, 1.4838, and 1.4828, respectively. e comparative analysis of the NSGA-III-based CNN model with the state-of-the-art approaches is depicted in Table 4. It has been observed that the NSGA-III-based CNN model achieves significantly better results than the existing web search engines.

Conclusions
is paper has proposed an efficient model for biomedical search engines. It has been found that the deep-learning models can be used to improve the performance of the biomedical search engines. However, the existing deeplearning-based biomedical search engines suffer from the overfitting and hyperparameter tuning problems. erefore, an NSGA-III-based CNN model was proposed for biomedical search engines. Initially, the hyperparameters of the proposed model were obtained using the NSGA-III.
ereafter, the proposed CNN model was trained by using the tuned parameters. Finally, the proposed model is validated on the testing dataset. Comparative analysis reveal that the proposed model outperforms the competitive biomedical search engine models in terms of specificity, AUC, sensitivity, f-measure, and accuracy by 1.5828%, 1.8372, 1.8328, 1.4838, and 1.4828, respectively.

Data Availability
e dataset used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest
e authors declare that they have no conflicts of interest.