Personalized Body Constitution Inquiry Based on Machine Learning

Fan, Baochao; Li, Yanghui; Wen, Guihua; Ren, Yan; Lu, Yantong; Wang, Ziying; Zhang, Yuan; Wang, Changjun

doi:https://doi.org/10.1155/2020/8834465

Journal of Healthcare Engineering

On this page

Abstract Introduction Related Work Materials and Methods Results Discussion Conclusion Data Availability Disclosure Conflicts of Interest Acknowledgments Supplementary Materials References Copyright Related Articles

Research Article | Open Access

Volume 2020 | Article ID 8834465 | https://doi.org/10.1155/2020/8834465

Personalized Body Constitution Inquiry Based on Machine Learning

Baochao Fan,^1,2Yanghui Li,³Guihua Wen,³Yan Ren,^1,2Yantong Lu,^1,2Ziying Wang,^2,3Yuan Zhang,^2,4and Changjun Wang^1,2

Academic Editor: Saverio Maietta

Received28 Jun 2020

Accepted30 Oct 2020

Published12 Nov 2020

Abstract

Background. Body constitution (BC) is the abstract concept indicating the state of a person’s health in Traditional Chinese Medicine (TCM). The doctor identifies the body constitution of the patient through inspection and inquiry. Previous research simulates doctors to identify BC types according to a patient’s objective physical indicators. However, the lack of subjective feeling information can reduce the accuracy of the machine to imitate the doctor’s diagnosis. The Constitution in Chinese Medicine Questionnaire (CCMQ) is used to collect subjective information but suffers from low acquisition efficiency. Methods. This paper presents a personalized body constitution inquiry method based on a machine learning technique. It employs a random generator, a feature extractor, and a classifier to simulate the doctor inquiry and generate a personalized questionnaire. Specifically, the feature extractor evaluates and sorts the question of the constitution in the CCMQ based on the recognition results of the tongue coating image of patients. The sorted questions and relevant BC label are inputted into the classifier; the best questions are screened out for patients. Results. The experimental results show that our method can select personalized questions from the CCMQ for the patients, significantly reducing the time and the number of questions to answer. It also improves the accuracy of recognizing BC. Compared with the CCMQ, patients had 68.3% fewer questions to answer and the time occupied by answering is reduced by 80.3%. Conclusions. The proposed method can simulate the doctor's inquiry and pick out personalized questions for patients. It can act as auxiliary diagnosis tools to collect subjective patient feelings and help make further judgments on the patient’s BC types.

1. Introduction

Based on the innate inheritance of the human body and the influence of acquired factors, Body Constitution (BC) is comprehensively expressed through various aspects such as psychological state, viscera function, metabolic function, and human morphology. BC is an inherent characteristic of a relatively stable human body [1]. BC type of a patient can help doctors understand the patients’ health status and disease outcome, and then develop the targeted prevention, treatment, and rehabilitation programs. There are nine BC types according to the constitution theory in Traditional Chinese Medicine (TCM). They are Balanced Constitution, Qi-deficient Constitution, Yang-deficient Constitution, Yin-deficient Constitution, Phlegm-dampness Constitution, Damp-heat Constitution, Stagnant Blood Constitution, Stagnant Qi Constitution, and Inherited Special Constitution, where balanced constitution is well health status and the others are pathological [1]. BC identification is the research foundation of the constitution in Chinese medicine theory. Wang et al. developed the Constitution in Chinese Medicine Questionnaire (CCMQ) and the body constitution identification standard, which integrates multiple disciplines such as epidemiology, immunology, genetics, and mathematical statistics [2]. In the clinic, doctors carry on the BC Identification to the patient through the inspection and the inquiry result. The content of the inquiry comes from the questions in CCMQ [3, 4].

Artificial intelligence has integrated into the medical field development over the past decade. For TCM, the combination of emerging technologies and traditional theories has created several high-tech medical devices [5] that assist TCM doctors in judging diseases based on the intuitive data information, and even exploiting the rules of TCM from the collecting medical data [6] and simulate how the doctor diagnoses and prescribes. As for BC identification, researchers judge BC types by using machine learning to analyze the characteristic of the pulse [7], tongue [8], and face [9], which simulates the doctor's inspection. However, from the medical point of view, objective and subjective factors need to be considered comprehensively in clinical diagnosis. The image-BC identification technology analyzes the objective information of patients, while the subjective feeling information of patients is not analyzed. The lack of subjective feeling information can reduce the accuracy of the machine to imitate the doctor’s diagnosis. Therefore, we propose to analyze the subjective feeling information of patients by adding questionnaire data, which simulate the doctor inquiry. Specifically, we use the CCMQ to collect the patient’s subjective feelings to make further judgments on their BC types. However, CCMQ has some disadvantages during collecting data. For example, the number of questions is large; thus, patients will spend a long time on it. Many patients will be impatient when filling out the questionnaire, which will affect their choices. Patients in hospitals and community clinics often worry about their illness or experience anxiety while waiting for a doctor’s consultation, and therefore they do not have the patience to answer more questions, which leads to deviations in their BC identification. In summary, it is not realistic to use the full questionnaire as the basis for collecting data. In clinical practice, the doctor narrows the range of diagnoses by inspection and asks personalized questions according to the results of inspection and determines the diagnosis results finally. Therefore, how to push personalized and precise questions for each patient is the key to simulate the doctor inquiry. This paper proposes a BC identification method based on a random generation algorithm, feature selection algorithm, and classification methods. This method uses feature selection and classification technology for screening of the CCMQ problems and pushes personalized questions to patients according to the results of the inspection. We provide a complete questionnaire and evaluation criteria in the appendix.

The main contributions of this paper are as follows:(1)Inspired by the doctor’s diagnosis process, the questions of the CCMQ are introduced to quantify the patient’s subjective feelings to improve the accuracy of recognizing BC automatically. The CCMQ has the disadvantages of low acquisition efficiency and susceptibility to interference. We use feature selection algorithms to achieve the selection of personalized questions, which can improve collection efficiency.(2)In order to simulate the doctor inquiry, we propose a BC identification method based on a random generation algorithm, feature selection algorithm, and classification methods, and construct a body constitution identification model (BCIM) (Figure 1).(3)We adjusted the output range of the image-BC (identifying body constitution by the human body image) identification results to the first three, which provides the identification scope to make a further judgment (Figure 1). In order for the BCIM to be able to handle all output situations of the image-BC recognition method, we combine nine BC types into the combinations each of which contains three different BC without considering the combination order. Each BC combination represents an output situation. According to the BC types contained in each combination, original sample data are processed and corresponding BCIM are constructed (Figure 2).(4)Using the doctor’s judgments as references for comparison with the image-BC identification method, the accuracy of our method is improved by 25.8%. Compared with the CCMQ, patients had 68.3% fewer questions to answer than it, and the time occupied by answering is reduced by 80.3%.

Figure 2

Construction of the BC combination and the BCIM. BC_: Nine BC types; C_: BC_1; BC_2; BC_3: The BC combination number and the BC types; “⟶”: The BC combination matches the corresponding data in the original sample data to obtain its own original sample data ; “⟶”: The feature selection algorithm measures the importance score W of each question in each BC combination. The filtered questions collate the original sample data to obtain the new sample data ; “⟶”: Based on the new sample data , the identification model is constructed through the classifier; BCIM_: The identification model corresponding to each BC combination; they are all part of the BCIM.

Feature selection [10] is a key technology in processing high-dimensional data in computer pattern recognition tasks. Its function is to obtain as small a subset of features as possible to improve the effect of classification, clustering, and retrieval without significantly reducing the classification effect. Therefore, feature selection algorithms are often used in conjunction with classifiers. There is a huge amount of data in the medical field, and this will continue to grow as new technologies become available. Faced with an increasing amount of information and data types, the feature selection algorithm and classifier play a vital role in helping doctors obtain disease information most relevant to disease diagnosis from large amounts of data. In the past ten years, the application of feature selection and classifiers has been deeply applied in various disciplines and in the medical field [11]. For example, biomicroarray data analysis [12], biomedical signal processing [13–15], medical imaging [16, 17], medical modeling [18], disease diagnosis classification [19], and medical diagnostic system development [20] have achieved innovative breakthroughs. Such breakthroughs have been especially seen in big data biological information processing [21] and big data information mining [22]. A doctor can predict a human’s birth [23] and death [24] from physiological indicators through screening by feature selection technology. For drug development, feature selection and classifiers are used to predict functional classes of newly generated protein sequences [25] and protein inhibitors and substrates [26]. In clinical tests, they are used to predict the rate of amyloid aggregation [27] and the production of high antivasoactive peptides [28]. In summary, the main idea of these studies is to narrow the feature set by filtering the relevant data through a combination of single or multiple feature algorithms [29] and then enter the new feature set into the classifier to classify and find the indicator that is most closely related to the disease. At present, there are few studies on the application of machine learning methods in medical questionnaires. Most of them are concentrated on using a genetic algorithm (GA) to simplify questionnaires [30–32]. In this paper, we use the feature selection method and classifier technology to mine the characteristics of the scale questions and screen out highly targeted questions. The patient only needs to answer targeted questions about them to complete the data collection.

3. Materials and Methods

3.1. Task Description

The task of this paper is to build a BCIM based on the questionnaire options and judgment results of CCMQ. In our method, the BCIM takes the result of image-BC identification as input and outputs the corresponding CCMQ questions for the patient to answer. Finally, the BCIM judges the BC type based on the patient’s answer (Figure 1).

3.2. Data Collection

The dataset for this study was generated by the random generation algorithm. We simulated the patient’s answer by randomly selecting the options, and each simulation result was taken as a sample of the dataset. In order to facilitate the collection of data in the dataset, we combined two questions (60_1 and 60_2 in the CCMQ) that need to be answered according to gender attributes. We randomly assign a score of 1–5 to the 60 questions to generate m different samples . The ith sample in A is denoted as . , , , and denotes the patient’s score when answering the jth question. According to the CCMQ’s judgment method and standard, we calculate and distinguish the BC types of each answer. There are m judging results . Given the answers of the ith sample, we apply the questions-based body constitution algorithm proposed in the CCMQ to get the BC types of the ith sample. Finally, we construct the original dataset of m samples .

3.3. Construction of the BC Combination

In order to add questionnaire data to BC identification, we propose to construct the BC combination. In clinical practice, doctors narrow the range of diagnoses by inspection, but the final diagnosis still needs the assistance of inquiry. The output process of the image-BC identification method [7–9] is to output the probabilities of all types of the constitution first, and then selects the category with the highest probability as the judgment result. Therefore, in this study, we adjusted the output range of the image-BC identification results in the first three, which provides the identification scope for BCIM to make a further judgment (Figure 1). During the BCIM construction, in order to consider all the possible output situations of the image-BC recognition method, we combine nine BC types into the combinations, each of which contains three different BC without considering the combination order. According to the BC type contained in each combination, the original sample data is merged to construct the corresponding BCIM (Figure 2). It means that each BC combination has its own original sample data . ( represents the cth BC combination).

3.4. Screening the Representative Problems for BC Identification

Take the BC combination as a unit; the feature selection algorithm screens representative problems to build a dataset for constructing the BCIM. The feature selection algorithm (FS) measures the importance score W of each question in the BC combination, which can be abstracted as the following function:

We set the number k of filtered questions . We pick the top k questions with the highest importance scores from the original questions contained in each the BC combination. Next, we combine them into a new problem set . This means that each BC combination has a specific new problem set . The new question set is the set of questions that the patient needs to answer.

3.5. Construction of the BCIM

The new problem set obtained by the feature selection algorithm is a subset of original questions Q, while the assessment method in the CCMQ needs the answer of all original questions. Therefore, the assessment method in the CCMQ is not applicable to the new problem set . In order to identify the corresponding BC types from it, we construct a BCIM to identify BC based on .

3.5.1. Processing Training Data

The original sample data is filtered by the new question set to get the new sample data . The algorithm is summarized as below Algorithm 1.

	Input:
	new problem set , original sample data
	output:
	new sample data
	initialize to an empty list
	for q in do
	get answers to questions q from ;
	add to ;
	end for

3.5.2. Construction and Training of the BCIM

For training BCIM, the answers of the new question set in the new sample data , are input into the classifier as the features of the classifier, and the BC is the output of the classifier. The model is continuously updated through iterative training. The operation can be abstracted as the following function:

4. Results

4.1. Dataset

The dataset of the BCIM is collected from the randomly generated samples. The dataset can be organized into the following two parts:(1)Original Dataset. For these 60 questions in the CCMQ, the randomly generated algorithm is used to generate 1,000,000 different answers, and each answer is marked with the BC type according to the CCMQ’s judgment method. A detailed description of the data is shown in Table 1.(2)The Datasetfor Training the BCIM. The BCIM is constructed based on the BC combination. Nine BC types are combined into combinations containing three different BC types, resulting in a total of BC combinations. The feature selection algorithm is used to screen out the top k, questions with the highest importance score in each BC combination. These questions are combined with options and BC labels in the original dataset to obtain a dataset for training the BCIM. There are nine constitution labels in the dataset, including Balanced Constitution, Qi-deficient Constitution, Yang-deficient Constitution, Yin-deficient Constitution, Phlegm-dampness Constitution, Damp-heat Constitution, Stagnant Blood Constitution, Stagnant Qi Constitution, and Inherited Special Constitution.

4.2. Experimental Setup

Filter is a feature selection algorithm that focuses on the general characteristics of the data and independent of the classifier. According to the filter’s ability to score the features of each dimension, we use the chi-squared stats algorithm to evaluate the importance score of each combination of questions and screen them. Models based on linear discriminate analysis (LDA), the artificial neural network (ANN), k-nearest neighbor (k-NN), random forest (RF), and support vector machine (SVM) are constructed and run for identifying BC.

4.3. Evaluation Metrics

In order to evaluate the identification performances of the five aforementioned models, we use a 5-fold cross-validation method to train and evaluate the models. The new sample data of each combination are divided into five equal parts, taking 4/5 of the samples for the training set and the remaining for the test set. Each combination performs training five times and the test samples taken for each time do not overlap. Finally, the mean accuracy of five evaluation results is used as the accuracy of the model. In addition, we use the Macro-averaging (Macro-Precision, Macro-Recall, Macro-F Score) and Micro-averaging (Micro-Precision, Micro-Recall, Micro-F Score) as indicators to evaluate the multiclassification of the classifier. The classifier’s response time is also an evaluation indicator, denoting the calculation speed of the model.

4.4. Results and Analysis

The new sample data are used as training objects to compare the identification performances of the five models. We randomly select 10 BC combinations for comparison. To visually compare the performance of the classifier, we take the highest accuracy of the 5-fold as the accuracy of the model and extract the evaluation metrics data of this point. The performance is shown in Figure 3. The values obtained by LDA, ANN, and SVM models are all above 90%. This shows that identifying BC by feature selection and the classifier is feasible and effective. Notably, in the case of different BC combinations, we found that the number of features used to construct the BCIM with LDA is less than that used by other classifiers. The number of features represents the number of screened questions k.

(a)

(b)

(c)

(d)

(e)

(f)

(g)

(h)

Figure 3

Experimental performances of different models. A-G: The indicator to evaluate the multiclassification of the classifier. H: The number of features used to construct the model when the identification accuracy of the model is highest. C_1: BC combination: Balanced Constitution, Yang-deficient Constitution, Qi-deficient Constitution; C_2: BC combination: Damp-heat Constitution, Stagnant Blood Constitution, Stagnant Qi Constitution; C_3: BC combination: Yang-deficient Constitution, Phlegm-dampness Constitution, Stagnant Blood Constitution; C_4: BC combination: Balanced Constitution, Yin-deficient Constitution, Stagnant Blood Constitution; C_5: BC combination: Qi-deficient Constitution, Damp-heat Constitution, Stagnant Blood Constitution; C_6: BC combination: Qi-deficient Constitution, Stagnant Qi Constitution, Inherited Special Constitution; C_7: BC combination: Balanced Constitution, Yang-deficient Constitution, Phlegm-dampness Constitution; C_8: BC combination: Yang-deficient Constitution, Damp-heat Constitution, Stagnant Qi Constitution; C_9: BC combination: Yin-deficient Constitution, Stagnant Qi Constitution, Inherited Special Constitution; C_10: BC combination: Phlegm-dampness Constitution, Stagnant Blood Constitution, Inherited Special Constitution.

In order to further show and analyze the performance and difference of the classifier, we select two BC combinations that consist of the most common BC types in the survey [33] as examples from the comparison result. The specific data are shown in Figure 4 and Table 2. The abscissas in Figure 4 represent the number of features used to build the fitness discrimination model. The ordinate indicates the accuracy of the BCIM.

(a)

(b)

As we can see from Figure 4, the accuracy of all models would change with the number of features k. Models built by LDA and ANN work best, and their accuracies tend to stabilize after reaching the peak. The model built by SVM is not as good as the former two and is affected by the BC type in the combination. The accuracy of the model built by k-NN shows a downward trend after reaching the peak. This is because the increase in the number of features k results in an increase in the number of objects contained in the model and a decrease in the frequency of the correct category of objects. The accuracy of the model built by RF is low and shows disorderly fluctuations. This might be caused by the small number of features extracted by RF, resulting in a small subspace and low variety. In conclusion, the classification effect of LDA, ANN, and SVM models reach the experimental expectation.

In order to further compare the performance of the classifiers, we extracted the points with the highest accuracy of each model (Table 2). The value k represents the number of features used to construct the model making the identification accuracy of the model highest. For the Macro-averaging and the Micro-averaging, in the BC combination of Balanced Constitution, Yang-deficient Constitution, and Qi-deficient Constitution, the model built by ANN gets the highest score. All its accuracy, Micro-Precision, Micro-Recall, Micro-F Score, Macro-Recall, and Macro-F Score are 99.90%, and its Macro-Precision is 99.89%. Models built by LDA and SVM rank second and third. In the BC combination of the Yang-deficient Constitution, Phlegm-dampness Constitution, and the Stagnant Blood Constitution, the model built by LDA has the highest score. All its accuracy, Micro-Precision, Micro-Recall, Micro-F Score, Macro-Recall, and Macro-F Score are 99.84%, and the Macro-Precision is 99.83%. Models built by ANN and SVM rank second and third, respectively. Compared with the performance of the model about response time, the results of the other four classifiers except KNN are acceptable. LDA is the fastest one, whose response time is 0.009 seconds and 0.006 seconds, which means that the projection direction chosen by LDA can well classify the training data, which enables quick judgment during the test. The model built by KNN takes the most time, reaching 47 minutes, which means that the calculation efficiency of KNN is low. We believe that this is determined by the operation characteristics of KNN, which needs to calculate the distance between samples to be tested and each feature, so the calculation time of the classifier increases with the number of features.

Taken together, the performances of models built by LDA and ANN are the best and fit the purpose of our study. Although there is a slight difference between these classifiers, we can see that our proposed method can be applied to most classifiers and has high generalization. In this study, LDA is the best method to construct the BCIM if the number of questions to be answered by the patient is taken into account (Figure 3(h)).

4.5. Performance Comparison

In order to verify the performance of the BCIM constructed by the chi-squared stats algorithm and LDA in clinical practice application, we select the image body constitution identification model (image-BCIM) [8] as a baseline and use the doctor’s judgment and judgment results of the CCMQ as reference standards. This comparison experiment compares the effect of adding inquiry on the accuracy of BC identification based on inspection. Besides, we also record the number of questions and the time of answering them, which will be compared with the CCMQ.

74 volunteers participated in this comparison experiment. During the experiment, according to image-BCIM (simulating the doctor inspection), image-BCIM + BCIM (simulating the doctor inspection and inquiry) and doctor’s judgment, three methods were run for identifying the BC type of the volunteers. Another 70 volunteers performed physical examinations only by filling out the CCMQ. One example is presented in Table 3 to show the actual comparison results. The table is divided into three parts. The first part is the identification result of the image-BCIM. The second part is data from the image-BCIM + BCIM, in which we adjusted the output range of the image-BCIM identification results to the first three, in addition, the BCIM outputs questionnaire questions, identification result, and the time spent on filling and identifying. The third part is the doctor’s judgment results. It should be noted that there will be multiple results in the doctor judgment, all of which are the real result [34]. The human body is often in a subhealthy state with multiple unbalanced constitutions at the same time. This is common in elderly or frail people [35]. Therefore, when the result of the image-BCIM or the image-BCIM + BCIM matches one of the judgments of the doctor, we take it as a correct one.

The comparison results with volunteers’ participation are shown in Tables 4 and 5. In order to enhance the accuracy and persuasiveness of results, the evaluation results are averaged by test results. The values after “±” indicate the standard deviation of test results.

Taking the identification result of the doctor as the reference (Table 4), the accuracy rate of the image-BCIM + BCIM is 77.4%, which is 25.8% higher than that of the image-BCIM. Compared with routine answer time and the number of questions (Table 5), patients had 68.3% fewer questions to answer than that of CCMQ and the time occupied by answering is reduced by 80.3%.

5. Discussion

Our results show that the body constitution identification model based on feature selection and classifier can simulate the doctor inquiry that pushes targeted questions to patients and the subjective patient feelings are collected by it to make further judgments on the patient’s BC. The accuracy of recognizing BC automatically can be improved by combining questionnaire data with image-BC identification. Meanwhile, when collecting subjective feelings, the BCIM pushed targeted scale questions to the patients, which reduced the number of questions and the time needed to complete the questionnaire, and the identification efficiency was higher than the CCMQ. Clinically, doctors need to consider the patient’s objective signs (such as imaging reports and biochemical indicators) and subjective feelings (from the doctor’s consultation) to diagnose disease. According to the comparison experiment results in Tables 4 and 5, we have reason to believe that the improvement in the accuracy of BC identification is due to a comprehensive analysis of the patient’s objective information and subjective feelings.

Simulating the doctor inquiry using the feature selection and the classifier can provide us with a comparatively accurate result. In practice, there are one or two key unbalanced constitutions that dominate people’s current health. These key constitutions will continue to change with people’s habits, the external environment, and the treatment process. When faced with a patient with a composite constitution, analyzing the key constitution that currently affects their health is key to treatment. Although the predicted result may be one of the true BC types of the patient, it provides us with a kind of opinion for reference, which helps doctors quickly pinpoint the key BC type that most affects the patient.

6. Conclusion

This paper adds questionnaire data to image identification and uses the CCMQ to collect subjective feelings of patients to make further judgments on BC. In order to collect the questionnaire data of the patients more fully and effectively, we propose a BC identification method based on a random generation algorithm, feature selection algorithm, and classification method. We combine the method with the image identification method to compare the accuracy of BC identification before and after the combination. The results show that our method can improve the accuracy of BC identification, effectively avoid the shortcomings of the CCMQ when collecting information and improve the efficiency of BC identification. Through the comparison experiment, we show that artificial intelligence technology in the field of medicine to achieve the level of clinical diagnosis, the collection, and comprehensive analysis of objective and subjective factors is essential. The data processed by the method in this paper include but are not limited to the problems in the CCMQ, which is of referential significance to other clinical observation scales related to physiological or pathological indicators of patients.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Disclosure

The funder did not have any role in the design of the study, analysis, interpretation of data, or writing of the manuscript.

Conflicts of Interest

The authors declare they have no conflicts of interest for this study.

Acknowledgments

The authors would like to thank all the participants for their invaluable time. This study was supported by China National Science Foundation (Grant nos. 60973083 and 61273363), Science and Technology Planning Project of Guangdong Province (Grant nos. 2014A010103009 and 2015A020217002), and Guangzhou Science and Technology Planning Project (Grant nos. 201604020179 and 201803010088), and Guangdong Province Key Area R&D Plan Project (2020B1111120001).

Supplementary Materials

Constitution in Chinese Medicine Questionnaire (CCMQ). (Supplementary Materials)

References

L. Li, H. Yao, J. Wang, Y. Li, and Q. Wang, “The role of Chinese medicine in health maintenance and disease prevention: application of constitution theory,” The American Journal of Chinese Medicine, vol. 47, no. 3, pp. 495–506, 2019.
View at: Publisher Site | Google Scholar
Q. Wang, “Classification and diagnosis basis of nine basic constitutions in Chinese medicine,” Journal of Beijing University of Traditional Chinese Medicine, vol. 28, no. 4, p. 1, 2005.
View at: Google Scholar
Qi Wang, Z. Yan-bo, and He-sheng Xue, “Primary compiling of constitution in Chinese medicine questionnaire,” Chinese Journal of Clinical Rehabilitation, vol. 10, no. 3, pp. 12–14, 2006.
View at: Google Scholar
Z. Yan-bo, Qi Wang, and H. Xue, “Preliminary assessment on performance of constitution in Chinese medicine questionnaire,” Chinese Journal of Clinical Rehabilitation, vol. 10, no. 3, pp. 15–17, 2006.
View at: Google Scholar
X. Huang, P. Zhong, and G. Ma, “Artificial Intelligence and Chinese medicine intelligentization,” Journal of Traditional Chinese Medicine, vol. 58, no. 24, pp. 2076–2106, 2017.
View at: Publisher Site | Google Scholar
Hu Qinan and Yu Tong, “End-to-end syndrome differentiation of yin deficiency and yang deficiency in traditional Chinese medicine,” Computer Methods and Programs in Biomedicine, vol. 174, pp. 9–15, 2018.
View at: Publisher Site | Google Scholar
H. Li, B. Xu, N. Wang, and J. Liu, “Deep convolutional neural networks for classifying body constitution,” in Proceedings of the International Conference on Artificial Neural Networks, Springer, Cham, Switzerland, September 2016.
View at: Publisher Site | Google Scholar
J. Ma, Zero-shot Learning Method for Body Constitution Recognition Based on Tongue Image, South China University of Technology, Guangzhou, China, 2019.
View at: Publisher Site
H. Er-Yang, W. Gui-Hua, and Z. Shi-Jun, “Deep convolutional neural networks for classifying body constitution based on face image,” J. Computational and Mathematical Methods in Medicine, vol. 2017, pp. 1–9, 2017.
View at: Publisher Site | Google Scholar
N. Ghanbari, “A review of feature selection methods with the applications in pattern recognition in the last decade,” in Fundamental Research in Electrical Engineering. Lecture Notes in Electrical Engineering, S. Montaser Kouhsari, Ed., vol. 480,, Springer, Singapore, Singapore, 2019.
View at: Publisher Site | Google Scholar
R. Beatriz and B.-C. Veronica, “A review of feature selection methods in medical applications,” Journal of Computers in Biology and Medicine, vol. 112, Article ID 103375, 2019.
View at: Publisher Site | Google Scholar
F. S. Fogliatto, M. J. Anzanello, F. Soares, and P. G. Brust-Renck, “Decision support for breast cancer detection: classification improvement through feature selection,” Cancer Control, vol. 26, no. 1, 2019.
View at: Publisher Site | Google Scholar
P. Geethanjali and V. Raunak, “Identification of a feature selection based pattern recognition scheme for finger movement recognition from multichannel EMG signals,” Journal of the Australasian College of Physical Scientists and Engineers in Medicine, vol. 41, no. 2, pp. 549–559, 2018.
View at: Publisher Site | Google Scholar
S. Dongkoo, K. Im, and P. Jeong-Ho, “Emotional stress state detection using genetic algorithm-based feature selection on EEG signals,” International Journal of Environmental Research and Public Health, vol. 15, no. 11, 2018.
View at: Publisher Site | Google Scholar
G. Singh, B. Singh, and M. Kaur, “Grasshopper optimization algorithm-based approach for the optimization of ensemble classifier and feature selection to classify epileptic EEG signals,” Medical & Biological Engineering & Computing, vol. 57, no. 6, pp. 1323–1339, 2019.
View at: Publisher Site | Google Scholar
J. D. Álvarez, J. A. Matias-Guiu, M. N. Cabrera-Martín, J. L. Risco-Martín, and J. L. Ayala, “An application of machine learning with feature selection to improve diagnosis and classification of neurodegenerative disorders,” BMC Bioinformatics, vol. 20, no. 1, p. 491, 2019.
View at: Publisher Site | Google Scholar
H. Fan, D. Behdad, and T. Tan, “Retinal artery/vein classification using genetic-search feature selection,” Journal of Computer and Methods in Programs of Biomedecine, vol. 161, pp. 197–207, 2018.
View at: Publisher Site | Google Scholar
J. Farrukh, T. Ilias, and M. Mevludin, “A comparison of feature selection methods when using motion sensors data: a case study in Parkinson’s disease,” Journal of Conference and Proceedings in IEEE Engineering in Medicine and Biology Society, vol. 2018, pp. 5426–5429, 2018.
View at: Publisher Site | Google Scholar
C. Kang, Y. Huo, L. Xin, B. Tian, and B. Yu, “Feature selection and tumor classification for microarray data using relaxed Lasso and generalized multi-class support vector machine,” Journal of Theoretical Biology, vol. 463, pp. 77–91, 2019.
View at: Publisher Site | Google Scholar
F. Li, C. Zhao, Z. Xia, Y. Wang, X. Zhou, and G.-Z. Li, “Computer-assisted lip diagnosis on Traditional Chinese Medicine using multi-class support vector machines,” BMC Complementary and Alternative Medicine, vol. 12, no. 1, p. 127, 2012.
View at: Publisher Site | Google Scholar
L. Wang, Y. Wang, and Q. Chang, “Feature selection methods for big data bioinformatics: a survey from the search perspective,” Journal of Methods, vol. 111, pp. 21–31, 2016.
View at: Publisher Site | Google Scholar
R. J. Urbanowicz, R. S. Olson, P. Schmitt, M. Meeker, and J. H. Moore, “Benchmarking relief-based feature selection methods for bioinformatics data mining,” Journal of Biomedical Informatics, vol. 85, pp. 168–188, 2018.
View at: Publisher Site | Google Scholar
Z. Cömert, A. Şengür, Ü. Budak, and A. F. Kocamaz, “Prediction of intrapartum fetal hypoxia considering feature selection algorithms and machine learning models,” Health Information Science and Systems, vol. 7, no. 1, p. 17, 2019.
View at: Publisher Site | Google Scholar
E. Elias, F. Alireza, and S. Mohammad, “An optimal strategy for prediction of sudden cardiac death through a pioneering feature-selection approach from HRV signal,” Journal of Computer Methods and Programs in Biomedicine, vol. 169, pp. 19–36, 2019.
View at: Publisher Site | Google Scholar
P. Debasmita, P. Sudarsan, and B. Sahoo, “Enzyme classification using multiclass support vector machine and feature subset selection,” Journal of Computers in Biological Chemistry, vol. 70, pp. 211–219, 2017.
View at: Publisher Site | Google Scholar
C. G. Gonzalo and G.-P. Nicolás, “Boosted feature selectors: a case study on prediction P-gp inhibitors and substrates,” Journal of Computer-Aided Molecular Design, vol. 32, no. 11, pp. 1273–1294, 2018.
View at: Publisher Site | Google Scholar
W. Yang, P. Tan, X. Fu, and L. Hong, “Prediction of amyloid aggregation rates by machine learning and feature selection,” The Journal of Chemical Physics, vol. 151, no. 8, p. 84106, 2019.
View at: Publisher Site | Google Scholar
B. J. Liñares, B. Porto-Pazos Ana, and P. Alejandro, “Prediction of high anti-angiogenic activity peptides in silico using a generalized linear model and feature selection,” Journal of Science Reports, vol. 8, no. 1, p. 15688, 2018.
View at: Publisher Site | Google Scholar
G.-C. Yolanda, G.-Z. Begonya, and G.-B. Marian, “Automatic migraine classification via feature selection committee and machine learning techniques over imaging and questionnaire data,” Journal of BMC Medical Informatics and Decision Making, vol. 17, no. 1, p. 38, 2017.
View at: Publisher Site | Google Scholar
H. Eisenbarth, S. O. Lilienfeld, and T. Yarkoni, “Using a genetic algorithm to abbreviate the psychopathic personality inventory-revised (PPI-R),” Psychological Assessment, vol. 27, no. 1, pp. 194–202, 2015.
View at: Publisher Site | Google Scholar
B. K. Sahdra, J. Ciarrochi, P. Parker, and L. Scrucca, “Using genetic algorithms in a large nationally representative american sample to abbreviate the multidimensional experiential avoidance questionnaire,” Frontiers in Psychology, vol. 7, p. 189, 2016.
View at: Publisher Site | Google Scholar
R. Enny, H. Chien-Yeh, and N. Nurjanah, “Developing an Indonesia’s health literacy short-form survey questionnaire (HLS-EU-SQ10-IDN) using the feature selection and genetic algorithm,” Journal of Computer Methods in Programs and Biomedicine, vol. 182, Article ID 105047, 2019.
View at: Publisher Site | Google Scholar
K. Hu, S. Xia, and M. Fan, “Investigation and analysis of the TCM constitution of 53693 elderly residents in Zhongshan,” Shenzhen Journal of Integrated Traditional Chinese and Western Medicine, vol. 27, no. 15, pp. 74–76, 2017.
View at: Publisher Site | Google Scholar
P. Sun, J. Wang, and Q. Wang, “Study on identification and intervention of composite constitutions,” Journal of Beijing University of Traditional Chinese Medicine, vol. 42, no. 2, pp. 99–102, 2019.
View at: Publisher Site | Google Scholar
T. Zhi-hui, W. Qi, Z. Yan et al., “Investigation and analysis of TCM constitution of Korean elderly group by applying Wang Qi’s nine TCM constitution questionnaire,” Guiding Journal of Traditional Chinese Medicine and Pharmacology, vol. 25, no. 19, pp. 86–89, 2019.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2020 Baochao Fan et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

860

Downloads

780

Citations

Journal of Healthcare Engineering

Personalized Body Constitution Inquiry Based on Machine Learning

Abstract

1. Introduction

2. Related Work

3. Materials and Methods

3.1. Task Description

3.2. Data Collection

3.3. Construction of the BC Combination

3.4. Screening the Representative Problems for BC Identification

3.5. Construction of the BCIM

3.5.1. Processing Training Data

3.5.2. Construction and Training of the BCIM

4. Results

4.1. Dataset

4.2. Experimental Setup

4.3. Evaluation Metrics

4.4. Results and Analysis

4.5. Performance Comparison

5. Discussion

6. Conclusion

Data Availability

Disclosure

Conflicts of Interest

Acknowledgments

Supplementary Materials

References

Copyright