Abstract

Background. Body constitution (BC) is the abstract concept indicating the state of a person’s health in Traditional Chinese Medicine (TCM). The doctor identifies the body constitution of the patient through inspection and inquiry. Previous research simulates doctors to identify BC types according to a patient’s objective physical indicators. However, the lack of subjective feeling information can reduce the accuracy of the machine to imitate the doctor’s diagnosis. The Constitution in Chinese Medicine Questionnaire (CCMQ) is used to collect subjective information but suffers from low acquisition efficiency. Methods. This paper presents a personalized body constitution inquiry method based on a machine learning technique. It employs a random generator, a feature extractor, and a classifier to simulate the doctor inquiry and generate a personalized questionnaire. Specifically, the feature extractor evaluates and sorts the question of the constitution in the CCMQ based on the recognition results of the tongue coating image of patients. The sorted questions and relevant BC label are inputted into the classifier; the best questions are screened out for patients. Results. The experimental results show that our method can select personalized questions from the CCMQ for the patients, significantly reducing the time and the number of questions to answer. It also improves the accuracy of recognizing BC. Compared with the CCMQ, patients had 68.3% fewer questions to answer and the time occupied by answering is reduced by 80.3%. Conclusions. The proposed method can simulate the doctor's inquiry and pick out personalized questions for patients. It can act as auxiliary diagnosis tools to collect subjective patient feelings and help make further judgments on the patient’s BC types.

1. Introduction

Based on the innate inheritance of the human body and the influence of acquired factors, Body Constitution (BC) is comprehensively expressed through various aspects such as psychological state, viscera function, metabolic function, and human morphology. BC is an inherent characteristic of a relatively stable human body [1]. BC type of a patient can help doctors understand the patients’ health status and disease outcome, and then develop the targeted prevention, treatment, and rehabilitation programs. There are nine BC types according to the constitution theory in Traditional Chinese Medicine (TCM). They are Balanced Constitution, Qi-deficient Constitution, Yang-deficient Constitution, Yin-deficient Constitution, Phlegm-dampness Constitution, Damp-heat Constitution, Stagnant Blood Constitution, Stagnant Qi Constitution, and Inherited Special Constitution, where balanced constitution is well health status and the others are pathological [1]. BC identification is the research foundation of the constitution in Chinese medicine theory. Wang et al. developed the Constitution in Chinese Medicine Questionnaire (CCMQ) and the body constitution identification standard, which integrates multiple disciplines such as epidemiology, immunology, genetics, and mathematical statistics [2]. In the clinic, doctors carry on the BC Identification to the patient through the inspection and the inquiry result. The content of the inquiry comes from the questions in CCMQ [3, 4].

Artificial intelligence has integrated into the medical field development over the past decade. For TCM, the combination of emerging technologies and traditional theories has created several high-tech medical devices [5] that assist TCM doctors in judging diseases based on the intuitive data information, and even exploiting the rules of TCM from the collecting medical data [6] and simulate how the doctor diagnoses and prescribes. As for BC identification, researchers judge BC types by using machine learning to analyze the characteristic of the pulse [7], tongue [8], and face [9], which simulates the doctor's inspection. However, from the medical point of view, objective and subjective factors need to be considered comprehensively in clinical diagnosis. The image-BC identification technology analyzes the objective information of patients, while the subjective feeling information of patients is not analyzed. The lack of subjective feeling information can reduce the accuracy of the machine to imitate the doctor’s diagnosis. Therefore, we propose to analyze the subjective feeling information of patients by adding questionnaire data, which simulate the doctor inquiry. Specifically, we use the CCMQ to collect the patient’s subjective feelings to make further judgments on their BC types. However, CCMQ has some disadvantages during collecting data. For example, the number of questions is large; thus, patients will spend a long time on it. Many patients will be impatient when filling out the questionnaire, which will affect their choices. Patients in hospitals and community clinics often worry about their illness or experience anxiety while waiting for a doctor’s consultation, and therefore they do not have the patience to answer more questions, which leads to deviations in their BC identification. In summary, it is not realistic to use the full questionnaire as the basis for collecting data. In clinical practice, the doctor narrows the range of diagnoses by inspection and asks personalized questions according to the results of inspection and determines the diagnosis results finally. Therefore, how to push personalized and precise questions for each patient is the key to simulate the doctor inquiry. This paper proposes a BC identification method based on a random generation algorithm, feature selection algorithm, and classification methods. This method uses feature selection and classification technology for screening of the CCMQ problems and pushes personalized questions to patients according to the results of the inspection. We provide a complete questionnaire and evaluation criteria in the appendix.

The main contributions of this paper are as follows:(1)Inspired by the doctor’s diagnosis process, the questions of the CCMQ are introduced to quantify the patient’s subjective feelings to improve the accuracy of recognizing BC automatically. The CCMQ has the disadvantages of low acquisition efficiency and susceptibility to interference. We use feature selection algorithms to achieve the selection of personalized questions, which can improve collection efficiency.(2)In order to simulate the doctor inquiry, we propose a BC identification method based on a random generation algorithm, feature selection algorithm, and classification methods, and construct a body constitution identification model (BCIM) (Figure 1).(3)We adjusted the output range of the image-BC (identifying body constitution by the human body image) identification results to the first three, which provides the identification scope to make a further judgment (Figure 1). In order for the BCIM to be able to handle all output situations of the image-BC recognition method, we combine nine BC types into the combinations each of which contains three different BC without considering the combination order. Each BC combination represents an output situation. According to the BC types contained in each combination, original sample data are processed and corresponding BCIM are constructed (Figure 2).(4)Using the doctor’s judgments as references for comparison with the image-BC identification method, the accuracy of our method is improved by 25.8%. Compared with the CCMQ, patients had 68.3% fewer questions to answer than it, and the time occupied by answering is reduced by 80.3%.

Feature selection [10] is a key technology in processing high-dimensional data in computer pattern recognition tasks. Its function is to obtain as small a subset of features as possible to improve the effect of classification, clustering, and retrieval without significantly reducing the classification effect. Therefore, feature selection algorithms are often used in conjunction with classifiers. There is a huge amount of data in the medical field, and this will continue to grow as new technologies become available. Faced with an increasing amount of information and data types, the feature selection algorithm and classifier play a vital role in helping doctors obtain disease information most relevant to disease diagnosis from large amounts of data. In the past ten years, the application of feature selection and classifiers has been deeply applied in various disciplines and in the medical field [11]. For example, biomicroarray data analysis [12], biomedical signal processing [1315], medical imaging [16, 17], medical modeling [18], disease diagnosis classification [19], and medical diagnostic system development [20] have achieved innovative breakthroughs. Such breakthroughs have been especially seen in big data biological information processing [21] and big data information mining [22]. A doctor can predict a human’s birth [23] and death [24] from physiological indicators through screening by feature selection technology. For drug development, feature selection and classifiers are used to predict functional classes of newly generated protein sequences [25] and protein inhibitors and substrates [26]. In clinical tests, they are used to predict the rate of amyloid aggregation [27] and the production of high antivasoactive peptides [28]. In summary, the main idea of these studies is to narrow the feature set by filtering the relevant data through a combination of single or multiple feature algorithms [29] and then enter the new feature set into the classifier to classify and find the indicator that is most closely related to the disease. At present, there are few studies on the application of machine learning methods in medical questionnaires. Most of them are concentrated on using a genetic algorithm (GA) to simplify questionnaires [3032]. In this paper, we use the feature selection method and classifier technology to mine the characteristics of the scale questions and screen out highly targeted questions. The patient only needs to answer targeted questions about them to complete the data collection.

3. Materials and Methods

3.1. Task Description

The task of this paper is to build a BCIM based on the questionnaire options and judgment results of CCMQ. In our method, the BCIM takes the result of image-BC identification as input and outputs the corresponding CCMQ questions for the patient to answer. Finally, the BCIM judges the BC type based on the patient’s answer (Figure 1).

3.2. Data Collection

The dataset for this study was generated by the random generation algorithm. We simulated the patient’s answer by randomly selecting the options, and each simulation result was taken as a sample of the dataset. In order to facilitate the collection of data in the dataset, we combined two questions (60_1 and 60_2 in the CCMQ) that need to be answered according to gender attributes. We randomly assign a score of 1–5 to the 60 questions to generate m different samples . The ith sample in A is denoted as . , , , and denotes the patient’s score when answering the jth question. According to the CCMQ’s judgment method and standard, we calculate and distinguish the BC types of each answer. There are m judging results . Given the answers of the ith sample, we apply the questions-based body constitution algorithm proposed in the CCMQ to get the BC types of the ith sample. Finally, we construct the original dataset of m samples .

3.3. Construction of the BC Combination

In order to add questionnaire data to BC identification, we propose to construct the BC combination. In clinical practice, doctors narrow the range of diagnoses by inspection, but the final diagnosis still needs the assistance of inquiry. The output process of the image-BC identification method [79] is to output the probabilities of all types of the constitution first, and then selects the category with the highest probability as the judgment result. Therefore, in this study, we adjusted the output range of the image-BC identification results in the first three, which provides the identification scope for BCIM to make a further judgment (Figure 1). During the BCIM construction, in order to consider all the possible output situations of the image-BC recognition method, we combine nine BC types into the combinations, each of which contains three different BC without considering the combination order. According to the BC type contained in each combination, the original sample data is merged to construct the corresponding BCIM (Figure 2). It means that each BC combination has its own original sample data . ( represents the cth BC combination).

3.4. Screening the Representative Problems for BC Identification

Take the BC combination as a unit; the feature selection algorithm screens representative problems to build a dataset for constructing the BCIM. The feature selection algorithm (FS) measures the importance score W of each question in the BC combination, which can be abstracted as the following function:

We set the number k of filtered questions . We pick the top k questions with the highest importance scores from the original questions contained in each the BC combination. Next, we combine them into a new problem set . This means that each BC combination has a specific new problem set . The new question set is the set of questions that the patient needs to answer.

3.5. Construction of the BCIM

The new problem set obtained by the feature selection algorithm is a subset of original questions Q, while the assessment method in the CCMQ needs the answer of all original questions. Therefore, the assessment method in the CCMQ is not applicable to the new problem set . In order to identify the corresponding BC types from it, we construct a BCIM to identify BC based on .

3.5.1. Processing Training Data

The original sample data is filtered by the new question set to get the new sample data . The algorithm is summarized as below Algorithm 1.

Input:
  new problem set , original sample data
output:
  new sample data
initialize to an empty list
for q in do
 get answers to questions q from ;
 add to ;
end for
3.5.2. Construction and Training of the BCIM

For training BCIM, the answers of the new question set in the new sample data , are input into the classifier as the features of the classifier, and the BC is the output of the classifier. The model is continuously updated through iterative training. The operation can be abstracted as the following function:

4. Results

4.1. Dataset

The dataset of the BCIM is collected from the randomly generated samples. The dataset can be organized into the following two parts:(1)Original Dataset. For these 60 questions in the CCMQ, the randomly generated algorithm is used to generate 1,000,000 different answers, and each answer is marked with the BC type according to the CCMQ’s judgment method. A detailed description of the data is shown in Table 1.(2)The Datasetfor Training the BCIM. The BCIM is constructed based on the BC combination. Nine BC types are combined into combinations containing three different BC types, resulting in a total of BC combinations. The feature selection algorithm is used to screen out the top k, questions with the highest importance score in each BC combination. These questions are combined with options and BC labels in the original dataset to obtain a dataset for training the BCIM. There are nine constitution labels in the dataset, including Balanced Constitution, Qi-deficient Constitution, Yang-deficient Constitution, Yin-deficient Constitution, Phlegm-dampness Constitution, Damp-heat Constitution, Stagnant Blood Constitution, Stagnant Qi Constitution, and Inherited Special Constitution.

4.2. Experimental Setup

Filter is a feature selection algorithm that focuses on the general characteristics of the data and independent of the classifier. According to the filter’s ability to score the features of each dimension, we use the chi-squared stats algorithm to evaluate the importance score of each combination of questions and screen them. Models based on linear discriminate analysis (LDA), the artificial neural network (ANN), k-nearest neighbor (k-NN), random forest (RF), and support vector machine (SVM) are constructed and run for identifying BC.

4.3. Evaluation Metrics

In order to evaluate the identification performances of the five aforementioned models, we use a 5-fold cross-validation method to train and evaluate the models. The new sample data of each combination are divided into five equal parts, taking 4/5 of the samples for the training set and the remaining for the test set. Each combination performs training five times and the test samples taken for each time do not overlap. Finally, the mean accuracy of five evaluation results is used as the accuracy of the model. In addition, we use the Macro-averaging (Macro-Precision, Macro-Recall, Macro-F Score) and Micro-averaging (Micro-Precision, Micro-Recall, Micro-F Score) as indicators to evaluate the multiclassification of the classifier. The classifier’s response time is also an evaluation indicator, denoting the calculation speed of the model.

4.4. Results and Analysis

The new sample data are used as training objects to compare the identification performances of the five models. We randomly select 10 BC combinations for comparison. To visually compare the performance of the classifier, we take the highest accuracy of the 5-fold as the accuracy of the model and extract the evaluation metrics data of this point. The performance is shown in Figure 3. The values obtained by LDA, ANN, and SVM models are all above 90%. This shows that identifying BC by feature selection and the classifier is feasible and effective. Notably, in the case of different BC combinations, we found that the number of features used to construct the BCIM with LDA is less than that used by other classifiers. The number of features represents the number of screened questions k.

In order to further show and analyze the performance and difference of the classifier, we select two BC combinations that consist of the most common BC types in the survey [33] as examples from the comparison result. The specific data are shown in Figure 4 and Table 2. The abscissas in Figure 4 represent the number of features used to build the fitness discrimination model. The ordinate indicates the accuracy of the BCIM.

As we can see from Figure 4, the accuracy of all models would change with the number of features k. Models built by LDA and ANN work best, and their accuracies tend to stabilize after reaching the peak. The model built by SVM is not as good as the former two and is affected by the BC type in the combination. The accuracy of the model built by k-NN shows a downward trend after reaching the peak. This is because the increase in the number of features k results in an increase in the number of objects contained in the model and a decrease in the frequency of the correct category of objects. The accuracy of the model built by RF is low and shows disorderly fluctuations. This might be caused by the small number of features extracted by RF, resulting in a small subspace and low variety. In conclusion, the classification effect of LDA, ANN, and SVM models reach the experimental expectation.

In order to further compare the performance of the classifiers, we extracted the points with the highest accuracy of each model (Table 2). The value k represents the number of features used to construct the model making the identification accuracy of the model highest. For the Macro-averaging and the Micro-averaging, in the BC combination of Balanced Constitution, Yang-deficient Constitution, and Qi-deficient Constitution, the model built by ANN gets the highest score. All its accuracy, Micro-Precision, Micro-Recall, Micro-F Score, Macro-Recall, and Macro-F Score are 99.90%, and its Macro-Precision is 99.89%. Models built by LDA and SVM rank second and third. In the BC combination of the Yang-deficient Constitution, Phlegm-dampness Constitution, and the Stagnant Blood Constitution, the model built by LDA has the highest score. All its accuracy, Micro-Precision, Micro-Recall, Micro-F Score, Macro-Recall, and Macro-F Score are 99.84%, and the Macro-Precision is 99.83%. Models built by ANN and SVM rank second and third, respectively. Compared with the performance of the model about response time, the results of the other four classifiers except KNN are acceptable. LDA is the fastest one, whose response time is 0.009 seconds and 0.006 seconds, which means that the projection direction chosen by LDA can well classify the training data, which enables quick judgment during the test. The model built by KNN takes the most time, reaching 47 minutes, which means that the calculation efficiency of KNN is low. We believe that this is determined by the operation characteristics of KNN, which needs to calculate the distance between samples to be tested and each feature, so the calculation time of the classifier increases with the number of features.

Taken together, the performances of models built by LDA and ANN are the best and fit the purpose of our study. Although there is a slight difference between these classifiers, we can see that our proposed method can be applied to most classifiers and has high generalization. In this study, LDA is the best method to construct the BCIM if the number of questions to be answered by the patient is taken into account (Figure 3(h)).

4.5. Performance Comparison

In order to verify the performance of the BCIM constructed by the chi-squared stats algorithm and LDA in clinical practice application, we select the image body constitution identification model (image-BCIM) [8] as a baseline and use the doctor’s judgment and judgment results of the CCMQ as reference standards. This comparison experiment compares the effect of adding inquiry on the accuracy of BC identification based on inspection. Besides, we also record the number of questions and the time of answering them, which will be compared with the CCMQ.

74 volunteers participated in this comparison experiment. During the experiment, according to image-BCIM (simulating the doctor inspection), image-BCIM + BCIM (simulating the doctor inspection and inquiry) and doctor’s judgment, three methods were run for identifying the BC type of the volunteers. Another 70 volunteers performed physical examinations only by filling out the CCMQ. One example is presented in Table 3 to show the actual comparison results. The table is divided into three parts. The first part is the identification result of the image-BCIM. The second part is data from the image-BCIM + BCIM, in which we adjusted the output range of the image-BCIM identification results to the first three, in addition, the BCIM outputs questionnaire questions, identification result, and the time spent on filling and identifying. The third part is the doctor’s judgment results. It should be noted that there will be multiple results in the doctor judgment, all of which are the real result [34]. The human body is often in a subhealthy state with multiple unbalanced constitutions at the same time. This is common in elderly or frail people [35]. Therefore, when the result of the image-BCIM or the image-BCIM + BCIM matches one of the judgments of the doctor, we take it as a correct one.

The comparison results with volunteers’ participation are shown in Tables 4 and 5. In order to enhance the accuracy and persuasiveness of results, the evaluation results are averaged by test results. The values after “±” indicate the standard deviation of test results.

Taking the identification result of the doctor as the reference (Table 4), the accuracy rate of the image-BCIM + BCIM is 77.4%, which is 25.8% higher than that of the image-BCIM. Compared with routine answer time and the number of questions (Table 5), patients had 68.3% fewer questions to answer than that of CCMQ and the time occupied by answering is reduced by 80.3%.

5. Discussion

Our results show that the body constitution identification model based on feature selection and classifier can simulate the doctor inquiry that pushes targeted questions to patients and the subjective patient feelings are collected by it to make further judgments on the patient’s BC. The accuracy of recognizing BC automatically can be improved by combining questionnaire data with image-BC identification. Meanwhile, when collecting subjective feelings, the BCIM pushed targeted scale questions to the patients, which reduced the number of questions and the time needed to complete the questionnaire, and the identification efficiency was higher than the CCMQ. Clinically, doctors need to consider the patient’s objective signs (such as imaging reports and biochemical indicators) and subjective feelings (from the doctor’s consultation) to diagnose disease. According to the comparison experiment results in Tables 4 and 5, we have reason to believe that the improvement in the accuracy of BC identification is due to a comprehensive analysis of the patient’s objective information and subjective feelings.

Simulating the doctor inquiry using the feature selection and the classifier can provide us with a comparatively accurate result. In practice, there are one or two key unbalanced constitutions that dominate people’s current health. These key constitutions will continue to change with people’s habits, the external environment, and the treatment process. When faced with a patient with a composite constitution, analyzing the key constitution that currently affects their health is key to treatment. Although the predicted result may be one of the true BC types of the patient, it provides us with a kind of opinion for reference, which helps doctors quickly pinpoint the key BC type that most affects the patient.

6. Conclusion

This paper adds questionnaire data to image identification and uses the CCMQ to collect subjective feelings of patients to make further judgments on BC. In order to collect the questionnaire data of the patients more fully and effectively, we propose a BC identification method based on a random generation algorithm, feature selection algorithm, and classification method. We combine the method with the image identification method to compare the accuracy of BC identification before and after the combination. The results show that our method can improve the accuracy of BC identification, effectively avoid the shortcomings of the CCMQ when collecting information and improve the efficiency of BC identification. Through the comparison experiment, we show that artificial intelligence technology in the field of medicine to achieve the level of clinical diagnosis, the collection, and comprehensive analysis of objective and subjective factors is essential. The data processed by the method in this paper include but are not limited to the problems in the CCMQ, which is of referential significance to other clinical observation scales related to physiological or pathological indicators of patients.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Disclosure

The funder did not have any role in the design of the study, analysis, interpretation of data, or writing of the manuscript.

Conflicts of Interest

The authors declare they have no conflicts of interest for this study.

Acknowledgments

The authors would like to thank all the participants for their invaluable time. This study was supported by China National Science Foundation (Grant nos. 60973083 and 61273363), Science and Technology Planning Project of Guangdong Province (Grant nos. 2014A010103009 and 2015A020217002), and Guangzhou Science and Technology Planning Project (Grant nos. 201604020179 and 201803010088), and Guangdong Province Key Area R&D Plan Project (2020B1111120001).

Supplementary Materials

Constitution in Chinese Medicine Questionnaire (CCMQ). (Supplementary Materials)