Comparison common equations for LDL-C calculation with direct assay and developing a novel formula in Iranian children and adolescents: the CASPIAN V study

Hypercholesterolemia is a common dyslipidemia that leads to atherosclerosis. It is proved that early stages of atherosclerosis begins in early stages of life. In several studies, widespread prevalence of dyslipidemia in children is reported. So, assessment of lipid profile in children and adolescence is necessary for early diagnosis of dyslipidemia. Laboratory methods for measuring LDL are not available and economical. So, in some laboratories Friedwald method is used to determine LDL level. But, the preciseness of this method is not acceptable. Further, the preciseness of this method was not assayed in children and adolescence. So, it seems that assaying the preciseness of different methods is necessary. The methodology of this work is on the basis of findings of the Caspian V study. This study was conducted in 30 provinces of Iran during 2015. The population of this work was rural and urban students aged 7–18 years old. The level of total cholesterol (TC), HDL, LDL, and TG were measured using laboratory methods. The average and variances values were determined for each group of data using SPSS. Further, LDL values were calculated with a new formula introduced in this work. A comparison was made between the new formula and the other methods. In the present study, we found that compare to four common formulas, Friedwald was the best equation to estimate LDL-C concentrations in Iranian children and adolescents and the new formula was the next accurate equation. The strongest correlation between Friedwald and the new equation was found for those with 15–18 years old. Considering the cut-off points of TG (100 mg/dL), we observed the strongest correlation between Friedwald equation and direct assay and the weakest one was for Ahmadi formula in subjects with either greater or lower TG concentrations. Furthermore, we found that Anandraja equation had the most sensitivity (89.5%), while the most specificity was dedicated to the new formula (98.9%).


Introduction
According to pediatric advisory groups, selective screening for dyslipidemia in children is recommended particularly in those aged 2-18 years old with parents with precholestrolemia or other risk factors such as obesity and smoking. As there is an association between hyperlipidemia and cardiovascular diseases, controlling lipid profile can be helpful for primary and secondary preventions [1]. Since the basis of atherosclerosis and cardiovascular diseases start at early childhood, paying attention to lipid status is appreciated [2].
One of the crucial parameters used for CVD risk assessment is the serum level of low density lipoprotein-Cholesterol (LDL-C) [3,4]. Various methods are used for measuring LDL-C concentrations. Although gold standard for LDL-C measurement is Ultracentrifugation following by betaquantification [5], it has several limitations. It is an expensive and time-consuming method that needs special equipments [6]. Therefore, it is not a common method for routine clinical measurements. Instead, other direct methods including homogenous assay techniques are usually used as well as various equations such as Friedewald, Chen and Anandaraja [7].
Although Friedewald formula is wildly used for reporting LDL-C, it features a fixed triglyceride (TG): very-low-density lipoprotein cholesterol (VLDL-C) ratio of 5:1. Accordingly, it cannot show the substantial inter-individual variability in TG: VLDL-C ratios [8]. Besides, the Friedewald equation is not applicable for those with fasting TG equal or higher than 400 mg/ dL, and often this equation underestimates LDL-C concentrations in subjects with TG equal to 150 mg/ dL. Another limitation of Friedwald formula is related to being 8 h overnight fasting that is usually difficult for children. Finding non-fasting measurement method with acceptable accuracy for LDL-C is practically preferred in children [9].
Several studies compared the amount of LDL-C concentration obtained from formulas with each other or with a direct measurement [6,[8][9][10][11][12][13]. However, it seems most studies examined adult populations. To the best of our knowledge, there is few studies in which common equations for LDL-C calculation were compared with direct assay in children and adolescents at national level.

Study population and sampling framework
The present cross-sectional study was conducted on a sub sample from the CASPIAN V study, a population-based study in Iran, on students aged 7 to 18 years old. To choose eligible individuals, multistage, cluster sampling method was used from 30 provinces in 2015. Details of sampling procedures was presented elsewhere [14]. Briefly, in each province, children and adolescent considering equal number for boys and girls stratified based on living place (urban/ rural) as well as the level of education (primary /secondary). To reach the calculated number of participants, multistage, stratified cluster sampling method was also applied in each province. The size of cluster was 10 (10 students with their parents). Of 14,400 students in the CASPIAN study, 3844 students were selected for biochemical measurements. It means 14 out of 48 clusters from each province were randomly selected for the current study.
In the first step, for eligible students and their parents, sufficient explanations regarding the purpose of the study and the procedures were provided. Then, written informed consent and verbal consent were obtained from parents and students, respectively. All assessments were performed for subjects who completed the written informed consent.
Health-care professional team asked characteristics of participants and completed all questionnaires at schools in a room, where away from busy classrooms and interviewing with at least one of students' parents.

Biochemical assessments
Eligible students with at least one of their parents were referred to the laboratory for biochemical tests. After 12 h overnight fasting, 6 mL venous blood sample was collected from students. All blood samples were centrifuged at 2500-3000×g for 10 min and then serum samples were aliquot and stored at − 70°C till measurement. Lipid profiles including TG, total cholesterol (TC), LDL-C and high-density lipoprotein-cholesterol (HDL-C) were measured using enzymatic method by Hitachi Auto Analyzer (Tokyo, Japan).

LDL-C calculation
Apart from the measurement of LDL-C in serum samples, the amount of LDL-C was calculated using 4 common formulas as represented in Table 1.
Statistical analysis was performed on the data obtained from this assessed population. Accordingly, a regression model was developed. The developed model is as follows: where x LDL-C , x TC , x TG , and x HDL-C are values of LDL-C, TC, TG, and HDL-C, respectively. It is worthwhile noting that this developed model was obtained from the subjects with TG < 100 mg/dL. Note that this regression model was extracted from data using SPSS software (SPSS Inc., USA). The mentioned equation was then examined and validated on children and adolescents with TG > 100 mg/ dL.

Statistical analysis
The correlation among equations and with a direct measurement was examined. Findings were reported in subjects with TG > 100 mg/dL and < 100 mg/dL, separately.
The coefficient of determination is an index for assessing the correlation between actual and predicted values. This index is calculated as follows: where x act. , x pred. , and n are the actual value, predicted value, and number of data, respectively. Considering 110 mg/dL as a cut-off, sensitivity and specificity for each formula in all participants as well as those with low (< 100 mg/dL) and high TG (> 100 mg/ dL) were presented. Youden index, (sensitivity + specificity)-100, was calculated for each in order to identify the best formula for LDL-C calculation in Iranian children and adolescents.

Results
Findings are presented for 3844 children and adolescents categorized based on gender, age and residential place. As depicted in Table 2, the frequency of subjects with TG > 100 mg/dL was greater than those with TG < 100 mg/dL in all age categories (higher than 70% for all). The percentage of boys with TG > 100 mg/dL was higher than girls (73.1 vs.71.4%). However, the difference between genders was not considerable. Classifications by residential place showed that participants lived in rural places had higher TG concentrations than whom resident in urban regions (73.5 vs.71.8%, respectively).
Based on Table 3, there were no significant differences in LDL-C concentrations obtained from formulas except Anandraja (p = 0.18) when subjects were classified based on the cut-off point of 100 mg/dL for TG.
In Table 4, the correlation between predicted formulas with each other and direct assay are provided. In general, Friedwald formula (r = 0.982) stood at the first rank for the correlation with direct assay and the second rank was dedicated to the new formula (r = 0.978). The lowest correlation was observed for Ahmadi formula (r = 0.553). In subjects with TG > 100 mg/dL, the strongest correlation was found between direct assay and Friedwald equation (r = 0.979) followed by the new formula (n =   0.974). The weakest correlation was observed for Ahmadi formula (r = 0.839). Similar findings were obtained for those with TG > 100 mg/dL. However, stronger correlation was obtained for Friedwald (r = 0.986) and lowest association was seen for Ahmadi formula (r = 0.553) compared to whom with TG < 100 mg/dL. As presented in Table 5, the strongest correlation between Friedwald and the new equation was found for those with 15-18 years old (r = 0.987), while the weakest correlation was related to Ahmadi formula for whom with 11-14 years old (r = 0.545).
Sensitivity and specificity of various methods are provided in Table 6. In general, we found that Anandraja equation had the most sensitivity (89.5%), while the most specificity was for the new formula (98.9%). Considering the Yuden index, Friedwald obtained the first rank (86.2%) that is followed by the new formula (84.2%). After classification by cut-off point of 100 mg/dL for TG concentrations, it was revealed that the most amount for Yuden index was for Friedwald for both categories (TG < 100 mg/dL: 86.6%; TG > 100 mg/dL: 85%).

Discussion
In the present study, we found that compare to four common formulas, Friedwald was the best equation to estimate LDL-C concentrations in Iranian children and adolescents and the new formula was the next accurate equation.
One of main identified potential risk factors for CVD and atherosclerosis in adulthood is high concentration of LDL-C. Research in children and adolescents has revealed that monitoring lipid profile status at young age  can be helpful to prevent CVD at adulthood [15]. Accordingly, studding on various methods to find the most accurate one can be helpful to reduce CVD events.
To the best of our knowledge, most studies on comparing formulas to estimate LDL-C concentrations have been conducted on adult populations [6,7,12,13,16,17]. Martin et al., examined four equations including Friedewald, Chen, de Cordova, and Hattori compare to direct measurement in hospitalized patients in South Africa. They found a favorable correlation between the de Cordova formula and Friedewald at low TG concentrations. However, the Hattori formula was the best equation to estimate LDL-C in hospitalized patients, even at extreme lipid values [13]. According to Wadhwa et al.,'s study, among 7 formulas, Friedewald, Cordova, Vujovic, Ahmadi, Anandaraja, Puavillai and Hattori, Vujovic formula was the most accurate one in Indian adult population [18]. Krishnavena et al., also reported that Friedwald correlated maximally with direct measurement of LDL-C at all levels of TG except at TG less than 100 mg/dL in an Indian adult population. They found that for subjects with serum levels of TG < 100 mg/dl, Anandaraja's Formula was the most accurate equation [19]. Different findings between our study and the aforementioned ones are likely to be due to differences in age range, race, and different estimation formulas.
Ahmadi et al., reported that in Iranian adult subjects with low TG concentrations and undesirably high TC, Friedewald equation may overestimate LDL-C. Therefore, they suggested a new formula for such subjects and named it as Admadi formula [20]. Although Ahmadi equation was developed based on Iranian adult populations [20], we found that it cannot be appropriate for children and adolescents and it showed the lowest correlation with direct measurement (r = 0.553). Accordingly, we can conclude that considering age range plays a crucial role on choosing an accurate estimation formula.
It seems only one study compared LDL-C formulas in subjects younger than 18 years old [9]. Garoufi et al., compared calculated LDL-C using Anandaraja and Friedwald formulas with directly measured LDL-C in 1005 healthy and dyslipidemic children (age range: 2-18 yrs. old) in Greece. They showed that using Friedwald formula, serum levels of LDL-C was lower than the measured value in 75.6% of healthy and in 77.3% of dyslipidemic children. They also found that Friedwald formula was more accurate screening tool compared to Anandaraja equation in healthy participants, while Anandaraja was more appropriate for following-up dyslipidemic children [9]. Our findings were in line with the mentioned study. In our study, Friedwald equation was the most accurate one. However, we did not do a classification based on LDL-C to compare healthy and dyslipidemic children. In addition, the correlation between Friedwald and direct assay was a little bit greater in our study compare to Garoufi et al'.,s study (0.98 vs. 0.97).
Although Frielwald formula has several limitations, it seems this formula is still the most accurate one compare to the other four formulas in our children and adolescent society.
Our study had several limitations. First, we did not use a reference method to measure LDL-C. Second, the comparisons were conducted only among four common formulas and we cannot make a decision regarding the accuracy of other estimation formulas. Third, we cannot clarify whether the new formula can be accurate for non-fasting measurements or not. However, the present study seems to be the first study to compare estimation LDL-C formulas among children and adolescents at national levels in Asia. We also compared estimation formulas for both lower and higher TG values. In addition, we developed and introduced a new formula with relatively similar accuracy to Friedwald on a representative sample of our children and adolescent society. In the present study, apart from correlations of equations with direct assay, sensitivity, specificity and the Yuden index for each were also reported.

Conclusion
It is concluded that Friedwald was the best equation to estimate LDL-C concentrations in Iranian children and adolescents and the new formula was the next accurate equation. In addition, Friedwald formula was the most accurate formula to estimate LDL-C in children and adolescent with either low or high TG values.