Introduction

Liver function tests, including assays for alanine aminotransferase (ALT), aspartate aminotransferase (AST), alkaline phosphatase (ALP), γ-glutamyltansferase (GGT), total protein (TP), albumin (ALB), total bilirubin (TBIL), and direct bilirubin (DBIL), are generally ordered as a “test of exclusion” in patients with non-specific symptoms or as part of routine health checks. In children, reference intervals of liver function tests should reflect the different phases of physiological development from birth to adolescence. However, appropriate age- and gender-specific pediatric reference intervals for most tests are often incomplete.

Reference intervals are basic dimension to explain laboratory tests and analyze tests information. Proper and accurate laboratory tests and reference intervals are vitally necessary for medical assessment and care of patients. Current guidelines define a reference interval as the interval between two values in which 95% of the results for apparently healthy individuals would fall, usually between the 2.5 and 97.5 centiles of the distribution of test results for a reference population [1].

For several years, there are few systematic reference interval studies based on Chinese people. The reference intervals now applied in Chinese laboratory are mainly based on European and American populations study tens of years ago. Until now, there are still no reference intervals about pediatric population. As a completely different group, children have different physical size, organ maturity, nutrition and metabolism status with adults. Thus, it is inappropriate for children and adult to use a common reference interval. Pediatric reference interval should reflect children’s different phases of physiological development from birth to adolescence which can profoundly influence the concentrations of liver function tests routinely measured in the clinical laboratory.

Recent studies including the Canadian Laboratory Initiative on Pediatric Reference Intervals (CALIPER) and reports from America, Sweden and Taiwan of China have added some novel information [2,3,4,5]. For example, the CALIPER has established a comprehensive covariate-stratified reference intervals based on a healthy nonhospitalized, and multiethnic pediatric population [3]. However, it is not appropriate to directly apply these reference intervals to Chinese pediatric population, because some parameters may vary significantly among different races and regions. Furthermore, several studies have decided to recruit participants who have been hospitalized or approximately “healthy” individuals due to the difficulties recruiting study participants [6,7,8]. Nevertheless, with a view to physical, nutritional, and developmental status, it is also not appropriate to apply these reference intervals to common pediatric population. Thus, taking into account finding from these studies, the present study will recruit children from communities and schools for conventional check-up before starting entering nursery, kindergarten, elementary school, and middle school. Moreover, taking into account other significant covariates of age, gender, region and ethnicity on pediatric reference intervals, this study presents age- and gender-specific pediatric reference intervals for liver function tests based on definitely healthy reference individuals in Changchun, China.

Methods

Subjects

From February 2015 to March 2016, healthy children aged from 2 to 14 years were recruited in this study. In order to obtain samples from healthy children, the recruitment of study population aged from 2 to 5 years took place in the community and health-care center (Lvyuan Maternal and Children Health Hospital) for common physical examination which national rules require before going to nursery. The children with age range of 6–14 years were enrolled in primary school (Lvyuan District Primary School of Changchun) and middle school (No. 78 Middle High School of Changchun). All the serum samples were collected after obtaining parental permission. All the potential participants or their parents completed a questionnaire including questions on diseases, prescribed and over-the-counter medication, the presence of fever, allergy and eczema, and a general question concerning subjective health. The demographic data included diet, exercise status, ethnicity, and body mass index parameters. Participants were excluded if they had a history of illness such as acute infection, metabolic or any system disease, surgery experiences within 6 months, use of prescription medications over the previous 2 weeks. The included children would be further removed according to the following testing criterion: positive in hepatitis B surface antigen, hepatitis c virus antibody, and human immunodeficiency virus antibody; creatinine above 120 μmol/L; creatine kinase above 500 U/L; uric acid above 475 μmol/L; glucose above 7.0 mmol/L; C reaction protein above 12.0 mg/L. Therefore, a total of 1394 individuals, 649 females and 745 males, were included.

This study was approved by the institutional ethics committee of the First Hospital of Jilin University. For all the children, their parents provided written informed consent and when possible, child subjects provided written assent.

Specimen collection and handling

All participants kept a normal diet and exercise in the last 3 days. Before blood collection in the morning, each participant fasted for at least 8 hours over night. Samples were collected in serum separator tubes (SST™; BD), then centrifuged, separated and aliquoted within 2 hours of collection. Excluding unqualified samples which were hemolysis, lipidemia, and jaundice, all qualified samples were sent to the laboratory for examination as soon as possible.

Instruments, reagents and methods

All 8 liver function tests were performed on Hitachi 7600-210 automatic analyzer (Hitachi High-Technologies, Tokyo, Japan). ALT (UV-LDH method), AST (UV-MDH method), GGT (l-γ-glutamyl-3-carboxy-4-nitranilide method), and ALP (AMP buffer method) were measured using reagents from Kehua bio-engineering company (Shanghai, China); TP (biuret reaction), ALB (bromocresol green method), TBIL, and DBIL (DCA method) were measured using reagents from DiaSys Diagnostics (Shanghai, China).

Quality control

Sample determinations were carried out in the Department of Laboratory Medicine at the First Hospital of Jilin University. The Laboratory was accredited to ISO 15189:2012 Medical Laboratories-Particular Requirements for Quality and Competence by China National Accreditation Service for Conformity Assessment in 2012. All the doctors, technicians, and nurses participating in the physical examination and sample analysis were through strict training. Hitachi 7600-210 automatic analyzer were calibrated per 12 months by the manufactures, so as to ensure the performance of detection system like precision, accuracy, analytical measurement range, clinical reportable range and carryover rate were controlled.

Validation of reference intervals

The established pediatric reference intervals are based on the population living in Changchun. As recommended by the Clinical and Laboratory Standards Institute (CLSI) C28-A3 [1], the reference intervals need to be assessed applicability for different testing conditions. We performed the validation in five representative hospitals located in different areas in Changchun including Lab 1 (The First Hospital of Jilin University), Lab 2 (The branch, First Hospital of Jilin University), Lab 3 (The Second Hospital of Jilin University), Lab 4 (The Fourth Hospital of Jilin University), and Lab 5 (The Pediatric Hospital of Changchun). For each subgroup of reference intervals 20 healthy participants were recruited regionally for each of the five hospitals. Routine instruments, reagents and methods were used to test samples from recruited individuals in each laboratory. For each reference subgroup, the results of analytes were validated with the newly established reference intervals. If no more than 10% of the results outside the limits, the established reference interval was considered applicable for the population.

Statistics analysis and determination of reference intervals

Statistical analysis was performed with EXCEL (Microsoft) and SPSS 21.0 (IBM) software and in accordance with CLSI C28-A3 guideline [1]. Briefly, the data were inspected using scatter and distribution plots; outliers were removed with the Dixon’s rule; distribution was identified with the Shapiro–Wilk Statistic. Age and gender partitions were first determined by visually inspecting the distribution and scatter plots for overall trends. It was then justified with Harris and Boyd’s test which is recommended currently by the CLSI [1]. According to Harris and Boyd’s method [9], we calculated Z values with the standard deviation and a modified Z-statistic for two groups to determine whether each group is sufficiently different statistically to its own grouping. If the results of Harris and Boyd’s test did not demonstrate partition, the two groups were then combined and revaluated. The reference interval of each group was calculated using the nonparametric method to calculate the lower and higher reference limits, respectively. For each reference interval, 90% confidence intervals were calculated for either ends.

Results

A total of 1394 samples from 649 males and 745 females (2–14 years) were used to calculate age- and gender-specific reference intervals, and the male: female ratio was 1:1.15. The 8 liver function tests in this study were measured on Hitachi 7600-210 automatic biochemical analyzer, along with the assay of detection, as specified in the manufacturer’s package insert. The cumulative frequency of each measured value for each partition was calculated. After removing outliers, the value for each partition followed Gaussian distribution. With the Harris and Boyd’s test, we observed that it was necessary to derive separate reference intervals by age and gender in children. For each reference interval, 90% confidence intervals were calculated for either ends. Table 1 shows the 2.5th and 97.5th percentiles for 8 liver function tests (serum enzymes, proteins, and bilirubins) of each age and gender group. Table 2 includes both the calculated age- and gender-specific reference intervals for respective analytes of combined groups and points for each interval. As Table 1 shows, most analytes required some amount of partitionings, by age, gender, or both. There were apparent age variations of the reference intervals for all 8 liver function tests. Interestingly, all four enzymes required no gender partition within the initial several age groups but required in the later age groups. However, bilirubins (TBIL, DBIL) and proteins (TP, ALB) displayed no gender partition in the whole groups. All the analytes required a minimum of 2 age-specific reference intervals. ALB only required two different age partitions: 2–10 years, and 11–14 years. However, AST required 7 different age- and gender-specific partitions: 2 years, 3–5 years, 6 years, 7–12 years (female), 7–12 years (male), 13–14 years (female), and 13–14 years (male).

Table 1 Age and gender specific 2.5th and 97.5th percentile results for 8 liver function tests in healthy children aged 2–14 years (n = 1394)
Table 2 Age- and gender-specific reference intervals for 8 liver function tests in healthy children aged 2–14 years (n = 1394)

Within these analytes, TP, TBIL, and DBIL showed steady increases, and AST showed apparent decreases in analytes concentration over time, whereas ALT, GGT, ALP, and ALB demonstrated complex trends of change in these time (Figs. 1, 2). ALT and GGT increased sharply in males from 11 to 14 years. However, ALP markedly declined in females from 13 to 14 years.

Fig. 1
figure 1

Trends in analyte concentrations of serum ALT, AST, GGT, and ALP in healthy males and females aged 2–14 years (n = 1394). Levels of serum ALT (a), AST (b), GGT (c), and ALP (d) in healthy children. Data are presented as mean ± SE. ALT alanine aminotransferase, AST aspartate aminotransferase, GGT γ-glutamyltransferase, ALP alkaline phosphatase, SE standard error

Fig. 2
figure 2

Trends in analyte concentrations of serum TP, ALB, TBIL, and DBIL in healthy males and females aged 2–14 years (n = 1394). Levels of serum TP (a), ALB (b), TBIL (c), and DBIL (d) in healthy children. Data are presented as mean ± SE. TP total protein, ALB albumin, TBIL total bilirubin, DBIL direct bilirubin, SE standard error

As Table 3 shows, all five laboratories passed the validation for each age and gender subgroup of pediatric reference intervals and all the validation data fell within the range of intervals for those 8 liver function analytes. The validation results indicate that the newly reference intervals are applicable in most representative laboratories in Changchun.

Table 3 The results of reference interval validation in 5 laboratories

Discussion

Age- and gender-specific pediatric reference intervals were established non-parametrically for 8 liver function tests assayed on Hitachi 7600-210 automatic analyzer, sampled from 1394 healthy children in Changchun. This is a significantly larger reference set than used in other studies of this type [10,11,12,13].

Defining pediatric reference intervals is one of the most difficult tasks for physicians because of the difficulties in collection of blood samples from a sufficient large number of healthy children. Such difficulties are reflected by the fact that recent studies obtained samples by collecting hospital-based results or using surplus serum that come from children attending selected out-patient clinics, such as dentistry, plastic surgery, orthopedics [6,7,8]. CALIPER initiative has ever established pediatric reference intervals derived from the analysis of outpatient clinics but deemed to be metabolically stable [14]. However, we can not certain whether nor not these samples were free of affecting the measured analytes parameters. In the present study, we collected direct samples from children who were accepting common physical-examination in community health-care centers, schools and fitted into by questionnaires and the following inclusion criteria. In some periods of life such as neonates (0–1 month old) and the first year of life (1 year old), the continuously developing physiology of growing changes their laboratory values. Unfortunately, data from the short phase after birth was lacking and puberty aged from 13 to 14 years was insufficient in the present study.

In the present study, age-related changes in analyte concentrations were observed more commonly than gender associated differences. These analytes showed multiple separated age-related reference intervals. The verifying reference intervals established in the current study reflected pediatric development and growth changes during childhood. Within the analytes studied, proteins (TP, ALB) and bilirubins (TBIL, DBIL) reported no gender partitioning. Therefore, only age is required. This finding is consistent with previous studies for 2–14 years pediatrics subjects [3, 7, 15].

Except for age-related intervals, four enzymes, ALT along with AST, GGT and ALP, required additional gender-stratified reference intervals. ALT and AST do not reveal a distinct gender dependency during the childhood, for ALT in males an obvious increase up from 11 years to juveniles can be seen (Fig. 1). This finding differs from a previous study which described reference intervals for ALT according to gender throughout 2–14 years old but no age-related difference [7]. For AST, a continuous decrease in concentration in children from 2 to 14 years was also observed and showed gender differences (Fig. 2) which was consistent with previous studies [3, 16, 17]. ALP showed gender difference from 13 years old and was different from previous studies [8, 18]. ALP has also been known to reflect height velocity [19]. In this study, the average age at peak height velocity is 11–12 years in females. A marked decrease of ALP in females aged 13–14 years may reflect a reduced height velocity. However, previous studies reported different age class of gender difference [7, 8, 18]. The deviations from study to study and area to area highlight the importance of determining ethnicity specific intervals.

Although previous studies indicated that the levels of serum TP and ALB differed between males and females in certain age groups [3, 20], our data did not show significant statistic difference in gender groups. Furthermore, considerable age partitioning was also required for the proteins (TP, ALB) as well as bilirubins (TBIL, DBIL) and enzymes (ALT, AST, GGT, and ALP). For each of these analytes, they required a minimum of two age-specific reference intervals. Within these groups of analytes, TP, TBIL, DBIL, AST and GGT showed steady increases or decreases in concentration throughout time, whereas ALT, ALB, ALP showed a complex age-related pattern of change (Figs. 1, 2). This marked changes and fluctuations in children during their growth and development highlight the importance of establishing age-specific pediatric reference intervals.

All five laboratories passed the reference interval validation for each liver function analyte in this study. The results suggest that the newly established pediatric reference intervals are valid for most of the children in Changchun and can be used directly by most of the hospitals with guaranteed accuracy. However, considering the different populations and nationalities from other areas in Jilin province, we have already started to recruit reference individuals from more areas in Jilin province for further validation. We aim to establish pediatric reference intervals for major population who under treatment in Changchun.

Although the partitions of reference intervals are required based on firm statistical analysis, it is unclear whether these statistic differences are medical relevant. Thus, the reference interval data need to be updated every few years as the reference population change with environmental and nutrition factors. Besides, the diversity of detection system could lead to differences of test results. In the present study, these intervals are Hitachi 7600 method specific. They might need to be transferred when used in any laboratory that requires pediatric reference intervals if they are transferable to different methods as recommended by CLSI C28-A3 [1]. Completion of these transferences will allow a broader application of reference intervals developed through our study. The new database may also be of rural benefit and likely be used by hospital laboratories in other cities, although intervals should also be validated with local populations and transferred with recommendation by the CLSI [1].

In summary, this study provided updated age- and gender-specific pediatric reference intervals for 8 basic liver function tests using a modern analytical chemistry platform and a large cohort study of healthy children. All analytes required age partitioning. Proteins (TP, ALB) and bilirubins (TBIL, DBIL) did not require gender. In contrast, considerable gender partitioning was required for ALT, AST, GGT, and ALP. The marked changes and fluctuations in children during their growth and development make it important to determine age- and gender-pediatric reference intervals and, therefore, enhance the diagnostic safety in clinical laboratory.