Background

Chronic kidney disease (CKD) in stage 5 also called kidney failure requires kidney replacement therapy (KRT) as part of the treatment, through peritoneal dialysis (PD), hemodialysis (HD), or kidney transplant [1, 2]. During the last decade, a steady increase in the prevalence of KRT for patients with stage 5 CKD has been documented in different countries worldwide, with HD being the most widely used KRT [3,4,5].

Patients on dialysis bear a significant burden of symptoms, experienced as part of the natural course of the disease, or concerning medication, dialysis, or dietary and lifestyle modifications, which are necessary as part of the treatment [6], with an impact on the quality of life (QOL) construct [7], and clearly on health-related quality of life (HRQoL) [8, 9]. The effect of CKD on QoL has been described in numerous studies, with some of the consistently reported findings being, (I) the decreased QoL in patients with CKD compared to the general population; (II) the progressive and significant worsening of QoL in relation to the progression of CKD; and (III) the improved QoL in kidney transplant recipients compared to patients on dialysis [10,11,12,13,14,15]. The role of the HRQoL construct as a significant predictor of morbidity and mortality in patients with CKD has also been widely documented [16,17,18,19,20].

The assessment of constructs, such as QoL, which, by their nature, cannot be assessed by conventional diagnostic tests, requires the use of patient-reported outcomes (PROs) consolidated into valid and reliable instruments called patient-reported outcome measures (PROMs) [21]. For the measurement of aspects related to the health status of CKD patients, numerous generic and disease-specific PROMs are currently available [22, 23]. The quality of the results obtained through a PROM will depend on the measurement properties with which the instrument quantifies the construct of interest in the target population [24]. That is why to use an instrument, it is necessary to carry out the processes of translation and cultural adaptation [25] if it has been originally developed in another population and to have evidence on the measurement properties of the instrument in the population of interest [24]. The Kidney Disease Quality of Life short form (KDQOL-SF) of 80 items and the Kidney Disease Quality of Life short form 36 (KDQOL-36) of 36 item [26, 27] are instruments with the best evidence of adequate clinimetric properties for measuring QoL in CKD patients [28, 29]. This has prompted the translation, cross-cultural adaptation, and validation for its use in different populations around the world [30,31,32,33,34,35,36].

Although the cross-cultural adaptation of the Spanish version of the KDQOL-36 carried out in Colombian patients is available [37], there is no validation of the instrument in the Colombian population to date. Considering the importance of assessing HRQoL in CKD patients on KRT, and the lack of an adequately validated instrument for doing so in Colombia, therefore, this study aimed to establish the clinimetric properties of the KDQOL-36 instrument in Colombian patients with stage 5 CKD on dialysis.

Methods

A scale validation study was conducted from the perspective of classical test theory (CTT).


Participants: Adult patients with a diagnosis of stage 5 CKD on dialysis attended in five renal units of the Baxter Renal Care Services® network in Bogota, Colombia. Patients were recruited by non-probabilistic, sequential, and convenience sampling, applying the following inclusion criteria: (I) being 18 years of age or older; (II) being Spanish speaking; and (III) having been in Colombia for the last 10 years. Patients with cognitive or sensory alterations that prevented the adequate application of the instrument were excluded. Sample size calculation was conducted for each of the components of the scale validation process. For the analysis of the validity of the proposed content using polychoric correlation methods, a sample of no less than 250 patients is suggested [38,39,40], so a total sample of 500 patients was considered, 250 in PD and 250 in HD. For the analyses of concurrent criterion validity, internal consistency, and test–retest reliability, the sample size calculations were performed using the PASS® statistical program, assuming a significance level of 5% and a power of 80%. For the concurrent criterion validity analysis, the estimated sample size was 70 patients, considering a population correlation coefficient of 0.4 for the null hypothesis (H0) and 0.5 for the alternative hypothesis (Ha) [41, 42]. For the internal consistency analysis, a sample size of 101 patients was estimated, taking Cronbach's alpha correlation coefficient values of 0.7 for the H0 and 0.8 for the Ha [43, 44]. For the reliability analysis using the test–retest method, assuming Lin’s concordance correlation coefficient (CCC) of 0.8 for the H0 and 0.9 for the Ha [45], the estimated sample size was 100 patients. For the proposed sensitivity to change analysis using a mixed model for repeated measures (MMRM), the sample size was calculated using the GLIMMPSE® program, taking into account the non-independence of three repeated measures over time and the distribution of patients in two dialysis modalities; considering a difference of at least 2 points and 0.5 points in the standard deviation in the score obtained between the different moments of measurement [46], the estimated sample size was 351 patients [47].


Instrument: The KDQOL-36 is a 5-point Likert scale with a generic core and a CKD-specific component. The generic core is measured by the 12-item Short Form Health Survey (SF-12), which consists of 12 items conducted to the physical and mental components, whose scores are converted into mean scores of 50 and standard deviations of 10, whereby values above 50 indicate a better health status than the reference population [48]. The reliability assessment and estimation of normative values of the SF-12 among Colombian adults are available for this country [49]. The CKD-specific component has 24 kidney disease targeted items, distributed in three domains: burden of kidney disease (4 items), symptoms and problems of kidney disease (12 items), and effects of kidney disease (8 items). Item 28, which is part of the symptoms and problems domain, has two wording options depending on the dialysis modality: 28a Hemodialysis patient only “Problems with your access site?” or 28b Peritoneal dialysis patient only “problems with your catheter site?”. The pre-coded numerical values for each item are linearly transformed to a range from 0 to 100, such that for each domain higher scores indicate a better level of HRQoL [27].


Statistical analysis: The sociodemographic and clinical data of the participants were analyzed by descriptive statistics, using percentages for categorical variables; and means or medians, with the respective standard deviation (SD) or interquartile range (IQR), for continuous variables.


Content validity: It was estimated by sequentially employing the statistical techniques of exploratory factor analysis (EFA) and confirmatory factor analysis (CFA), in two independent subsamples [50, 51]. For the EFA, Bartlett's test of sphericity and the Kaiser Meyer-Olkin test (KMO) were used to check the suitability of the correlation matrix for factor analysis [52, 53]. The number of factors to be analyzed was determined using the Kaiser criterion, the percentage of total variance explained, the eigenvalue sedimentation plot analysis, and parallel analysis [54,55,56,57,58,59]. To define the factor structure, factor loadings ≥0.3 were considered [60]. To the initial orthogonal solution, orthogonal (varimax) and then oblique (promax and oblimin) rotations were performed in order to select the solution with the best clinically interpretable model. For the CFA, considering the ordinal nature of the scale items, the estimation of the models was performed from a polychoric correlation matrix, using the weighted least squares (WLS) method [61, 62]. To assess the goodness of fit of the models, measures of absolute fit and incremental fit [63] were used, with the following specified values indicating adequate fit [64,65,66]: chi-square/degrees of freedom (X2/df; values <3), root mean square error approximation index (RMSEA; values <0.08), standardized root mean square error (SRMR; values <0.08), and values >0.9 for Comparative Fit Index (CFI), Incremental Fit Index (IFI), Tucker-Lewis Index (TLI) or Non-normalized Fit Index (NNFI), and Goodness-of-Fit Index (GFI).


Concurrent criterion validity: Through the concurrent application with another scale that measures the same construct. Considering that in Colombia there is no validated instrument for the evaluation of HRQoL in patients with stage 5 CKD on dialysis, it was considered necessary to perform the translation and cross-cultural adaptation for the Colombian population of the Kidney Disease Questionnaire (KDQ), an instrument also designed to measure the construct of QoL in patients with CKD on KRT, which consists of 26 items distributed in five domains: Physical symptoms (6 items), fatigue (6 items), depression (5 items), frustration (3 items), and relationships with others (6 items), the original version of which is in English [67, 68]. The translation and cross-cultural adaptation processes were performed following the recommendations suggested by the EORTC (European Organization for Research and Treatment of Cancer) Quality of Life Group [25]. Once the two instruments were applied, the scores for each of the domains of both instruments were calculated, the Shapiro–Wilk statistical test was given to determine whether the data had a normal distribution, and then, the Spearman correlation coefficients were calculated between the scores of the CKD-specific domains of the KDQOL-36 and the scores of the domains of the KDQ.


Internal consistency: It was performed by estimating four of the suggested coefficients: Cronbach's alpha, McDonald's omega, greatest lower bound (GLB), and Guttman's lambda [69,70,71], calculated for the CKD-specific core of the scale, for each of the three domains, and through an analysis with item removal.


Reliability: Using the test–retest method, the instrument was applied at a second time 8–10 days after the first test, during which time the HRQoL construct remained stable. Lin’s CCC was used, and the dispersion of the correlation and concordance was evaluated graphically using Bland and Altman's goodness-of-fit plots [45, 72].


Sensitivity to change: We compared the scores obtained in each of the domains of the CKD-specific component of the KDQOL-36 at three different moments of application of the instrument: (I) baseline; (II) when experiencing an event that could modify the HRQoL; and (III) once the event had ended, again considering the stability of the construct. For this purpose, MMRM were used, taking into account the presence of fixed and random effects, given by the non-independence of the three moments of application of the instrument and also by the distribution of patients in two dialysis modalities, with both between-subjects and within-subjects effects.

The CFA and the calculation of Cronbach's alpha, GLB, and Guttman's lambda coefficients were performed using the R programming language, through RStudio version 1.4.1106 using the libraries lavaan, psych, paran, polycor, and semPlot [73,74,75,76,77]. The remaining analyses of the validation and descriptive statistical component were performed with the STATA 17® program.

Results


Characteristics of the participants: The total sample included 506 patients with stage 5 CKD on dialysis, 50% on HD, and 50% on PD. In the total sample, 61% of the patients were male; the median age was 57.73 years (IQR = 43.75–67.21); the characteristics of the participants in the total sample and according to dialysis modality are shown in Table 1.

Table 1 Sociodemographic and clinical characteristics of the study population

Description of CKD-specific component of KDQOL-36 scores: The lowest scores, suggesting a greater compromise or decrease in quality of life, were observed in the burden of kidney disease domain with a median of 43.75 (IQR = 25–75), followed by the effects of kidney disease domain with a median of 75 (IQR = 53.12–87.5), and finally the symptoms and problems of kidney disease domain with a median of 81.25 (IQR = 68.75–91.66). Scores in each domain by dialysis modality are shown in the Supplementary material, Figure S1.


EFA: The total sample of 506 patients was divided by simple random sampling into two, “subsample 1” and “subsample 2,” each consisting of 253 patients including both dialysis modalities. EFA was performed in the first sample and CFA in the second sample. The results of Bartlett's test of sphericity (χ2 (276) = 2010.685; p = 0.000) and the KMO test (0.894) allowed us to conclude that the correlation matrix was suitable for factor analyses. Considering that, in the initial orthogonal solution, the first three factors were found to explain 93% of the variance and had eigenvalues greater than 1.0, the characteristics of the Cattell’s scree plot, and the parallel analysis with the principal factor method, the three-factor analysis was considered adequate. The factorial solution with the best clinical interpretability was the oblique rotation (promax) (Table 2). Factor one with five items that include aspects related to the perception of interference or burden of kidney disease in life; factor two with 11 items, that gather aspects related to the physical symptoms of the disease; and factor three with seven items, that include aspects related to the limitations or effects of kidney disease in daily life. Item 28 “problems with your access site?” or “problems with your catheter site?” showed the highest uniqueness value (0.96) and did not obtain an adequate factor load in any of the three domains.

Table 2 Factor structure of the CKD-specific core in the KDQOL-36 scale, oblique rotation (promax)

CFA: For this component, using subsample 2, the original model reported by the author [25] and the empirical model resulting from the EFA were evaluated. Figures 1 and 2 show the system of structural equations for both factor structures; the ovals represent the latent variables (domains), the squares represent the observed variables (items), the arrows in a single direction indicate the domain-item causal relationships, the arrows in two directions indicate the correlations between domains, and the arrows in dashed lines correspond to loadings that are set with a value of 1 to estimate the coefficients of the models. The goodness-of-fit estimators obtained for each of the models are presented in Table 3, with values for each of the estimators that indicate an adequate fit of both models and are very similar between them.

Fig. 1
figure 1

System of structural equations for the CKD-specific core of the KDQOL-36, original model

Fig. 2
figure 2

System of structural equations for the CKD-specific core of the KDQOL-36, empirical model

Table 3 Goodness-of-fit estimators for the evaluated models

Concurrent criterion validity: Instruments KDQOL-36 and KDQ were applied to 199 patients, 100 patients on HD, and 99 patients on PD. Across the total sample, for each of the three domains of the CKD-specific component of the KDQOL-36, all correlations were statistically different from zero, with values obtained from Spearman's correlation coefficient that were overall >0.50. The highest correlations (ranging from 0.62 to 0.75) were found in the KDQOL-36 burden of kidney disease domain, with the KDQ scores for depression, relationships with others, and frustration; and in the KDQOL-36 symptoms and problems of kidney disease domain, with the KDQ scores for physical symptoms and fatigue; correlations that are clinically plausible and do have an adequate interpretation. Coefficients according to dialysis modality are shown in Table 4.

Table 4 Spearman's correlation coefficients for the CKD-specific core of the KDQOL-36 scale

Internal consistency: The analysis of the CKD-specific component of the KDQOL-36 using the total sample of 506 patients resulted in similar values for the Cronbach's alpha, McDonald's omega, GLB, and Guttman's lambda coefficients, which were between 0.89 and 0.94, indicating a high level of consistency. Likewise, when the analysis was performed for each of the three domains, values for the four coefficients were identified in a range between 0.79 and 0.88. The values obtained for each coefficient in the total sample and according to dialysis modality are shown in Table 5. In the analysis with item removal using the total sample, no increase in Cronbach's alpha or Guttman's lambda coefficients was observed; however, a slight increase in the McDonald’s omega coefficient was observed when item 28 “problems with your access site?” or “problems with your catheter site?” was removed. In the analysis by KRT modality, when item 28 was removed, both in the sample of hemodialysis and peritoneal dialysis patients, a discrete increase in the level of consistency was observed for Guttman’s lambda and McDonald’s omega coefficients. The values of Cronbach's alpha, McDonald's omega, and Guttman's lambda coefficients obtained in the item removal analyses are shown in the Supplementary material, Tables S1, S2, and S3.

Table 5 Internal consistency coefficients for the CKD-specific core of the KDQOL-36 scale

Test–retest reliability: The KDQOL-36 instrument was applied at a second time to 200 patients, 100 patients on HD, and 100 patients on PD, with a median of 8 days between the two assessments (IQR = 8–10). The analysis of Lin's CCC both in the total sample and by KRT modality for each of the three domains, resulted in all cases in coefficients that were statistically different from zero, with values ≥0.7 (Table 6). In the Bland and Altman plots for each of the three domains, it is evident that the average difference between the first and second application is minimal, with a high level of agreement that remains stable for the entire measurement range of the instrument, being higher for the domain effects of kidney disease (Fig. 3).

Table 6 Lin's concordance correlation coefficients for the CKD-specific core of the KDQOL-36 scale
Fig. 3
figure 3

Bland and Altman limits of agreement for the CKD-specific core of the KDQOL-36 scale (On the y-axis, zero [y = 0] represents the line of perfect average agreement)


Sensitivity to change: The KDQOL-36 instrument was applied at three different times: (I) baseline; (II) when experiencing an event that could modify the quality of life; and (III) once the event was over, again considering the stability of the construct, to 351 patients, 92% in hemodialysis (n = 324 patients) and 8% in peritoneal dialysis (n = 27 patients). The scores obtained in each domain at the three time points are shown in the Supplementary material, Table S4. The analysis using MMRM resulted for each of the three domains in statistically significant differences between the scores obtained with the application of the instrument at different points in time, which shows the instrument's capacity to detect changes in the measurement of the construct as it changes. Coefficients, 95% confidence intervals, and p values obtained for each of the three domains are shown in Table 7. Supplementary material, Table S5, shows the values obtained in the pairwise comparisons by dialysis modality.

Table 7 Repeated-measure mixed models for the CKD-specific core of the KDQOL-36 scale

Discussion

Given the need to incorporate QOL as a health outcome during the care of patients with stage 5 CKD on dialysis [78, 79], it is necessary to have evidence of the psychometric properties of the instruments used to measure this construct in each target population [24, 25]. Likewise, it is crucial to generate evidence on the use of more advantageous, complementary, and widely recommended statistical methods to further advance and improve the quality of studies on the measurement properties of PROMs.

The sociodemographic and clinical characteristics of the study population are consistent with the data presented in the latest report published by the Colombian Fund for High-Cost Diseases on the status of CKD in Colombia [5], which additionally reports Bogota as the region with the highest estimated prevalence of KRT in the country, suggesting an adequate representativeness of the study population.

The validity of the instrument was adequate, with evidence of content and concurrent criterion validity. The EFA confirmed the factorial structure proposed in the original instrument for the CKD-specific core of the KDQOL-36, with three factors or domains regarding the burden of kidney disease, physical symptoms, and effects of the disease. For item 28 “problems with your access site?” or “problems with your catheter site?”, no adequate factor loading was found in any of the three factors, with a high uniqueness value; additionally, a discrete increase in the level of consistency was found when it was removed, suggesting that this item could be measuring an aspect other than burden, physical symptoms or effects of the disease, as part of the QoL construct. Despite using a less conservative factor loading threshold than the one used in the present validation, this same finding was evident in the validations carried out in Chinese patients [31], and in Malaysia [35], in which the loading of this item 28 on any of the three identified factors was also not reported. The CFA, performed on an independent subsample, supported the structure of the three factors or domains mentioned, with an adequate fit of the original and exploratory models, finding adequate values for each of the estimators, which were very similar between the two models. Regarding the concurrent criterion validity, despite the differences in the structure and number of items between the CKD-specific core of the KDQOL-36 and the KDQ, an adequate correlation was found between the domains of both instruments, with values of Spearman's correlation coefficient overall ≥0.50. This finding is consistent with what was reported in the validations of the instrument carried out in Arabia, Malaysia, and Ethiopia, which used evidence of different types of validity, such as convergent construct validity and discriminant construct validity [34, 36].

The instrument was reliable, showing evidence of internal consistency and test–retest reliability. In the internal consistency analysis, values for Cronbach's alpha, McDonald's omega, GLB, and Guttman's lambda coefficients were found that indicate a high level of consistency of the three CKD-specific domains of the KDQOL-36. This finding is consistent with what was reported in the studies carried out to validate the instrument in dialysis patients in Thailand [30], China [31], the United States [32], Indonesia [33], Arabia [34], Malaysia [35] and Ethiopia [36] in which Cronbach's alpha coefficient was used as the only measure of internal consistency. It is worth mentioning the discrete increase in the McDonald's omega and Guttman's lambda coefficients when item 28 “problems with your access site?” or “problems with your catheter site?” was removed. Regarding test–retest reliability, the calculation of Lin's concordance correlation coefficient for each of the kidney disease-specific domains of the KDQOL-36, and its corresponding graphical analysis of Bland and Altman limits, allowed us to confirm the stability of the measurements obtained with the instrument at two separate moments in time, considering that the construct remained stable. This finding is consistent with what was reported in validation studies of the instrument carried out in dialysis patients in Thailand [30], China [31], Arabia [34], Malaysia [35], Indonesia [33], and Ethiopia [36] in which evidence of test–retest reliability was generated, although its estimation was carried out using the intraclass correlation coefficient (ICC) or Pearson correlation coefficient. Concerning sensitivity to change, statistically significant differences were found between the scores obtained with the application of the instrument when experiencing an event that could modify the quality of life, which corroborated the instrument's capacity to detect changes in the measurements of the construct as it varies.

The validity and reliability findings of the CKD-specific core of the KDQOL-36 scale in the Colombian population using more advantageous, complementary, and widely recommended statistical methods are consistent with the data presented in validation studies conducted in other countries [30,31,32,33,34,35,36]. Additionally, we observed findings of adequate sensitivity to change of the CKD-specific core of the KDQOL-36 scale in the Colombian population, that to the best of our knowledge, at the time of this work, none of the validation studies of the CKD-specific core of the KDQOL-36 scale have included evidence of this psychometric property of the instrument.

A possible limitation of the present study is the small sample size that was possible to obtain for the evaluation of sensitivity to change in PD patients. For further studies of this instrument, we propose the evaluation of sensitivity to change in a larger sample of patients on PD. Also, it is important to evaluate the additional psychometric properties from the perspective of item response theory, such as item- and person-fit indexes, the evaluation of person and item reliability, and the analysis of the coverage of the construct spectrum with the items of the scale.

Conclusions

The findings from this study allow us to conclude that the KDQOL-36 scale is an instrument with adequate validity, reliability, and sensitivity properties to measure the construct of quality of life in Colombian patients with stage 5 chronic kidney disease on dialysis.