Comparing methods for risk prediction of multicategory outcomes: dichotomized logistic regression vs. multinomial logit regression

doi:10.21203/rs.3.rs-3911212/v1

Download PDF

Research Article

Comparing methods for risk prediction of multicategory outcomes: dichotomized logistic regression vs. multinomial logit regression

https://doi.org/10.21203/rs.3.rs-3911212/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Background

Medical outcomes of interest to clinicians may have multiple categories. Researchers face several options for risk prediction of such outcomes, including dichotomized logistic regression and multinomial logit regression modeling. We aimed to compare these methods and provide practical guidance needed.

Methods

We described dichotomized logistic regression and competing risks regression, and an alternative to standard multinomial logit regression, continuation-ratio logit regression for ordinal outcomes. We then applied these methods to develop prediction models of survival and growth outcomes based on the NICHD Extremely Preterm Birth Outcome Tool model. The statistical and practical advantages and flaws of these methods were examined and both discrimination and calibration of the estimated models were assessed.

Results

The dichotomized logistic models and multinomial continuation-ratio logit model had similar discrimination and calibration in predicting death and survival without neurodevelopmental impairment. But the continuation-ratio logit model had better discrimination and calibration in predicting probabilities of neurodevelopmental impairment. The sum of predicted probabilities of outcome categories from the logistic models did not equal 100% for about half of the study infants, ranging from 87.7% to 124.0%, and the logistic model of neurodevelopmental impairment greatly overpredicted the risk among low-risk infants and underpredicted among high-risk infants.

Conclusions

Estimating multiple logistic regression models of dichotomized outcomes may result in poorly calibrated predictions. For an outcome with multiple ordinal categories, continuation-ratio logit regression is a useful alternative to standard multinomial logit regression. It produces better calibrated predictions and has the advantages of simplicity in model interpretation and flexibility to include outcome category-specific predictors and random-effect terms for patient heterogeneity by hospital.

Risk prediction

Multicategory outcome

Discrimination

Calibration

Competing risks

Preterm infant (limit: 3–10)

Multivariable risk prediction models are routinely used by healthcare providers in patient counseling and clinical decision-making. The outcomes of these models are often binary and the algorithm is typically based on logistic regression. While outcomes of many medical conditions can have more than two categories, they may be dichotomized by combining multiple categories together and modeled using logistic regression. For an outcome of death, illness or illness-free survival, for example, a single category, illness-free survival, or a combined category, death or illness, may be of interest and modeled. Although multinomial logit models can simultaneously predict probabilities of multiple outcome categories and thus have the advantage of avoiding loss of detailed information, they are not known to have superior predictive performance.

Few studies have compared predictive performance of logistic models and multinomial logit models. Biesheuvel et al. and Roukema et al. assessed model discrimination and did not find a meaningful difference [1, 2]. More recently, Van Calster and McLernon et. al. argued that model calibration performance should not be overlooked and poor calibration may make a prediction model clinically useless or even harmful [3]. A study by Edlinger et al. focused on calibration performance of alternative multinomial models for ordinal outcomes but did not compare with that of logistic models of dichotomized outcomes [4].

In this paper, we describe alternative methods for modeling multicategory outcomes. Using data on mortality and neurological development among extremely preterm infants, we develop logistic and multinomial logit risk prediction models and assess both model discrimination and calibration. We also compare their statistical advantages and flaws, and differences in model interpretation.

We consider risk prediction for a multicategory outcome among patients admitted into a variety of hospitals. Let Y_si indicate an outcome with J categories of the ith patient in the sth hospital, and X_si1 – X_si5, for instance, be five predictor variables selected for inclusion in the model. With data collected on patients in multiple hospitals, patient heterogeneity by hospital can cause poor predictive performance [4, 5]. We try to add hospital random-effect terms in our models to account for hospital-level variation in outcomes.

Dichotomized logistic regression

Let the probability of outcome category j be π_si(j) = Pr{Y_si = j}. A logistic regression model can be estimated for each outcome category,

$$\text{logit}\left({\pi }_{si}\left(j\right)\right)={{{\beta }_{0j}+\beta }_{1j}X}_{si1}+{{\beta }_{2j}X}_{si2}+{{\beta }_{3j}X}_{si3}+{{\beta }_{4j}X}_{si4}+{{\beta }_{5j}X}_{si5}+{a}_{s}, j=1,\dots , J$$

where the intercept b_0j and coefficients b_1j – b_5j are parameters to be estimated. We further assume that the hospital random-effect term a_s follows a Normal distribution with zero mean and a constant variance. A drawback of this method is that sum of predicted probabilities over all outcome categories for a patient is not constrained to 100%.

Multinomial continuation-ratio logit regression

As an extension of logistic regression, a standard multinomial logit model simultaneously fits J-1 logit models of each outcome category relative to a fixed reference category and constrains the sum of all predicted probabilities to 100%, that is, $\sum _{j=1}^{J}{\pi }_{si}\left(j\right)=1$. A limitation of this method is that the use of a same reference category and the inclusion of random-effect terms can make model estimation and interpretation difficult [6].

For an outcome with ordered categories, it is preferable to use alternative forms of multinomial logit models that exploit ordinal nature of the outcome categories [7]. We model the sequentially defined conditional probability in the jth category or higher, π_si(j|Y_si ≥ j) = Pr{Y_si = j | Y_si ≥ j}. The continuation-ratio logit models are of the following form,

$$\text{Logit}\left({\pi }_{si}\left(j|{Y}_{si}\ge j\right)\right)={{{\beta }_{0j}+\beta }_{1j}X}_{si1}+{{\beta }_{2j}X}_{si2}+{{\beta }_{3j}X}_{si3}+{{\beta }_{4j}X}_{si4}+{{\beta }_{5j}X}_{si5}+{a}_{si},$$

$$j=1,\dots , J-1$$

We also assume that the random-effect terms (a_s1,…,a_s(J−1)) jointly follow a multivariate Normal distribution with zero means [8]. Various forms of the variance-covariance matrix may be specified to represent the correlation structure among the continuation-ratio logits. A simple diagonal form, for example, indicates independent random effects.

Logistic competing risks regression

Competing-risk bias is often a concern in dichotomized logistic regression estimation and to overcome this bias composite outcomes combining competing-risk categories such as illness or death are commonly used as study endpoints [9, 10]. A statistical method developed to model time-to-event data adjusting for competing risks, logistic competing risks regression, can be potentially useful [11]. Let T_sij be time to the occurrence of event j of patient i in hospital s, the cumulative incidence function by a preset time t for an event of interest, say j = 1, is then defined as F_si1(t) = Pr{T_si1 < = t}. The logistic competing risks regression fits a model of binary outcome of the occurrence of the event by time t,

$$\text{logit}\left\{{F}_{sij}\left(t\right)\right\}={{{\beta }_{0j}\left(t\right)+\beta }_{1j}X}_{si1}+{{\beta }_{2j}X}_{si2}+{{\beta }_{3j}X}_{si3}+{{\beta }_{4j}X}_{si4}+{{\beta }_{5j}X}_{si5}$$

A nice feature of this model is that the coefficients can be interpreted in terms of odd ratios.

Patient outcomes and predictor variables

We obtained data on 3927 infants who were born extremely preterm in 19 hospitals in the U.S. and enrolled at birth into an observational study [12]. These infants did not have major congenital anomalies and received postnatal intensive care. All the surviving infants completed assessments of neurodevelopmental impairment (NDI) at a single timepoint of 22–26 months’ age corrected for prematurity [13]. NDI is a comprehensive measure of child development based on structured physical examinations and functional assessments. Informed consents were obtained for all infants at hospitals that required parental consent.

For simplicity, we created an outcome with three ordered categories, death, survival with NDI, or survival without NDI (NDI-free survival), and selected five predictor variables, birth weight and gestational age, sex, singleton birth, and exposure to antenatal corticosteroids. These variables have been previously included in the widely used NICHD Extremely Preterm Birth Outcome Tool model [14, 15]. The birth weights and gestational ages of the infants ranged from 401 to 1000 grams (mean: 675 grams) and 22 to 25 weeks (22–23 weeks: 21%), 47% were female, 74% were singleton births, and 85% received antenatal corticosteroids.

Estimated models

We used SAS PROC GLIMMIX to fit random-effect logistic models and continuation-ratio logit model and the R package riskRegression to fit logistic competing risks model [11, 16]. We should note that the original patient-level data file should be re-structured such that a patient could have as many as J – 1 records stacked together for the estimation of continuation-ratio logit model, and age in days at death or date of NDI examination was used as time to event of interest for the estimation of logistic competing risks model.

The estimated odds ratios of the predictor variables and the variances of the random hospital effects from three separate logistic models of dichotomized outcomes, death (vs survival), NDI (vs death or NDI-free survival) and NDI-free survival (vs death or NDI) and a multinomial continuation-ratio logit model that jointly predicts the probabilities of death (vs survival) and NDI (vs NDI-free survival) are presented in Table 1. The predictor variables showed similar effects on death but very different effects on NDI. Notably, antenatal corticosteroid exposure had a significant and positive effect on NDI in the logistic model and a significant but negative effect on NDI in the continuation-ratio logit model. Also, the large variance estimates of the random hospital effects relative to their standard errors in these models suggested significant differences in outcomes among hospitals.

Table 1

Estimated odds ratio (95% CI) and variance (SE) of random hospital effect from logistic models of dichotomized outcomes and continuation-ratio logit model
	Logistic models of dichotomized outcomes			Continuation-ratio logit model
Predictor variable	Death	NDI	NDI-free Survival	Death	NDI among surviving infants
Birth weight (100 grams)	0.66 (0.62–0.71)	1.05 (0.98–1.12)	1.45 (1.36–1.56)	0.66 (0.62–0.71)	0.80 (0.74–0.87)
Gestational age (weeks)
22–23	2.76 (2.22–3.43)	0.61 (0.48–0.78)	0.43 (0.33–0.55)	2.59 (2.09–3.22)	1.23 (0.91–1.68)
24	1.46 (1.23–1.72)	0.98 (0.83–1.16)	0.72 (0.61–0.85)	1.46 (1.24–1.72)	1.20 (0.99–1.46)
25	1.00	1.00	1.00	1.00	1.00
Female	0.57 (0.50–0.66)	0.97 (0.84–1.12)	1.85 (1.60–2.15)	0.58 (0.50–0.67)	0.66 (0.55–0.78)
Singleton birth	0.86 (0.74–1.01)	1.05 (0.89–1.24)	1.13 (0.95–1.34)	0.85 (0.73–1.00)	0.97 (0.79–1.18)
Antenatal corticosteroids	0.53 (0.43–0.65)	1.28 (1.03–1.60)	1.73 (1.35–2.21)	0.52 (0.42–0.63)	0.65 (0.49–0.87)
Hospital variance	0.165 (0.065)	0.126 (0.051)	0.294 (0.110)	0.147 (0.055)	0.213 (0.064)

Considering NDI an outcome category of interest and death a competing risk with NDI, we estimated a logistic competing risks regression model of NDI and a logistic model of composite outcome of NDI or death. The estimation results are presented in Table 2. We can see that the odds ratios of the predictor variables from the logistic competing risk model were quite close to those from the logistic model of NDI and the odds ratios from the logistic model of NDI or death were quite close to those from the logistic model of death.

Table 2

Odds ratio (95% CI) from models of competing-risk outcome categories
	Logistic competing risk model	Logistic model of composite outcome
Predictor variable	NDI(ref: Death or NDI-free)	NDI or Death (ref: NDI-free)
Birth weight (100 grams)	1.06 (0.98–1.15)	0.69 (0.64–0.74)
Gestational age (weeks)
22–23	0.58 (0.43–0.78)	2.34 (1.81–3.02)
24	0.98 (0.80–1.20)	1.38 (1.17–1.64)
25	1.00	1.00
Female	0.96 (0.80–1.16)	0.54 (0.47–0.63)
Singleton birth	1.03 (0.83–1.29)	0.89 (0.75–1.05)
Antenatal corticosteroids	1.21 (0.93–1.59)	0.58 (0.45–0.74)

Model predictive performance

We computed the Brier scores and C-statistics to assess discrimination and general validity. To correct for statistical optimum we generated 200 bootstrap samples drawn with replacement from the model predicted probabilities [17, 18]. Four increasingly stringent levels of calibration have been suggested for measuring model calibration, mean, weak, moderate, or strong calibration [3]. We assessed model calibration at the first three levels using means and ranges of predicted probabilities, calibration intercepts and slopes and calibration plots.

Measures of predictive performance of the logistic models and the continuation-ratio logit model are summarized in Table 3. The similar Brier scores and C-statistics indicate similar overall model validity and discrimination. The large C-statistics (> 0.7) for death and NDI-free survival suggest equally satisfactory discrimination, but the lower and slightly different C-statistics for NDI, 0.623 for the logistic model and 0.637 for the continuation-ratio logit model, suggest less satisfactory discrimination. The means of the predicted probabilities of death, NDI and NDI-free survival also are nearly same, indicating similar calibration. But the predicted probabilities of NDI from the logistic model had a slightly narrower range (8.5% − 48.8% vs. 6.6% − 52.1%). A more notable difference, however, is that the sum of all predicted probabilities from the logistic models did not equal 100% for about half of all study infants, ranging from 87.7–124.0%, but the sum from the continuation-ratio logit model equaled 100% for all study infants. The calibration intercepts and slopes were similarly close to zero and one for death and NDI-free survival, but slightly greater than zero and one for NDI.

Table 3

Measures of model predictive performance
Outcome category	Dichotomized logistic models	Continuation-ratio logit model
	Overall validity: Brier score/corrected on 200 bootstrap samples
Death	0.199/0.203	0.199/0.202
NDI	0.194/0.196	0.192/0.194
NDI-free survival	0.186/0.189	0.186/0.188
	Discrimination: C-statistics (95% CI)/corrected on 200 bootstrap samples
Death	0.738 (0.722–0.754)/0.729	0.738 (0.722–0.753)/0.729
NDI	0.623 (0.604–0.643)/0.606	0.637 (0.618–0.656)/0.619
NDI-free survival	0.730 (0.714–0.746)/0.720	0.730 (0.713–0.746)/0.721
	Calibration: mean of predicted probability (range)
Death	40.3 (3.7–91.6)	40.3 (4.0–91.2)
NDI	28.1 (8.5–48.8)	28.1 (6.6–52.1)
NDI-free survival	31.5 (1.5–81.0)	31.6 (0.8–78.9)
Sum over all categories	100.0 (87.7–124.0)	100.0 (100.0–100.0)
	Calibration intercept/slope
Death	0.009/1.026	0.020/1.051
NDI	0.121/1.133	0.164/1.184
NDI-free survival	0.023/1.035	0.045/1.076

We further assessed model calibration by exploiting the fact that the predicted probabilities for each patient from the logistic regression models did not add up to 100%. We divided all the infants into decile groups by the sums of their predicted probabilities and calculated the means of the model predicted probabilities. In Fig. 1, we can see that the means of the predicted probabilities of NDI from the continuation-ratio logit model tended to track the observed rates more closely. But those from the logistic model were much higher than the observed rates at the lower end of the observed rates and much lower than the observed rates at the higher end of the observed rates. We noted that infants in the three groups with the lowest observed rates had sums of the predicted probabilities greater than 100% and infants in the three groups with the highest observed rates had sums of the predicted probabilities less than 100%. The mean predicted probabilities of death from the continuation-ratio logit model and those from the logistic model nearly overlapped. They agreed well with the observed rates. We also compared calibration plots of the predicted probabilities among infants whose sums of the predicted probabilities did not equal 100% in Fig. 2. The predicted probabilities of NDI from the logistic model had a smaller ratio of the predicted to the observed (0.872 vs 0.940) and a larger calibration intercept (0.206 vs 0.094).

Because the estimated competing risks model had odds ratios close to those from the logistic model of NDI and the estimated logistic model of the composite outcome of NDI or death had odds ratios that were the inverse of the logistic model of NDI-free survival, they should also have similar discrimination and calibration. Additionally, we computed C-statistics of the logistic model of the composite outcome for predicting death or NDI alone. Prediction of death (AUC = .719) was moderate, but prediction of NDI was poor (AUC = .485).

Risk prediction models are important tools in clinical decision-making and prognosis often takes the form of multiple categories. We have compared two commonly used methods for modeling multicategory outcomes, dichotomized logistic regression and multinomial logit regression, in an application of predicting mortality and neurodevelopmental impairment among extremely preterm infants. Because the outcome has three ordinal categories, we also used an alternative multinomial logit model, continuation-ratio logit model.

We assessed both discrimination and calibration of the estimated models. Consistent with the findings by Biesheuvel et al. and Roukema et al. [1, 2], our results showed that the logistic models and continuation-ratio logit models had similarly satisfactory discrimination in predicting death and survival without neurodevelopmental impairment. These models also had similar calibration as measured by the average predicted probabilities and by calibration intercepts and slopes. However, the sum of all predicted probabilities from the logistic models for each infant ranged from 87.7–124.0%. We found that the logistic model of neurodevelopmental impairment had slightly smaller C-statistics and among infants whose sum of all predicted probabilities did not equal 100% it had worse calibration.

To overcome potential bias due to death as a competing risk, we applied an extension of logistic regression method, logistic competing risks regression, to develop a prediction model of neurodevelopmental impairment. Because time to diagnosis of NDI was determined only at one fixed time, 22–26 months’ age corrected for prematurity, the estimated odds ratios for predictor variables were close to those in the logistic model of neurodevelopmental impairment. We also estimated a logistic model of composite of neurodevelopmental impairment or death and showed that it could not be used for predicting neurodevelopmental impairment. Competing risks are not only of statistical interest, but also can be of substantive interest. In pediatric research, for example, it is increasingly concerned how the risk and burden of illness among extremely preterm infants are changing with improved survival [19]. Further investigation into statistical methods for modeling competing risks and collection of more detailed data on event time will be needed.

Constraining sum of all predicted probabilities of outcome categories for each patient to 100% and accommodating competing risks are important considerations in the validation of prediction models for multicategory outcomes. Additionally, there are other statistical and practical issues that should be considered. We prepared a list of these issues for comparison between dichotomized logistic and multinomial logit regression in Table 4. In general, simplicity in model interpretation facilitates acceptance and usage of a model by clinicians. Flexibility in model fitting to allow outcome category-specific predictor variables helps avoid statistical overfitting and including random-effect terms to accommodate patient heterogeneity by hospital improves model calibration [3].

Table 4

Comparison of predictive modeling methods on other statistical and practical issues
	Methods for risk prediction of multicategory outcomes
Issues to consider	Dichotomized logistic regression	Continuation-ratio logit regression	Logistic competing risks regression
Interpretation of predictor effects	Odds ratio	Conditional odds ratio dependent on ordered outcome category	Odds ratio
Constrains sum of all predicted probabilities to 100%	No	Yes	No
Allows inclusion of random-effect terms	Yes	Yes	No/Robust variance estimates
Allows inclusion of outcome category-specific predictor variables	Yes	Yes	Yes
Accommodates competing risks	No	Yes	Yes
Availability in statistical software	SAS, Stata, R	SAS, Stata, R	R

Both logistic regression and logistic competing risks regression produce odds ratio estimates for predictor variables but have the flaw that sum of all predicted probabilities of outcome categories for each patient is not constrained to 100%. Logistic regression also has the advantages of allowing for outcome-category specific predictor variables and random-effect terms, and wide availability of statistical programs for model estimation. Logistic competing risks regression accounts for competing risks but does not allow the inclusion of random-effect terms for patient heterogeneity and requires time-to-event data.

Multinomial logit regression constrains sum of all predicted probabilities of outcome categories for each patient to 100%. But a standard multinomial logit model has some known limitations, including difficulty to explain the prediction results to clinicians or patients due to the use of a fixed reference category, lack of flexibility to allow for outcome category-specific predictors and complications caused by the inclusion of random-effect terms. As an alternative for ordinal outcome, we estimated a continuous-ratio logit model to predict the probability of death and the probability of neurodevelopmental impairment conditional on surviving. This addressed the need of clinicians and patients for separate information on death and impairment, which could be valued differently in their decision about treatment options. It also afforded us the statistical benefits of including random-effect terms and outcome category-specific predictor variables of neurodevelopmental impairment in the model. The infant outcomes in our data have been found to vary significantly across hospitals after controlling for infant characteristics [15]. To improve the modest model performance in predicting neurodevelopmental impairment, we hope to be able to add more variables predictive of this outcome in the future [20, 21].

A multicategory outcome is often dichotomized and modeled using logistic regression in studies developing prediction models. Because a single outcome category is often of interest, the shortcomings of this method have not received much attention. Although logistic models and multinomial logit models may have similar predictive performance, logistic models do not constrain predicted probabilities of all outcome categories to 100% for a patient and can produce poorly calibrated predictions. We recommend the use of various alternative forms of multinomial logit models such as continuation-ratio logit models for ordinal outcomes, which allow for the accommodation of patient heterogeneity by hospital and the inclusion of outcome category-specific predictors. To overcome competing-risk bias among outcome categories, modeling composite of outcome categories can lead to misleading predictions. Application of logistic competing risks regression and collection of time-to-event data needed should be explored in future studies.

Ethics approval and consent to participate

The institutional review board at each hospital approved participation as clinical center. Waiver of consent for enrollment at birth into the observational study was granted at most participating hospitals, but parental consent was required at five hospitals (4 written, 1 oral). Most hospitals required written parental consent for participation in the follow-up study, but five hospitals allowed participation under waiver of consent. Informed consents were obtained for all infants at hospitals that required parental consent.

The institutional review board at RTI International approved participation as data coordinating center and the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) Neonatal Research Network publication committee approved the submission of this study for publication.

Consent for publication

Not applicable

Availability of data and materials

The datasets used are available from Dr. Lei Li on reasonable request.

Competing interests

The authors declare that they have no competing interests.

Funding

The National Institutes of Health and the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) provided grant support for the Neonatal Research Network’s Generic Database and Follow-up Study through cooperative agreements with participating sites and with RTI International (U10 HD36790), which serves as the data coordinating center. We are indebted to our medical and nursing colleagues and the infants and their parents who agreed to take part in these studies.

Additional support was provided by a RTI Fellows publication award.

Authors' contributions

LL and MAR contributed to the conception of the work and drafted the paper. LL conducted all the data analyses. AD contributed to the acquisition and interpretation of data. GB and AD contributed to the revision of the paper. All the four authors approved the submitted version and agreed to be personally accountable for the author's own contributions and to ensure that questions related to the accuracy or integrity of any part of the work, even ones in which the author was not personally involved, are appropriately investigated, resolved, and the resolution documented in the literature.

Acknowledgements

We thank Grier Page, Senior Fellow at RTI, for insightful comments on the manuscript.

Biesheuvel CJ, Vergouwe Y, Steyerberg EW, Grobbee DE, Moons KG. Polytomous logistic regression analysis could be applied more often in diagnostic research. J Clin Epidemiol. 2008;61:125–34.
Roukema J, van Loenhout RB, Steyerberg EW, Moons KG, Bleeker SE, Moll HA. Polytomous regression did not outperform dichotomous logistic regression in diagnosing serious bacterial infections in febrile children. J Clin Epidemiol. 2008;61:135–41.
Van Calster B, McLernon DJ, van Smeden M, et al. Calibration: the Achilles heel of predictive analytics. BMC Med. 2019;17(1):230. https://doi.org/10.1186/s12916-019-1466-7.
Edlinger M, van Smeden M, Alber HF, Wanitschek M, Van Calster B. Risk prediction models for discrete ordinal outcomes: Calibration and the impact of the proportional odds assumption. Stat Med. 2022;41:1334–60.
Falconieri N, Van Calster B, Timmerman D, Wynants L. Developing risk models for multicenter data using standard logistic regression produced suboptimal predictions: A simulation study. Biom J. 2020;62:932–44.
Hartzel J, Agresti A, Caffo B. Multinomial logit random effects models. Stat Modelling. 2001;1:81–102.
Agresti A. Categorical Data Analysis. 2nd Edition, John Wiley and Sons Inc., Hoboken; 2002.
Coull BA, Agresti A. Random effects modeling of multiple binomial responses using the multivariate binomial logit-normal distribution. Biometrics. 2000;56:73–80.
Paneth N, Gryzbowski M, LaGamma E. Combined Outcomes in Prevention Trials: Rarely A Good Idea. American Epidemiological Society 2014. (http://www.epi.msu.edu/video/paneth/copt/default).
Manja V, AlBashir S, Guyatt G. Criteria for use of composite endpoints for competing risks – A systematic survey of the literature with recommendations. J Clin Epidemiol. 2017;82:4–11.
Gerds TA, Scheike TH, Andersen PK. Absolute risk regression for competing risks: interpretation, link functions, and prediction. Stat Med. 2012;31:3921–30.
Rysavy MA, Li L, Bell EF, Das A, Hintz SR, Stoll BJ, Vohr BR, Carlo WA, Shankaran S, Walsh MC, Tyson JE, Cotten CM, Smith PB, Murray JC, Colaizy TT, Brumbaugh JE, Higgins RD. Between-hospital variation in treatment and outcomes in extremely preterm infants. New Engl J Med. 2015;372:1801–11.
Vohr BR, Wright LL, Poole WK, McDonald SA. Neurodevelopmental outcomes of extremely low birth weight infants < 32 weeks’ gestation between 1993 and 1998. Pediatrics. 2005;116:635–43.
Tyson JE, Parikh NA, Langer J, Green C, Higgins RD. Intensive care for extreme prematurity–moving beyond gestational age. New Engl J Med. 2008;358:1672–81.
Rysavy MA, Horbar JD, Bell EF, Li L, Greenberg LT, Tyson JE, Patel RM, Carlo WA, Younge NE, Green CE, Edwards EM, Hintz SR, Walsh MC, Buzas JS, Das A, Higgins RD. Assessment of an Updated Neonatal Research Network Extremely Preterm Birth Outcome Model in the Vermont Oxford Network. JAMA Pediatr. 2020;174:e196294.
SAS Institute Inc. SAS/STAT® User’s Guide. Cary, NC: SAS Institute Inc; 2021.
Harrell FE, Lee KL, Mark DB. Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy and measuring and reducing errors. Stat Med. 1996;15:361–87.
Steyerberg EW, Harrell FE, Borsboom GJ, Eijkemans MJ, Vergouwe Y, Habbema JD. Internal validation of predictive models: efficiency of some procedures for logistic regression analysis. J Clin Epidemiol. 2001;54:774–81.
Younge N, Goldstein R, Bann CM, et al. Survival and neurodevelopmental outcomes among periviable infants. N Engl J Med. 2017;376:617–28.
Marlow N. Keeping up with outcomes for infants born at extremely low gestational ages. JAMA Pediatr. 2015;169:207–8.
Linsell L, Malouf R, Morris J, Kurinczuk JJ, Marlow N. Prognostic factors for poor cognitive development in children born very preterm or with very low birth weight: A systematic review. JAMA Pediatr. 2015;169:1162–72.

No competing interests reported.

Download PDF

Editorial decision: Revision requested
07 Mar, 2024
Reviews received at journal
15 Feb, 2024
Reviews received at journal
13 Feb, 2024
Reviewers agreed at journal
08 Feb, 2024
Reviewers agreed at journal
02 Feb, 2024
Reviewers invited by journal
02 Feb, 2024
Editor invited by journal
02 Feb, 2024
Editor assigned by journal
02 Feb, 2024
Submission checks completed at journal
02 Feb, 2024
First submitted to journal
30 Jan, 2024

You are reading this latest preprint version

Comparing methods for risk prediction of multicategory outcomes: dichotomized logistic regression vs. multinomial logit regression

Status:

Version 1

Abstract

Figures

Background

Methods

Dichotomized logistic regression

Multinomial continuation-ratio logit regression

Logistic competing risks regression

Results

Patient outcomes and predictor variables

Estimated models

Model predictive performance

Discussion

Conclusion

Declarations

Ethics approval and consent to participate

Consent for publication

References

Additional Declarations

Status:

Version 1