Skip to main content
Log in

Analyzing the capabilities of the HB logit model for choice-based conjoint analysis: a simulation study

  • Original paper
  • Published:
Journal of Business Economics Aims and scope Submit manuscript

Abstract

The authors conduct an extensive simulation study to examine the capabilities of the Hierarchical Bayes (HB) logit model for choice-based conjoint (CBC) studies. The statistical performance of HB is evaluated under experimentally varying factor level settings using criteria for goodness-of-fit, parameter recovery and predictive accuracy. The results provide guidance to market researchers who are confronted with the problem that clients desire to include more and more attributes while keeping the choice task manageable. The results show that for simple CBC settings HB estimation proves to be quite robust. One of the main findings for simple CBC settings is that holding other factors at convenient levels far more attributes than previously suggested can be used in CBC studies. Further, sample size and/or the number of choice tasks per respondent can be noticeably reduced. However, for more complex CBC settings with an already high number of parameters (part-worths) but rather little information available from respondents, the HB model is starting to collapse if more than one of those factors (attributes, sample size, choice tasks) is set to an extreme level.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Notes

  1. See for example Rao (2014), p. 171.

  2. To measure the predictive performance of HB-CBC in our simulation study, we include two fixed holdout tasks in addition to the maximum of 15 random tasks for estimation.

  3. E.g., compare Haaijer and Wedel (2010).

  4. The Gumbel distribution is also known as the extreme value distribution of type I: \( {\text{G}}\left( {\text{x}} \right)\, = \,{\text{exp}}( - \,{\text{exp}}( - \frac{1}{\mu }(x - \eta ))),\;\mu > 0. \)

  5. The variance of the Gumbel distribution is \( \frac{{\pi^{2} \mu^{2} }}{6} \), which amounts to 1.645 if \( \mu \) = 1, and to 3.290 if \( \mu \) = \( \sqrt 2 \).

  6. The Hierarchical Bayes model for choice-based conjoint analysis is described in more detail in “Appendix 1”.

  7. Following previous conjoint simulation studies, we focus on main-effects models und do not include interactions between attributes. Consequently, the off-diagonal elements of the covariance matrix \( V_{\beta } \) are set to zero.

  8. The Brier score is commonly used as scoring rule to evaluate probability forecasts in meteorology. Basically, the Brier score can be applied to any case where the estimated probability that an event will occur is compared to whether the event actually occurred or not.

  9. Note that less choice tasks would guard against disengaged respondents or the application of simplification strategies in later choice tasks, as was raised by Kurz and Binner (2012).

  10. Note that all group means (3 vs. 4 levels, 4 vs. 5 levels) differ significantly with respect to parameter recovery and with respect to all four criteria measuring predictive accuracy, so that a stepwise increase in the number of attribute levels from 3 to 5 consistently leads to a worse model performance.

  11. The hit rate of a null model with 5 real alternatives plus none option is 16.67%. The observed hit rate of 65.31% is therefore 3.92 (65.31 divided by 16.67) times better than chance. Interestingly, the results for the Brier and spherical score do not coincide here. While the spherical score significantly decreases for an increasing number of alternatives, the Brier score significantly increases. For all other factors, Brier and spherical score always provide consistent results.

  12. Across our measures for parameter recovery and predictive validity, mean correlations and hit rates are particularly well interpretable from a managerial point of view, as both measures range between 0 and 1. That’s why we focus on those two measures in the following.

  13. For implementation, we chose one of the attributes to represent the brand or price attribute, respectively. In case the simplification strategy is based on choosing the alternative with the lowest offered price, we ordered the randomly generated price part-worths according to their magnitude so that lower prices are associated with higher utilities for those respondents. The respondents who are assumed to use a simplification strategy are determined by random, respectively.

  14. See in particular Rao (2014), p. 171, and also Green and Srinivasan (1990) and Hair et al. (2009).

  15. Note that the interaction effect between the number of respondents and the number of choice tasks per respondent turned out not significant in the ANOVAs for all performance measures (compare Table 6 in “Appendix 2”).

  16. Of course, Pearson correlations (as well as hit rates) are generally higher when CBC designs are simpler (compare Table 4).

References

  • Allenby GM, Ginter JL (1995) Using extremes to design products and segment markets. J Mark Res 32(4):392–403

    Article  Google Scholar 

  • Allenby GM, Arora N, Ginter JL (1995) Incorporating prior knowledge into the analysis of conjoint studies. J Mark Res 32(2):152–162

    Article  Google Scholar 

  • Andrews RL, Ainslie A, Currim IS (2002a) An empirical comparison of logit choice models with discrete versus continuous representations of heterogeneity. J Mark Res 39(4):479–487

    Article  Google Scholar 

  • Andrews RL, Ansari A, Currim IS (2002b) Hierarchical Bayes versus finite mixture conjoint analysis models: a comparison of fit, prediction, and partworth recovery. J Mark Res 39(1):87–98

    Article  Google Scholar 

  • Backhaus K, Hillig T, Wilken R (2007) Predicting purchase decisions with different conjoint analysis methods. A Monte Carlo simulation. Int J Market Res 49(3):341–364

    Article  Google Scholar 

  • Brier GW (1950) Verification of forecasts expressed in terms of probability. Mon Weather Rev 78(1):1–3

    Article  Google Scholar 

  • Chakraborty G, Ball D, Gaeth GJ, Jun S (2002) The ability of ratings and choice conjoint to predict market shares. A Monte Carlo simulation. J Bus Res 55(3):237–249

    Article  Google Scholar 

  • Cohen J (1988) Statistical power analysis for the behavioral sciences. Lawrence Erlbaum Associates, Hillsdale

    Google Scholar 

  • Currim IS, Weinberg CB, Wittink DR (1981) Design of subscription programs for a performing arts series. J Consum Res 8(1):67–75

    Article  Google Scholar 

  • Dhar R (1997) Consumer preference for a no-choice option. J Consum Res 24(2):215–231

    Article  Google Scholar 

  • Gelman A, Carlin JB, Stern HS, Rubin DB (2008) Bayesian data analysis. Chapman and Hall/CRC, Boca Raton

    Google Scholar 

  • Gneiting T, Raftery AE (2007) Strictly proper scoring rules, prediction, and estimation. J Am Stat Assoc 102(477):359–378

    Article  Google Scholar 

  • Green PE, Srinivasan V (1990) Conjoint analysis in marketing: new developments with implications for research and practice. J Mark 54(4):3–19

    Article  Google Scholar 

  • Greenberg E (2008) Introduction to Bayesian econometrics. Cambridge University Press, New York

    Google Scholar 

  • Haaijer R, Wedel M (2010) Conjoint choice experiments: General characteristics and alternative model specifications. In: Gustafsson A, Herrmann A, Huber F (eds) Conjoint measurement: methods and applications, 4th edn. Springer, Berlin, pp 199–230

    Google Scholar 

  • Hair JF, Black WC, Babin BJ, Anderson RE, Tatham RL (2009) Multivariate data analysis. Prentice Hall, Upper Saddle River

    Google Scholar 

  • Hauser JR, Rao VR (2004) Conjoint analysis, related modeling, and applications. In: Wind Y, Green PE (eds) Marketing research and modeling: progress and prospects, international series in quantitative marketing, vol 14. Springer, New York, pp 141–168

    Chapter  Google Scholar 

  • Hensher DA, Stopher PR, Louviere JJ (2001) An exploratory analysis of the effect of numbers of choice sets in designed choice experiments: an airline choice application. J Air Transp Manag 7(6):373–379

    Article  Google Scholar 

  • Hoogerbrugge M, van der Wagt K (2006) How many choice tasks should we ask? In: Proceedings of the 12th Sawtooth Software Conference. Orlando, pp 97–110

  • Huber J, Meissner M, Musalem A (2018) How does choice-based conjoint work? Talk at the Turbo CBC meeting at the 20th Sawtooth Software Conference (unpublished work)

  • Johnson RM, Orme BK (1996) Getting the most from CBC. Research paper series, Sawtooth Software Inc

  • Kneib T, Baumgartner B, Steiner WJ (2007) Semiparametric multinomial logit models for analyzing consumer choice behavior. Adv Stat Anal 91(3):225–244

    Article  Google Scholar 

  • Kurz P, Binner S (2012) The individual choice task threshold. Need for variable number of choice tasks. In: Proceedings of the 16th Sawtooth Software Conference. Orlando, pp 111–127

  • Lenk PJ, Orme B (2009) The value of informative priors in bayesian inference with sparse data. J Mark Res 46(6):832–845

    Article  Google Scholar 

  • Lenk PJ, Desarbo WS, Green PE, Young MR (1996) Hierarchical Bayes conjoint analysis: recovery of partworth heterogeneity from reduced experimental designs. Mark Sci 15(2):173–191

    Article  Google Scholar 

  • Louviere JJ, Woodworth G (1983) Design and analysis of simulated consumer choice or allocation experiments: an approach based on aggregate data. J Mark Res 20(4):350–367

    Article  Google Scholar 

  • Louviere JJ, Street DJ, Burgess L, Wasi N, Islam T, Marley AJ (2008) Modeling the choices of individual decision-makers by combining efficient choice experiment designs with extra preference information. J Choice Model 1(1):128–163

    Article  Google Scholar 

  • Louviere JJ, Carson RT, Burgess L, Street DJ, Marley AAJ (2013) Sequential preference questions factors influencing completion rates and response times using an online panel. J Choice Model 8:19–31

    Article  Google Scholar 

  • Orme B (2003) New advances shed light on HB anomalies. Research paper series, Sawtooth Software Inc

  • Pinnell J, Englert S (1997) Number of choice alternatives in discrete choice modeling. In: Proceedings of the 6th Sawtooth Software Conference. Seattle, pp 121–153

  • Rao VR (2014) Applied conjoint analysis. Springer, New York NY

    Book  Google Scholar 

  • Rossi PE, Allenby GM (1993) A Bayesian approach to estimating household parameters. J Mark Res 30(2):171–182

    Article  Google Scholar 

  • Rossi PE, Allenby GM, McCulloch RE (2005) Bayesian statistics and marketing. Wiley, Hoboken

    Book  Google Scholar 

  • Roulston MS (2007) Performance targets and the Brier score. Meteorol Appl 14(2):185–194

    Article  Google Scholar 

  • Rungie CM, Coote LV, Louviere JJ (2012) Latent variables in discrete choice experiments. J Choice Model 5(3):145–156

    Article  Google Scholar 

  • Sawtooth Software (2013) The CBC system for choice-based conjoint analysis version 8. Technical paper series, Sawtooth Software Inc

  • Selka S, Baier D (2014) Kommerzielle Anwendung auswahlbasierter Verfahren der Conjointanalyse: Eine empirische Untersuchung zur Validitätsentwicklung. Marketing ZFP. J Res Manag 36(1):54–64

    Google Scholar 

  • Selka S, Baier D, Kurz P (2014) The validity of conjoint analysis: an investigation of commercial studies over time. In: Spiliopoulou M, Schmidt-Thieme L, Janning R (eds) Data analysis, machine learning and knowledge discovery. Springer, Cham, pp 227–234

    Chapter  Google Scholar 

  • Srinivasan V, Jain AK, Malhotra NK (1983) Improving predictive power of conjoint analysis by constrained parameter estimation. J Mark Res 20(4):433–438

    Article  Google Scholar 

  • Thrasher JF, Anshari D, Lambert-Jessup V, Islam F, Mead E, Popova L, Salloum R, Moodie C, Louvier J, Lindblom EN (2018) Assessing smoking cessation messages with a discrete choice experiment. Tob Regul Sci 4(2):73–87

    Article  Google Scholar 

  • Train KE (2003) Discrete choice methods with simulation. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Vriens M, Wedel M, Wilms T (1996) Metric conjoint segmentation methods: a Monte Carlo comparison. J Mark Res 33(1):73–85

    Article  Google Scholar 

  • Wirth R (2010) HB–CBC, HB-best-worst-CBC or no HB at all? In: Proceedings of the 15th Sawtooth Software Conference. Newport Beach, pp 321–356

  • Wittink DR, Krishnamurthi L, Nutter JB (1982) Comparing derived importance weights across attributes. J Consum Res 8(4):471–474

    Article  Google Scholar 

  • Wittink DR, Krishnamurthi L, Reibstein DJ (1990) The effect of differences in the number of attribute levels on conjoint results. Mark Lett 1(2):113–123

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Maren Hein.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1: The HB model for choice-based conjoint analysis

The HB method can be classified as a random-effects model in which the parameters are assumed to vary across respondents according to a probability distribution. In classical random-effects models the parameters of the probability distribution can be estimated. However, it is not possible to draw inferences about individual-level parameters (e.g., Rossi and Allenby 1993; Allenby and Ginter 1995; Rossi et al. 2005). In contrast, the HB random-effects model is characterized by a hierarchical structure that allows the estimation of individual-level parameters using both information from the probability distribution of all individuals and each individual’s choice data.

At the individual level (lower level) of the hierarchical structure the probability of each individual’s choice is modeled by a multinomial logit model. The MNL model is based on the assumption that error terms are independently and identically Gumbel-distributed and can be expressed as:

$$ p_{n} (j^{\prime } ) = \frac{{\exp (\beta_{n} x_{{j^{\prime } }} )}}{{\sum\nolimits_{j = 1}^{J} {\exp (\beta_{n} x_{j} )} }}, $$

where \( p_{n} (j^{\prime } ) \) is the probability that the nth respondent chooses the j′-th alternative in a particular choice task, \( \beta_{n} \) represents the vector of part-worths for the nth respondent, and \( x_{{j^{\prime}}} \) is a dummy vector for the attribute levels of alternative \( j^{\prime} \).

At the population level (upper level) the multivariate normal distribution is typically used as probability distribution (first-stage prior) to link a respondent’s individual-level preferences to the population level of the model:

$$ \beta_{n} \,\sim \,{\text{N}}\left( {\overline{\beta } ,V_{\beta } } \right), $$

where \( \overline{\beta } \) represents the vector of means of the distribution of individuals’ part-worths, and \( V_{\beta } \) is the covariance matrix that captures the extent of heterogeneity (as well as the correlation in the part-worths) across individuals.

To estimate the parameters \( \overline{\beta } \) and \( V_{\beta } \) of the prior distribution, hyperprior distributions (second-stage priors) have to be specified which are by default chosen to be very diffuse and nearly flat. It is common to assume that the vector of means follows a normal distribution and the covariance matrix is inverse Wishart distributed (cf. Rossi et al. 2005). \( \beta_{n} \), \( \overline{\beta } \) and \( V_{\beta } \) are unknown and have to be estimated using the information from the underlying choice data. Thus the prior information is combined with the observed choice data to estimate the posterior distributions. Markov chain Monte Carlo simulation methods are used for the estimation of the posterior distributions of the parameters, as the form of the posterior distribution is not analytically tractable (e.g. Allenby et al. 1995). Using MCMC techniques random draws from the posterior distributions are generated by an iterative process where one of the three parameters is estimated at a time conditional on the values for the other two parameters (e.g. Train 2003). After a burn-in phase, the draws are finally averaged to calculate point estimates of the parameters.

Appendix 2

See Tables 6 and 7.

Table 6 F-tests of main and interaction effects w.r.t. performance measures (N = 1296; p-values in parentheses)
Table 7 Effect sizes of main effects measured by eta squared (η2)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hein, M., Kurz, P. & Steiner, W.J. Analyzing the capabilities of the HB logit model for choice-based conjoint analysis: a simulation study. J Bus Econ 90, 1–36 (2020). https://doi.org/10.1007/s11573-019-00927-4

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11573-019-00927-4

Keywords

JEL Classification

Navigation