Abstract
In this paper, a method to study travel behaviour dynamics by constructing detailed synthetic pseudo panels from repeated cross-sectional data is presented. The method is based on the modelling of a high-dimensional joint distribution of travel preferences conditional on detailed socio-economic profiles by using a conditional variational autoencoder (CVAE). The CVAE is a neural-network-based generative model which allows the modelling of very detailed joint and conditional distributions, potentially defined by dozens or even hundreds of attributes in a flexible non-parametric form. The proposed method is used to rank detailed cohorts of individuals into slow and fast movers with respect to the speed at which their travel behaviour change over time. This gives an interesting insight into the types of individuals who are easily motivated to change their behaviour as opposed to those who are less flexible. Specifically, we investigate the dynamics of transport preferences for a fixed pseudo panel of individuals from a large Danish cross-sectional data set covering the period from 2006 to 2016. The comparison of the travel preference distributions from 2006 and 2016 shows that the prototypical fast mover is a single young woman who lives in a large city, whereas the typical slow mover is a middle-aged man with high income from a nuclear family who lives in a detached house outside a city. However, given that it is possible to rank individuals across very detailed socio-economic classifications, many other relationships can be explored. Finally, the CVAE can be directly applied to the population synthesis problem in microsimulation by modelling the distribution of socio-economic profiles conditional on other variables.
Similar content being viewed by others
References
Antman, F., McKenzie, D.J.: Earnings mobility and measurement error: a pseudo-panel approach. Econ. Dev. Cult. Change 56(1), 125–161 (2007)
Bishop, C.M.: Pattern Recognition and Machine Learning (Information Science and Statistics). Springer, Berlin (2006)
Borysov, S.S., Rich, J., Pereira, F.C.: How to generate micro-agents? A deep generative modeling approach to population synthesis. Transp. Res. Part C: Emerg. Technol. 106, 73–97 (2019). https://doi.org/10.1016/j.trc.2019.07.006
Choi, E., Biswal, S., Malin, B.A., Duke, J., Stewart, W.F., Sun, J.: Generating multi-label discrete patient records using generative adversarial networks. In: MLHC (2017)
Cirillo, C., Xu, R., Bastin, F.: A dynamic formulation for car ownership modeling. Transp. Sci. 50(1), 322–335 (2016). https://doi.org/10.1287/trsc.2015.0597
Cramer, J.S.: Efficient grouping, regression and correlation in Engel curve analysis. J. Am. Stat. Assoc. 59(305), 233–250 (1964)
Dargay, J.M.: Determinants of car ownership in rural and urban areas: a pseudo-panel analysis. Transp. Res. Part E: Logist. Transp. Rev. 38(5), 351–366 (2002). https://doi.org/10.1016/S1366-5545(01)00019-9
Dargay, J.M., Vythoulkas, P.C.: Estimation of a dynamic car ownership model: a pseudo-panel approach. J. Transp. Econ. Policy 33(3), 287–301 (1999)
Deaton, A.: Panel data from time series of cross-sections. J. Econom. 30(1), 109–126 (1985). https://doi.org/10.1016/0304-4076(85)90134-4
Dellas, H., Koubi, V.: Business cycles and schooling. Eur. J. Polit. Econ. 19(4), 843–859 (2003). https://doi.org/10.1016/S0176-2680(03)00039-9
de Haas, M., Scheepers, C., Harms, L., Kroesen, M.: Travel pattern transitions: applying latent transition analysis within the mobility biographies framework. Transp. Res. Part A: Policy Pract. 107, 140–151 (2018). https://doi.org/10.1016/j.tra.2017.11.007
Farooq, B., Bierlaire, M., Hurtubia, R., Flötteröd, G.: Simulation based population synthesis. Transp. Res. Part B: Methodol. 58, 243–263 (2013). https://doi.org/10.1016/j.trb.2013.09.012
Gardes, F., Duncan, G.J., Gaubert, P., Gurgand, M., Starzec, C.: Panel and pseudo-panel estimation of cross-sectional and time series elasticities of food consumption: The case of U.S. and polish data. J. Bus. Econ. Stat. 23(2), 242–253 (2005). http://www.jstor.org/stable/27638815
Gärling, T., Axhausen, K.W.: Introduction: habitual travel choice. Transportation 30(1), 1–11 (2003). https://doi.org/10.1023/A:1021230223001
Garrido, S., Borysov, S.S., Pereira, F.C., Rich, J.: Prediction of rare feature combinations in population synthesis: application of deep generative modelling. arXiv:1909.07689v1 (2019)
Golob, T.F., Kitamura, R., Long, L.: Panels for Transportation Planning, 1st edn. Transportation Research, Economics and Policy, Springer, New York (1997). https://doi.org/10.1007/978-1-4757-2642-8
Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial nets. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q. (eds.) Advances in neural information processing systems, vol. 27, pp. 2672–2680. Curran Associates Inc, Red Hook (2014)
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016)
Goulias, K.G.: Longitudinal analysis of activity and travel pattern dynamics using generalized mixed Markov latent class models. Transp. Res. Part B: Methodol. 33(8), 535–558 (1999). https://doi.org/10.1016/S0191-2615(99)00005-3
Haustein, S., Siren, A.: Older people’s mobility: segments, factors, trends. Transp. Rev. 35(4), 466–487 (2015). https://doi.org/10.1080/01441647.2015.1017867
Higgins, I., Matthey, L., Pal, A., Burgess, C., Glorot, X., Botvinick, M., Mohamed, S., Lerchner, A.: beta-vae: Learning basic visual concepts with a constrained variational framework. In: International Conference on Learning Representations https://openreview.net/forum?id=Sy2fzU9gl (2017)
Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks. Science 313(5786), 504–507 (2006). https://doi.org/10.1126/science.1127647
Hsiao, C.: Analysis of Panel Data. Econometric Society Monographs, 3rd edn. Cambridge University Press, Cambridge (2014). https://doi.org/10.1017/CBO9781139839327
Huang, B.: The Use of Pseudo Panel Data for Forecasting Car Ownership. MPRA Paper 7086, University Library of Munich, Germany. https://ideas.repec.org/p/pra/mprapa/7086.html (2007)
Jang, E., Gu, S., Poole, B.: Categorical reparameterization with Gumbel-Softmax (2016). arXiv preprint arXiv:161101144
Kaplan, R.M., Atkins, C.J.: Selective attrition causes overestimates of treatment effects in studies of weight loss. Addict. Behav. 12(3), 297–302 (1987). https://doi.org/10.1016/0306-4603(87)90044-X
Kingma, D.P., Welling, M.: Auto-encoding variational bayes (2013). arXiv preprint arXiv:13126114
Kingma, D.P., Rezende, D.J., Mohamed, S., Welling, M.: Semi-supervised learning with deep generative models. In: Proceedings of the 27th International Conference on Neural Information Processing Systems—Volume 2, MIT Press, Cambridge, NIPS’14, pp. 3581–3589 (2014). http://dl.acm.org/citation.cfm?id=2969033.2969226
Kitamura, R.: Panel analysis in transportation planning: an overview. Transp. Res. Part A: Gen. 24(6), 401–415 (1990). https://doi.org/10.1016/0191-2607(90)90032-2
Maddison, C.J., Mnih, A., Teh, Y.W.: The concrete distribution: a continuous relaxation of discrete random variables (2016). arXiv preprint arXiv:161100712
Mau, P., Eyzaguirre, J., Jaccard, M., Collins-Dodd, C., Tiedemann, K.: The ‘neighbor effect’: simulating dynamics in consumer preferences for new vehicle technologies. Ecol. Econ. 68(1), 504–516 (2008). https://doi.org/10.1016/j.ecolecon.2008.05.007
McFadden, D.: The behavioral science of transportation. Trans. Policy 14(4), 269–274 (2007)
Metz, D.: Mobility of older people and their quality of life. Transp. Policy 7(2), 149–152 (2000). https://doi.org/10.1016/S0967-070X(00)00004-4
Müggenburg, H., Busch-Geertsema, A., Lanzendorf, M.: Mobility biographies: a review of achievements and challenges of the mobility biographies approach and a framework for further research. J. Transp. Geogr. 46, 151–163 (2015). https://doi.org/10.1016/j.jtrangeo.2015.06.004
Nolan, A.: A dynamic analysis of household car ownership. Transp. Res. Part A: Policy Pract. 44(6), 446–455 (2010). https://doi.org/10.1016/j.tra.2010.03.018
Rainforth, T., Kosiorek, A.R., Le, T.A., Maddison, C.J., Igl, M., Wood, F., Teh, Y.W.: Tighter variational bounds are not necessarily better. In: ICML (2018)
Ranzato, M., Poultney, C., Chopra, S., Cun, Y.L.: Efficient learning of sparse representations with an energy-based model. In: Schölkopf, B., Platt, J.C., Hoffman, T. (eds.) Adv. Neural Inf. Process. Syst., vol. 19, pp. 1137–1144. MIT Press, Cambridge (2007)
Rich, J., Hansen, C.O.: The Danish national passenger model–model specification and results. Eur. J. Transp. Infrastruct. Res. (2016). https://doi.org/10.18757/ejtir.2016.16.4.3159
Rich, J., Vandet, C.A.: Is the value of travel time savings increasing? Analysis throughout a financial crisis. Transp. Res. Part A: Policy Pract. 124, 145–168 (2019)
Rolfe, J.T.: Discrete variational autoencoders (2016). arXiv preprint arXiv:160902200
Saadi, I., Mustafa, A., Teller, J., Farooq, B., Cools, M.: Hidden Markov model-based population synthesis. Transp. Res. Part B: Methodol. 90, 1–21 (2016). https://doi.org/10.1016/j.trb.2016.04.007
Schoenduwe, R., Mueller, M.G., Peters, A., Lanzendorf, M.: Analysing mobility biographies with the life course calendar: a retrospective survey methodology for longitudinal data collection. J. Transp. Geogr. 42, 98–109 (2015). https://doi.org/10.1016/j.jtrangeo.2014.12.001
Sohn, K., Yan, X., Lee, H.: Learning structured output representation using deep conditional generative models. In: Proceedings of the 28th International Conference on Neural Information Processing Systems - Volume 2, MIT Press, Cambridge, MA, USA, NIPS’15, pp. 3483–3491 (2015). http://dl.acm.org/citation.cfm?id=2969442.2969628
Sun, L., Erath, A.: A Bayesian network approach for population synthesis. Transp. Res. Part C: Emerg. Technol. 61, 49–62 (2015). https://doi.org/10.1016/j.trc.2015.10.010
Sun, L., Erath, A., Cai, M.: A hierarchical mixture modeling framework for population synthesis. Transp. Res. Part B: Methodol. 114, 199–212 (2018). https://doi.org/10.1016/j.trb.2018.06.002
Theis, L., van den Oord, A., Bethge, M.: A note on the evaluation of generative models. In: International Conference on Learning Representations (2016). http://arxiv.org/abs/1511.01844
Verbeek, M., Nijman, T.: Minimum MSE estimation of a regression model with fixed effects from a series of cross-sections. J. Econom. 59(1), 125–136 (1993). https://doi.org/10.1016/0304-4076(93)90042-4
Vij, A., Gorripaty, S., Walker, J.L.: From trend spotting to trend ’splaining: understanding modal preference shifts in the san Francisco bay area. Transp. Res. Part A: Policy Pract. 95, 238–258 (2017). https://doi.org/10.1016/j.tra.2016.11.014
Wooldridge, J.M.: Econometric Analysis of Cross Section and Panel Data. MIT Press, Cambridge (2010). http://www.jstor.org/stable/j.ctt5hhcfr
Xiong, C., Chen, X., He, X., Guo, W., Zhang, L.: The analysis of dynamic travel mode choice: a heterogeneous hidden Markov approach. Transportation 42(6), 985–1002 (2015). https://doi.org/10.1007/s11116-015-9658-2
Zarwi, F.E., Vij, A., Walker, J.L.: A discrete choice framework for modeling and forecasting the adoption and diffusion of new transportation services. Transp. Res. Part C: Emerg. Technol. 79, 207–223 (2017). https://doi.org/10.1016/j.trc.2017.03.004
Acknowledgements
The research leading to these results has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant Agreement No. 713683 (COFUNDfellowsDTU). The authors also thank Mogens Fosgerau for useful discussions. The paper was partially presented on the 8th Symposium of the European Association for Research in Transportation (hEART 2019).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Borysov, S.S., Rich, J. Introducing synthetic pseudo panels: application to transport behaviour dynamics. Transportation 48, 2493–2520 (2021). https://doi.org/10.1007/s11116-020-10137-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11116-020-10137-5