Skip to main content
Log in

Clustering longitudinal ordinal data via finite mixture of matrix-variate distributions

  • Original Paper
  • Published:
Statistics and Computing Aims and scope Submit manuscript

Abstract

In social sciences, studies are often based on questionnaires asking participants to express ordered responses several times over a study period. We present a model-based clustering algorithm for such longitudinal ordinal data. Assuming that an ordinal variable is the discretization of an underlying latent continuous variable, the model relies on a mixture of matrix-variate normal distributions, accounting simultaneously for within- and between-time dependence structures. The model is thus able to concurrently model the heterogeneity, the association among the responses and the temporal dependence structure. An EM algorithm is developed and presented for parameters estimation, and approaches to deal with some arising computational challenges are outlined. An evaluation of the model through synthetic data shows its estimation abilities and its advantages when compared to competitors. A real-world application concerning changes in eating behaviors during the Covid-19 pandemic period in France will be presented.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

Download references

Acknowledgements

This work has been realised thanks to the financial support provided by Project IADoc@UdL of the University of Lyon and Université Lumière - Lyon 2 as part of the call for "doctoral contracts in artificial intelligence 2020" (ANR-20-THIA-0007-01). We want to thank Agnès François-Lecompte, Morgane Innocent and Dominique Kréziak, co-authors for their work in François-Lecompte et al. (2020) for sharing their data. We would also like to thank Brendan Murphy for his invaluable inputs and support throughout the research process. His insights and expertise were instrumental in shaping the direction of this project.

Author information

Authors and Affiliations

Authors

Contributions

FA and JJ wrote the main manuscript text, designed and implemented the model. IP-A provided the data, contributed and supervised the real-world application. All authors reviewed the manuscript.

Corresponding author

Correspondence to Francesco Amato.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A

See Tables 2, 3, 4, 5 and 6.

Table 2 Clusters’ means over time. The estimated parameter \(\hat{\pi }\) = (0.37,0.44,0.19)
Table 3 Clusters’ time correlation
Table 4 Clusters’ time covariances
Table 5 Clusters’ variables correlation
Table 6 Clusters’ variables covariances

Appendix B

See Fig. 10.

Fig. 10
figure 10

Visualization of BIC for K as results of application on real data. Kmeans++ initialization

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Amato, F., Jacques, J. & Prim-Allaz, I. Clustering longitudinal ordinal data via finite mixture of matrix-variate distributions. Stat Comput 34, 81 (2024). https://doi.org/10.1007/s11222-024-10390-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11222-024-10390-z

Keywords

Navigation