Abstract
In the framework of binary segmentation, we introduce alternative splitting criteria based on the predictability r index of Goodman and Kruskal. We use such splitting criteria in a two-stage predictive splitting procedure. Furthermore, we introduce as stopping rule a statistical test based on the CATANOVA statistic of Light and Margolin. We show an example on a real data set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
BREIMAN, L., FRIEDMAN, J.H., OLSHEN, R.A. and STONE, C.J. (1984): Classification and Regression Trees. Wads worth International Group, Belmont, California.
CIAMPI, A. and THIFFAULT, J. (1987): Recursive partition and Amalgamation (REC-PAM) for censored survival data: criteria for tree selector, Statistical Software Newsletter, n. 2, vol. 14, 78–81.
ELIE, S. (1992): Les methódes d’élagages des arbres de segmentation. D.E.A. thesis in: Controle des systemes, Renault/U.T.C./INRIA.
GELFAND, S.B. (1991): An iterative growing and pruning algorithm for classification tree designai data. TREE Transaction of pattern analysis and machine intelligence, n. 2, vol. 13, 163–174.
GOODMAN, L.A. and KRUSKAL, W.H. (1954): Measures of association for cross-classification. Journal of American Statistical Association, 48, 732–762.
LECHEVALLIER, Y., (1990): Recherche d’une partition optimale sous contrainte d’ordre total. rapports de Recherche N.1247 INRIA
LIGHT, R.J. and MARGOLIN, B.H. (1971): An analysis of variance for categorical data. Journal of American Statistical Association, 66, 534–544.
MOLA, F. and SICILIANO, R. (1992): A two-stage predictive splitting algorithm in binary segmentation. In: Y. Dodge and J. Whittaker (eds.): Computational Statistics. (Compstat ’92 Proceedings). Physica Verlag, vol. 1.
MOLA, F. (1993): Aspetti metodologici e computazionali delle tecniche di segmentazione binaria. Un contributo basato su funzioni di predizione. Tesi di Dottorato in Statistica Computazionale ed Applicazioni, V Ciclo, Università di Napoli.
MORGAN, J.N. and SONQUIST, J.A. (1963): Problems in the analysis of survey data and a proposals. Journal of American Statistical Association, 58, 415–434.
QUINLAN, J.R. (1986): Induction of decision trees. Machine learning, n. 1, 81–106.
UTGOFF, P.E. (1989): Incremental induction of decision trees. Machine learning, n. 4, 161–186.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1994 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mola, F., Siciliano, R. (1994). Alternative strategies and CATANOVA testing in two-stage binary segmentation. In: Diday, E., Lechevallier, Y., Schader, M., Bertrand, P., Burtschy, B. (eds) New Approaches in Classification and Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-51175-2_37
Download citation
DOI: https://doi.org/10.1007/978-3-642-51175-2_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-58425-4
Online ISBN: 978-3-642-51175-2
eBook Packages: Springer Book Archive