Abstract
A key concept of the forward search algorithm in confirmatory factor analysis is ordering of the data on the basis of observational residuals. These residuals are computed under the proposed model and measure the discrepancy between the observed and predicted response for each unit of the sample. Regression-type factor scores are used to estimate model predictions. Informative forward plots are created for indexing influential observations and to show the dynamics of the estimates throughout the search. The detailed influence of each observation on the model parameters and fit indices is analyzed and a robust model inference is achieved. Real and simulated data sets with known contamination schemes are used to demonstrate the performance of the forward search algorithm.
Similar content being viewed by others
Notes
CFI stands for comparative fit index. Similar to TLI, it compares the proposed model with the baseline model.
These changes cannot be seen in the plots of Fig. 3.
A linear regression of \(\widehat{\beta }_1\) on a constant and basic set size was used.
References
Atkinson AC (1994) Fast very robust methods for the detection of multiple outliers. J Am Stat Assoc 89:1329–1339
Atkinson AC, Riani M (2000) Robust diagnostic regression analysis. Springer, New York
Atkinson AC, Riani M, Cerioli A (2004) Exploring multivariate data with the forward search. Springer, New York
Bentler PM, Bonett DG (1980) Significance tests and goodness of fit in the analysis of covariance structures. Psychol Bull 88:588–606
Bollen KA (1989) Structural equations with latent variables. Wiley, New York
Bollen KA, Arminger G (1991) Observational residuals in factor analysis and structural equation models. Sociol Methodol 21:235–262
Breckler SJ (1990) Application of covariance structure modeling in psychology: cause for concern? Psychol Bull 107:260–273
Brown TA (2006) Confirmatory factor analysis for applied research. The Guilford Press, New York
Campbell NA (1980) Robust procedures in multivariate analysis I: robust covariance estimation. J Roy Statist Soc Ser C (Appl Stat) 29:231–237
Chow GC (1960) Tests of equality between sets of coefficients in two linear regressions. Econometrica 28:591–605
Cook RD (1977) Detection of influential observations in linear regression. Technometr 19:15–18
Cudeck R, MacCallum RC (eds) (2007) Factor analysis at 100: historical developments and future directions. Lawrence Erlbaum Associates, Mahwah
Dupuis Lozeron E, Victoria-Feser MP (2009) Robust estimation of constrained covariance matrices for confirmatory factor analysis. Comput Stat Data Anal 54:3020–3032
Fan X, Thompson B, Wang L (1999) Effects of sample size, estimation methods, and model specification on structural equation modeling fit indexes. Struct Equ Model 6:56–83
Hadi AS (1992) Identifying multiple outliers in multivariate data. J Roy Statist Soc Ser B (Methodol) 54:761–771
Hallquist M (2011) MplusAutomation: Automating Mplus model estimation and interpretation. R package version 0.5.
Holzinger KJ, Swineford F (1939) A study in factor analysis: The stability of a bi-factor solution. Supplementary Educational Monographs 48. University of Chicago, Chicago
Hu L, Bentler PM (1998) Fit indices in covariance structural equation modeling: sensitivity to underparameterized model misspecification. Psychol Method 3:424–453
Hu L, Bentler PM (1999) Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Struct Equ Model 6:1–55
Maronna RA, Martin RD, Yohai VJ (2006) Robust statistics: theory and methods. Wiley, Chichester
Mavridis D, Moustaki I (2008) Detecting outliers in factor analysis using the forward search algorithm. Multivar Behav Res 43:453–475
Mavridis D, Moustaki I (2009) The forward search algorithm for detecting aberrant response patterns in factor analysis for binary data. J Comput Graph Stat 18:1016–1034
Muthén LK, Muthén BO (2010) Mplus, version 6.1. Muthén & Muthén, Los Angeles
Pek J, MacCallum RC (2011) Sensitivity analysis in structural equation models: cases and their influence. Multivar Behav Res 46:202–228
Pison G, Rousseeuw PJ, Filzmoser P, Croux C (2003) Robust factor analysis. J Multivar Anal 84:145–172
Pison G, Van Aelst S, Willems G (2002) Small sample corrections for LTS and MCD. Metrika 55:111–123
Poon W-Y, Wong Y-K (2004) A forward search procedure for identifying influential observations in the estimation of a covariance matrix. Struct Equ Model 11:357–374
R Development Core Team (2011) R: A language and environment for statistical computing, version 2.13.2. R Foundation for Statistical Computing, Vienna
Rensvold RB, Cheung GW (1999) Identification of influential cases in structural equation models using the jackknife method. Organ Res Method 2:293–308
Rosseel Y with contributions by Oberski D (2011) lavaan: Latent variable analysis. R package version 0.4-9.
Rousseeuw PJ (1984) Least median of squares regression. J Am Stat Assoc 79:871–880
Rousseeuw PJ (1985) Multivariate estimation with high breakdown point. In: Grossmann W, Pflug G, Vincze I, Wertz W (eds) Mathematical statistics and applications. Reidel, Dordrecht, pp 283–297
Rousseeuw PJ, Croux C, Todorov V, Ruckstuhl A, Salibian-Barrera M, Verbeke T, Koller M, Maechler M (2011) robustbase: Basic robust statistics. R package version 0.8-0.
Rousseeuw PJ, Van Driessen K (1999) A fast algorithm for the minimum covariance determinant estimator. Technometr 41:212–223
Rousseeuw PJ, Van Zomeren BC (1990) Unmasking multivariate outliers and leverage points. J Am Stat Assoc 85:633–651
Venables WN, Ripley BD (2002) Modern applied statistics with S-Plus, 4th edn. Springer, New York
Yang W, Tanaka Y, Nakaya J (2006) Forward search algorithm for robust influence analysis in maximum likelihood factor analysis. Int J Comput Sci Netw Secur 6:43–49
Yuan K-H, Bentler PM (1998) Structural equation modeling with robust covariances. Sociol Methodol 28:363–396
Yuan K-H, Bentler PM (2000) Robust mean and covariance structure analysis through iteratively reweighted least squares. Psychometrika 65:43–58
Yuan K-H, Bentler PM (2001) Effect of outliers on estimators and tests in covariance structure analysis. Br J Math Stat Psychol 54:161–175
Yuan K-H, Hayashi K (2006) Standard errors in covariance structure models: asymptotics versus bootstrap. Br J Math Stat Psychol 59:397–417
Zeileis A, Kleiber C, Krämer W, Hornik K (2003) Testing and dating of structural changes in practice. Comput Stat Data Anal 44:109–123
Acknowledgments
The research was supported by the Slovenian Research agency. The author would like to thank Prof. Matjaž Omladič for tireless assistance and support during the conduct of this research and the preparation of this paper. Special thanks also go to the anonymous reviewers. Their comments and suggestions led to significant improvements to the initial manuscript.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Toman, A. Robust confirmatory factor analysis based on the forward search algorithm. Stat Papers 55, 233–252 (2014). https://doi.org/10.1007/s00362-013-0525-y
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00362-013-0525-y
Keywords
- Confirmatory factor analysis
- Robust estimation
- Forward search algorithm
- Exploratory data analysis
- Outlier
- Influential observation