ABSTRACT
It is often desirable to restrict attention to part of a data matrix, leaving out either some rows or some columns or both. For example, the columns might subdivide naturally into groups and it would be interesting to analyse each group separately. Or there might be categories corresponding to missing values and one would like to exclude these from the analysis. The most obvious approach would be simply to apply correspondence analysis (CA) to the submatrix of interest. However, one or both of the margins of the submatrix would differ from those of the original data matrix, and so the profiles, masses and distances would change accordingly. The approach presented in this chapter, called subset correspondence analysis, fixes the original margins of the whole matrix, using these to determine the masses and χ2-distance in the analysis of any submatrix. Subset CA has many advantages; for example, the total inertia of the original data matrix is decomposed amongst the subsets, hence the information in a data matrix can be partitioned and investigated separately.
Analysing the consonants and vowels of author data set . . . . . . . . . 161
Subset analysis keeps original margins fixed . . . . . . . . . . . . . . . . 162
Subset CA of consonants, contribution biplot . . . . . . . . . . . . . . . 162
Subset CA of the vowels, contribution biplot . . . . . . . . . . . . . . . 162
Subset MCA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164
Subset analysis on an indicator matrix . . . . . . . . . . . . . . . . . . . 165
Subset analysis on a Burt matrix . . . . . . . . . . . . . . . . . . . . . . 165
Subset MCA with rescaled solution and adjusted inertias . . . . . . . . 166
Supplementary points in subset CA . . . . . . . . . . . . . . . . . . . . 166
Supplementary points in subset MCA . . . . . . . . . . . . . . . . . . . 167
SUMMARY: Subset Correspondence Analysis . . . . . . . . . . . . . . . 168