Michael Friendly.
Corrgrams: Exploratory Displays for Correlation Matrices.
In The American Statistician, vol. 56, no. 4, pp. 316--324, 2002.


Links:

Abstract:

Correlation and covariance matrices provide the basis for all classical multivariate techniques. Many statistical tools exist for analyzing their structure but, surprisingly, there are few techniques for exploratory visual display, and for depicting the patterns of relations among variables in such matrices directly, particularly when the number of variables is moderately large. This article describes a set of techniques we subsume under the name "corrgram," based on two main schemes: (a) Rendering the value of a correlation to depict its sign and magnitude. We consider some of the properties of several iconic representations, in relation to the kind of task to be performed. (b) Reordering the variables in a correlation matrix so that "similar" variables are positioned adjacently, facilitating perception. In addition, the extension of this visualization to matrices for conditional independence and partial independence is described and illustrated, and we provide an easily used SAS implementation of these methods.

Bibtex:

@Article{        friendly:2002:EDCM,
  author = 	 {Michael Friendly},
  title = 	 {Corrgrams: Exploratory Displays for Correlation Matrices},
  journal = 	 {The American Statistician},
  year = 	 {2002},
  volume = 	 {56},
  number = 	 {4},
  pages = 	 {316--324},
  month = 	 {November},
}

Images:

References:

Asimov, D. (1985), "Grand Tour," SIAM Journal of Scientific and Statistical Computing, 6, 128-143.
Breiger, R. L., Boorman, A. S., and Arabie, P. (1975), "An Algorithm for Clustering Relational Data With Applications to Social Network Analysis and Comparison With Multidimensional Scaling," Journal of Mathematical Psychology, 12, 328-383.
Chambers, J. M., Cleveland, W. S., Kleiner, B., and Tukey, P. A. (1983), Graphical Methodsfor Data Analysis, Belmont, CA: Wadsworth.
Chen, C. H. (1996), "The Properties and Applications of the Convergence of Correlation Matrices," in Proceedings of the Statistical Computing Section, Alexandria, VA: American Statistical Association, pp. 49-54.
(1999), "Extensions of Generalized Association Plots (GAP)," in Proceedings of the Statistical Computing Section, Alexandria, VA: American Statistical Association, pp. 111-116.
Cleveland, W. S. (1993), Visualizing Data, Summit, NJ: Hobart Press.
Dempster, A. P. (1969), Elements of Continuous Multivariate Analysis, Reading, MA: Addison-Wesley.
Dobkins, K. R., Gunther, K. L., and Peterzell, D. H. (2000), "What Covariance Mechanisms Underlie Green/Red Equiluminance, Luminance Contrast Sensitivity and Chromatic (Green/Red) Contrast Sensitivity?" Vision Research, 40, 613-628.
Falissard, B. (1996), "A Spherical Representation of a Correlation Matrix," Journal of Classification, 13, 267-280.
(1999), "Focused Principal Component Analysis: Looking at a Correlation Matrix With a Particular Interest in a Given Variable," Journal of Computational and Graphical Statistics, 8, 906-912.
Friedman, J. (1987), "Exploratory Projection Pursuit," Journal of the American Statistical Association, 82, 249-266.
Friendly, M. (1991), SAS System for Statistical Graphics, Cary, NC: SAS Institute.
(1999), "Extending Mosaic Displays: Marginal, Conditional, and Partial Views of Categorical Data," Journal of Computational and Graphical Statistics, 8, 373-395.
Friendly, M., and Kwan, E. (in press), "Effect Ordering for Data Displays," Computational Statistics and Data Analysis, 37.
Gabriel, K. R. (1971), "The Biplot Graphic Display of Matrices With Application to Principal Components Analysis," Biometrics, 58, 453-467.
Gruvaeus, G. and Wainer, H. (1972), "Two Additions to Hierarchical Cluster Analysis," The British Journal of Mathematical and Statistical Psychology, 25, 200-206.
Hills, M. (1969), "On Looking at Large Correlation Matrices," Biometrika, 56, 249-253.
Hoaglin, D. C., and Velleman, P. F. (1994), "A Critical Look at Some Analyses of Major League Baseball Salaries," The American Statistician, 49, 277-285.
McQuitty, L. L. (1968), "Multiple Clusters, Types, and Dimensions From Iterative Iintercolumnar Correlational Analysis," Multivariate Behavioral Research, 3, 465-477.
Murdoch, D. J., and Chow, E. D. (1996), "A Graphical Display of Large Correlation Matrices," The American Statistician, 50, 178-180.
Paolini, G. V., and Santangelo, P. (1991), "An Interactive Graphic Tool to Plot the Structure of Large Sparce Matrices," IBM Journal of Research and Development, 35, 231-237.
Tukey, J. W. (1977), Exploratory Data Analysis, Reading, MA: Addison Wesley.
Whittaker, J. (1990), Graphical Models in Applied Multivariate Statistics, New York: Wiley.