Introduction/Motivation
- Graphical Methods in Statistics (Fienberg, The American Statistician, 1979) Paper
Continuous Variables and their Distributions
- Smoothed Histograms for Frequency Data on Irregular Intervals (D Scott and W Scott, The American Statistician, 2008) Paper
- On Optimal and Data-Based Histograms (also discusses density estimate bandwidths) (D Scott, Biometrika, 1979) Paper
- Data-Based Choice of Histogram Bin Width (M.P. Wand, The American Statistician, 1997) Paper
- Averaged Shifted Histograms: Effective Nonparametric Density Estimators in Several Dimensions (D Scott, The Annals of Statistics, 1985) Paper
- Density Estimation (discussion of bandwidths and kernels) (Sheather, Statistical Science, 2004) Paper
- A Brief Survey of Bandwidth Selection for Density Estimation (Jones, Marron, Sheather, JASA, 1996) Paper
- The Box-Percentile Plot (Esty and Banfield) Paper
- Beanplot: A Boxplot Alternative for Visual Comparison of Distributions (Kampstra) Paper
- Violin Plots: A Box Plot-Density Trace Synergism (Hintze, Nelson, The American Statistician, 1998) Paper
Relationships and Joint Distributions of Two Continuous Variables
- The Many Faces of a Scatterplot (Cleveland, McGill, Journal of the American Statistical Association, 1984)
- A Suggestion for Sunflower Plots (Schilling, Watkins, The American Statistician, 1994) Paper
- Kernel Smoothing (Wand, Jones, Chapman & Hall, 1995)
- Comparison of Smoothing Parameterizations in Bivariate Kernel Density Estimation (Wand, Jones, 1993)
- Clustering Algorithms (Hartigan, Wiley, 1975) - for level set and cluster tree info
Modeling the Relationship between Two Variables
- Astrostatistics: The Final Frontier (Freeman, Richards, Schafer, Lee; Chance, 2008) Paper
- Hertzsprung-Russell Diagram (Wikipedia) Info
- Hipparcos Stars (Penn State) Info
- Applied Linear Regression Models (Kutner, Nachtsheim, Neter; McGraw Hill, 2004)
Linear Models vs. Smoothers
- Applied Linear Regression Models (Kutner, Nachtsheim, Neter; McGraw Hill, 2004)
- Robust Locally Weighted Regression and Smoothing Scatterplots (Cleveland, JASA, 1979) Paper
- LOWESS: A Program for Smoothing Scatterplots by Robust Locally Weighted Regression (Cleveland, The American Statistician, 1981)
- Linear Smoothers and Additive Models (with discussion) (Buja, Hastie, Tibshirani, Annals of Statistics, 1989) Paper
- Generalized Additive Models (Hastie, Tibshirani; Chapman and Hall, 1990)
- Statistical Models in S (Chambers, Hastie; Wadsworth & Brooks/Cole, 1992)
Multivariate Regression (Trees)
- Applied Linear Regression Models (Kutner, Nachtsheim, Neter; McGraw Hill, 2004)
- The Elements of Statistical Learning (Hastie, Tibshirani, Friedman; Springer, 2013) Book Online
- An Introduction to Statistical Learning (James, Witten, Hastie, Tibshirani; Springer, 2013) Book Online
- Classification and Regression Trees (Breiman, Friedman, Olshen, Stone; Wadsworth, 1984)
Classifiers (Trees, LDA, QDA)
- See the last three references in the previous list
- Pattern Recognition and Neural Networks (Ripley; Cambridge University Press, 1996)
Icons/Glyphs
- The Use of Faces to Represent Points in K-Dimensional Space Graphically (Chernoff, JASA, 1973) Paper
Clustering
- Clustering Algorithms (Hartigan, Wiley & Sons, 1975)
- Algorithm AS 136: A k-means clustering algorithm (Hartigan and Wong, Applied Statistics, 1979) Paper
- Least Squares Quantization in PCM (Lloyd, IEEE Trans. Information Theory, 1982) Paper
- K-means clustering: A half-century synthesis (Steinley, British Journal of Mathematical and Statistical Psychology, 2006)
- Spherical K-Means Clustering (Hornik, Feinerer, Kober, Buchta; Journal of Statistical Software, 2012) Paper
- A Comparison of Document Clustering Techniques (Steinbach, Karypis, Kumar) Paper
- On Spectral Clustering: Analysis and an Algorithm (Ng, Jordan, Weiss; NIPS, 2001) Paper
- A Random Walks View of Spectral Segmentation (Meila, Shi) Paper
- Model-Based Clustering, Discriminant Analysis, and Density Estimation (Fraley, Raftery, Journal of the American Statistical Association, 2002) Paper
- mclust Version 4 for R: Normal Mixture Modeling for Model-Based Clustering, Classification, and Density Estimation (Fraley, Raftery, Murphy, Scrucca; 2012) Paper
- Consistency of Single Linkage for High-Density Clusters (Hartigan, Journal of the American Statistical Association, 1981) Paper
- A Generalized Single Linkage Method for Estimating the Cluster Tree of a Density (Stuetzle, Nugent, Journal of Computational and Graphical Statistics; 2010) Paper