The Computer Journal Advance Access published online on January 27, 2008
The Computer Journal, doi:10.1093/comjnl/bxm121
| ||||||||||||||||||||||||||||||||||||||||||||||||||||
Wallace's Approach to Unsupervised Learning: The Snob Program
1 Department of Statistics, University of Waikato, Hamilton, New Zealand
2 Department of Mathematics, University of Queensland, St. Lucia, Brisbane, Qld 4072, Australia
* Corresponding author: maj{at}stats.waikato.ac.nz
We describe the Snob program for unsupervised learning as it has evolved from its beginning in the 1960s until its present form. Snob uses the minimum message length principle expounded in Wallace and Freeman (Wallace, C.S. and Freeman, P.R. (1987) Estimation and inference by Compact coding. J. Roy. Statist. Soc. Ser. B, 49, 240–252.) and we indicate how Snob estimates class parameters using the approach of that paper. We will survey the evolution of Snob from these beginnings to the state that it has reached as described by Wallace and Dowe (Wallace, C.S. and Dowe, D.L. (2000) MMM mixture modelling of multi-state, Poisson, Von Mises Circular and Gaussian distributions. Stat. Comput., 10, 73–83.) We pay particular attention to the revision of Snob in the 1980s where definite assignment of things to classes was abandoned.
Key Words: computer program EM algorithm minimum message length mixture model cluster analysis