![]() |
Charles SuttonLecturer (== US Assistant Professor)School of Informatics University of Edinburgh Publications Office: IF 3.26 Voice (W): +44 (0) 131 651 5634 Skype: casutton (calls forward to my mobile) Email: csutton@inf.ed.ac.uk |
Computer systems not only process more data than ever before, but are also a source of data, in particular, data that concern their own operation. My research aims at new statistical machine learning methods designed to handle data about the operation and performance of large-scale computer systems. The ultimate goal is to improve techniques for developing, managing, and debugging computer systems. I believe that this is a rich source of applications that make fundamentally new demands on learning algorithms, encouraging the development of new machine learning methods.
Also, I maintain an interest in natural language processing, including structured prediction methods, conditional random fields, and graphical modeling approches to NLP.
Although these applications are disparate, they are connected by an underlying statistical methodology. I tend toward approaches based on probabilistic models whose parameters or structure can be estimated from data, often relying on techniques for approximate inference in graphical models.
I am part of a large machine learning group at Edinburgh.
Here is some information for prospective students in the group.
My full list of publications is available. Or you might be interested in these recent highlights:
Bayesian Inference in Queueing Networks. Charles Sutton, Michael I. Jordan. 2010. arXiv stat.ML/1001.3355
Piecewise Training for Structured Prediction. Charles Sutton, Andrew McCallum. Machine Learning 77 (2--3). 2009. (Train undirected graphical model by splitting into overlapping parts that are trained independently. Connections to pseudolikelihood and Bethe free energy. Journal version of UAI and ICML papers below.)
Probabilistic Inference in Queueing Networks. Charles Sutton, Michael I. Jordan. In Workshop on Tackling Computer Systems Problems with Machine Learning Techniques (SysML). 2008.
Unsupervised Deduplication using Cross-field Dependencies. Robert Hall, Charles Sutton, Andrew McCallum. In Conference on Knowledge Discovery and Data Mining (KDD). 2008. (Hierarchical DP model that jointly clusters citation venue strings based on both string-edit distance and title information.)
An Introduction to Conditional Random Fields for Relational Learning. Charles Sutton, Andrew McCallum. In Lise Getoor and Ben Taskar, editors. Introduction to Statistical Relational Learning. MIT Press. 2007. (Detailed tutorial on conditional random fields. Includes motivation, background, mathematical foundations, linear-chain form, general-structure form, inference, parameter estimation, and tips and tricks. NOTE: In Equation (1.22), there is a small error. There should not be a summation over k in the final term, just lambda_k / sigma_2.)
Finally, I have a collection of brief, tutorial-style research notes.
Here is a list of low-cost and open-source software that I enjoy using.