Publications, by date

2011

  1. Distributed Inference and Query Processing for RFID Tracking and MonitoringZhao Cao, Charles Sutton, Yanlei Diao, Prashant Shenoy. Proceedings of the VLDB Endowment (PVLDB) 4 (5). 2011.

    [ .pdf | bib ]

  2. An Introduction to Conditional Random FieldsCharles Sutton, Andrew McCallum. Foundations and Trends in Machine Learning. To appear. 2011.

    [ .pdf | bib | abstract | arXiv ]

  3. Bayesian Inference in Queueing NetworksCharles Sutton, Michael I. Jordan. Annals of Applied Statistics 5 (1). 2011.

    [ .pdf | bib | arXiv ]

  4. Quasi-Newton Markov chain Monte CarloYichuan Zhang, Charles Sutton. In Advances in Neural Information Processing Systems (NIPS). 2011.

    [ .pdf | bib ]

2010

  1. Learning and Inference in Queueing NetworksCharles Sutton, Michael I. Jordan. In Conference on Artificial Intelligence and Statistics (AISTATS). 2010. (Conference version of the longer paper "Bayesian Inference in Queueing Networks".)

    [ .pdf | bib ]

2009

  1. Automatic Exploration of Datacenter Performance RegimesPeter Bodik, Rean Griffith, Charles Sutton, Armando Fox, Michael I. Jordan, David A. Patterson. In First Workshop on Automated Control for Datacenters and Clouds (ACDC '09). 2009.

    [ .pdf | bib ]

  2. Statistical Machine Learning Makes Automatic Control Practical for Internet DatacentersPeter Bodik, Rean Griffith, Charles Sutton, Armando Fox, Michael I. Jordan, David A. Patterson. In Workshop on Hot Topics in Cloud Computing (HotCloud '09). 2009.

    [ .pdf | bib ]

  3. Capturing Data Uncertainty in High-Volume Stream ProcessingYanlei Diao, Boduo Li, Anna Liu, Liping Peng, Charles Sutton, Thanh Tran, Michael Zink. In Conference on Innovative Data Systems Research (CIDR). 2009.

    [ .pdf | bib ]

  4. Misleading learners: Co-opting your spam filterBlaine Nelson, Marco Barreno, Fuching Jack Chi, Anthony D. Joseph, Benjamin I. P. Rubinstein, Udam Saini, Charles Sutton, J. D. Tygar, Kai Xia. In Jeffrey J. P. Tsai and Philip S. Yu, editors. Machine Learning in Cyber Trust: Security, Privacy, Reliability. Springer. 2009.

    [ .pdf | bib ]

  5. Piecewise Training for Structured PredictionCharles Sutton, Andrew McCallum. Machine Learning 77 (2--3). 2009. (Train undirected graphical model by splitting into overlapping parts that are trained independently. Connections to pseudolikelihood and Bethe free energy. Journal version of UAI and ICML papers below.)

    [ .pdf | bib | abstract ]

  6. Probabilistic Inference over RFID Streams in Mobile EnvironmentsThanh Tran, Charles Sutton, Richard Cocci, Yanming Nie, Yanlei Diao, Prashant Shenoy. In International Conference on Data Engineering (ICDE). 2009.

    [ .pdf | bib ]

2008

  1. Unsupervised Deduplication using Cross-field DependenciesRobert Hall, Charles Sutton, Andrew McCallum. In Conference on Knowledge Discovery and Data Mining (KDD). 2008. (Hierarchical DP model that jointly clusters citation venue strings based on both string-edit distance and title information.)

    [ .pdf | bib | abstract ]

  2. Exploiting Machine Learning to Subvert your Spam FilterBlaine Nelson, Marco Barreno, Fuching Jack Chi, Anthony~D. Joseph, Benjamin~I.~P. Rubinstein, Udam Saini, Charles Sutton, J.~D. Tygar, Kai Xia. In Proceedings of the First USENIX Workshop on Large-Scale Exploits and Emergent Threats (LEET). 2008. (Send crafted email to a spam filter to cause it to misclassify your normal email as spam. Initial experiments on defenses to this attack.)

    [ .pdf | bib ]

  3. Probabilistic Inference in Queueing NetworksCharles Sutton, Michael I. Jordan. In Workshop on Tackling Computer Systems Problems with Machine Learning Techniques (SysML). 2008.

    [ .pdf | bib ]

  4. Probabilistic inference in queueing networksCharles Sutton, Michael I. Jordan. In Workshop on Tackling Computer Systems Problems with Machine Learning Techniques (SYSML). 2008.

    [ .pdf | bib ]

  5. Bayesian Modeling of Dependency Trees Using Hierarchical Pitman-Yor PriorsHanna Wallach, Charles Sutton, Andrew McCallum. In ICML Workshop on Prior Knowledge for Text and Language Processing. 2008. (Two Bayesian dependency parsing models: 1. Model with Pitman-Yor prior that significantly improves Eisner's classic model; 2. Latent-variable model that learns "syntactic" topics.)

    [ .pdf | bib ]

2007

  1. Response-Time Modeling for Resource Allocation and Energy-Informed SLAsPeter Bodik, Charles Sutton, Armando Fox, David Patterson, Michael I. Jordan. In NIPS Workshop on Statistical Learning Techniques for Solving Systems Problems (MLSys 07). 2007. (Quantile regression (both parametric and non-) for predicting the performance of a web service as a function of workload and power consumption. Much better for voltage control than built-in frequency scaling.)

    [ .pdf | bib ]

  2. Dynamic Conditional Random Fields: Factorized Probabilistic Models for Labeling and Segmenting Sequence DataCharles Sutton, Andrew McCallum, Khashayar Rohanimanesh. Journal of Machine Learning Research 8. 2007. (Combination of dynamic Bayesian networks and conditional random fields. Also considers latent-variable model and cascaded training. Journal version of ICML and EMNLP papers below.)

    [ .pdf | bib | abstract ]

  3. An Introduction to Conditional Random Fields for Relational LearningCharles Sutton, Andrew McCallum. In Lise Getoor and Ben Taskar, editors. Introduction to Statistical Relational Learning. MIT Press. 2007. (Detailed tutorial on conditional random fields. Includes motivation, background, mathematical foundations, linear-chain form, general-structure form, inference, parameter estimation, and tips and tricks. NOTE: In Equation (1.22), there is a small error. There should not be a summation over k in the final term, just lambda_k / sigma_2.)

    [ .pdf | bib ]

  4. Piecewise Pseudolikelihood for Efficient CRF TrainingCharles Sutton, Andrew McCallum. In International Conference on Machine Learning (ICML). 2007. (Train a large CRF in five times faster by dividing it into separate pieces and reducing numbers of predicted variable combinations with pseudolikelihood. Analysis in terms of belief propagation and Bethe energy.)

    [ .pdf | bib | abstract ]

  5. Improved Dynamic Schedules for Belief PropagationCharles Sutton, Andrew McCallum. In Conference on Uncertainty in Artificial Intelligence (UAI). 2007. (Significantly faster version of loopy BP by selecting which messages to send based on an approximation to their residual.)

    [ .pdf | bib | abstract ]

2006

  1. Sparse Forward-Backward using Minimum Divergence Beams for Fast Training of Conditional Random FieldsChris Pal, Charles Sutton, Andrew McCallum. In International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2006. (New criterion for adaptive beam size within forward-backward, suggested by a variational perspective. Works well within CRF training.)

    [ .pdf | bib | abstract ]

  2. Reducing Weight Undertraining in Structured Discriminative LearningCharles Sutton, Michael Sindelar, Andrew McCallum. In Conference on Human Language Technology and North American Association for Computational Linguistics (HLT-NAACL). 2006. (Trains multiple linear-chain CRFs with different subsets of features, in order to force dependent sets of features to be able to separately model the class label.)

    (This is the published version. An early version had an error in Section 4, under Per-Sequence Mixtures.)

    [ .pdf | bib | abstract ]

2005

  1. Joint Parsing and Semantic Role LabelingCharles Sutton, Andrew McCallum. In Conference on Natural Language Learning (CoNLL). 2005.

    [ .pdf | bib ]

  2. Piecewise Training of Undirected ModelsCharles Sutton, Andrew McCallum. In Conference on Uncertainty in Artificial Intelligence (UAI). 2005. (Train large CRF by dividing into pieces and training independently. The explanation in this paper for why it works is somewhat unsatisfying. Consult journal version (2008) for a better story.)

    [ .pdf | bib | abstract ]

  3. Composition of Conditional Random Fields for Transfer LearningCharles Sutton, Andrew McCallum. In Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT-EMNLP). 2005.

    [ .pdf | bib ]

  4. Learning in Markov Random Fields with Contrastive Free EnergiesMax Welling, Charles Sutton. In Conference on Artificial Intelligence and Statistics (AISTATS). 2005.

    [ .pdf | bib | abstract ]

2004

  1. Dynamic Conditional Random Fields: Factorized Probabilistic Models for Labeling and Segmenting Sequence DataCharles Sutton, Khashayar Rohanimanesh, Andrew McCallum. In International Conference on Machine Learning (ICML). 2004. (Combination of dynamic Bayesian networks and conditional random fields, with experiments in noun-phrase chunking.)

    [ .pdf | bib | abstract ]

  2. Conditional probabilistic context-free grammarsCharles Sutton. Synthesis project (Required for Ph.D. candidacy), University of Massachusetts, 2004.

    [ .pdf | bib ]

2003

  1. Information Theory and Representation in Associative Word LearningBrendan Burns, Charles Sutton, Clayton Morrison, Paul R. Cohen. In Third International Workshop on Epigenetic Robotics. 2003.

    [ .pdf | bib ]

  2. Very Predictive N-grams for Space-Limited Probabilistic ModelsPaul R. Cohen, Charles Sutton. In International Symposium on Intelligent Data Analysis. 2003.

    [ .pdf | bib | abstract ]

  3. Dynamic Conditional Random Fields for Jointly Labeling Multiple SequencesAndrew McCallum, Khashayar Rohanimanesh, Charles Sutton. In NIPS Workshop on Syntax, Semantics, and Statistics. 2003.

    [ .pdf | bib | abstract ]

  4. Guided Incremental Construction of Belief NetworksCharles Sutton, Brendan Burns, Clayton Morrison, Paul R. Cohen. In International Symposium on Intelligent Data Analysis. 2003.

    [ .pdf | bib | abstract ]

2002

  1. Learning Effects of Robot Actions Using Temporal AssociationsPaul R. Cohen, Charles Sutton, Brendan Burns. In International Conference on Development and Learning (ICDL). 2002.

    [ .pdf | bib | abstract ]

  2. Computers and Octi: Report from the 2001 TournamentCharles Sutton. ICGA Journal 25 (2). 2002.

    [ .pdf | bib | abstract ]

Dissertation

  1. Efficient Training Methods for Conditional Random FieldsCharles Sutton. Ph.D. Dissertation, University of Massachusetts, 2008.

    [ .pdf | bib ]

Workshop Presentations

  1. Piecewise Training with Parameter Independence Diagrams: Comparing Globally- and Locally-trained Linear-chain CRFsAndrew McCallum, Charles Sutton. In NIPS Workshop on Learning with Structured Outputs. 2004.

    [ .pdf | bib | abstract ]

  2. Collective Segmentation and Labeling of Distant Entities in Information ExtractionCharles Sutton, Andrew McCallum. In ICML Workshop on Statistical Relational Learning and Its Connections to Other Fields. 2004.

    [ .pdf | bib ]

Technical Reports

  1. Local Training and Belief PropagationCharles Sutton, Tom Minka. Microsoft Research Technical Report, TR-2006-121, 2006.

    [ .pdf | bib ]

  2. Fast, Piecewise Training for Discriminative Finite-state and Parsing ModelsCharles Sutton, Andrew McCallum. Center for Intelligent Information Retrieval Technical Report, IR-403, 2005.

    [ .pdf | bib ]

This page was automatically generated from my BibTeX file using a Ruby script and ERb HTML template. This process was far more pleasant than my previous one, which consisted of some XSLT, some Python and OCaml programs from the net, duct tape, and spackle. The code to generate this page is available (requires bibtool).