Full PDFs may be available on request.
A role for the developing lexicon in phonetic category acquisition. Naomi H. Feldman, Thomas L. Griffiths, Sharon Goldwater, James L. Morgan. . In submission.
Adding sentence types to a model of syntactic category acquisition. Stella Frank, Sharon Goldwater, Frank Keller. TopiCS in Cognitive Science. In press.
Statistical Learning, Inductive Bias, and Bayesian Inference in Language Acquisition. Lisa Pearl, Sharon Goldwater. In Jeffrey Lidz, editors. Oxford Handbook of Developmental Linguistics. Oxford University Press. In press.
[ bib ]
A summary of the 2012 JHU CLSP Workshop on Zero Resource Speech Technologies and Models of Early Language Acquisition. Aren Jansen, Emmanuel Dupoux, Sharon Goldwater, Mark Johnson, Sanjeev Khudanpur, Kenneth Church, Naomi Feldman, Hynek Hermansky, Florian Metze, Richard Rose, Mike Seltzer, Pascal Clark, Ian McGraw, Balakrishnan Varadarajan, Erin Bennett, Benjamin Borschinger, Justin Chiu, Ewan Dunbar, Abdallah Fourtassi, David Harwath, Chia-ying Lee, Keith Levin, Atta Norouzian, Vijay Peddinti, Rachel Richardson, Thomas Schatz, Samuel Thomas.
In Proceedings of ICASSP. 2013.
[
pdf |
bib
| abstract
]
Unsupervised dependency parsing with acoustic cues. John K. Pate, Sharon Goldwater.
Transactions of the Association for Computational Linguistics 1, pp. 63--74.
2013.
[
pdf |
bib
| abstract
]
Minimally-Supervised Morphological Segmentation using Adaptor Grammars. Kairit Sirts, Sharon Goldwater.
Transactions of the Association for Computational Linguistics.
2013.
(Volume/page numbers not yet assigned)
[
pdf |
bib
| abstract
]
Turning the pipeline into a loop: Iterated unsupervised dependency parsing and PoS induction. Christos Christodoulopoulos, Sharon Goldwater, Mark Steedman.
In Proceedings of the NAACL-HLT Workshop on the Induction of Linguistic Structure. 2012.
[
pdf |
bib
]
Bootstrapping a Unified Model of Lexical and Phonetic Acquisition. Micha Elsner, Sharon Goldwater, Jacob Eisenstein.
In Proceedings of the 50th Annual Meeting of the Association of Computational Linguistics. 2012.
[
pdf |
bib
| abstract
]
Semantic Parsing with Bayesian Tree Transducers. Bevan K. Jones, Mark Johnson, Sharon Goldwater.
In Proceedings of the 50th Annual Meeting of the Association of Computational Linguistics. 2012.
[
pdf |
bib
| abstract
]
A Probabilistic Model of Syntactic and Semantic Acquisition from Child-Directed Utterances and their Meanings. Tom Kwiatkowski, Sharon Goldwater, Luke Zettelmoyer, Mark Steedman.
In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics. 2012.
[
pdf |
bib
| abstract
]
A Bayesian mixture model for part-of-speech induction using multiple features. Christos Christodoulopoulos, Sharon Goldwater, Mark Steedman.
In Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2011.
[
pdf |
bib
| abstract
]
Producing power-law distributions and damping word frequencies with two-stage language models. Sharon Goldwater, Thomas L. Griffiths, Mark Johnson.
Journal of Machine Learning Research 12(Jul), pp. 2335--2382.
2011.
[
pdf |
bib
| abstract
]
Formalizing Semantic Parsing with Tree Transducers. Bevan K. Jones, Mark Johnson, Sharon Goldwater.
In Proceedings of the Australasian Language Technology Workshop. 2011.
[
pdf |
bib
| abstract
]
Lexical generalization in CCG grammar induction for semantic parsing. Tom Kwiatkowski, Luke Zettelmoyer, Sharon Goldwater, Mark Steedman.
In Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2011.
[
pdf |
bib
| abstract
]
Unsupervised extraction of recurring words from infant-directed speech. Fergus R. McInnes, Sharon Goldwater.
In Proceedings of the 33rd Annual Conference of the Cognitive Science Society. 2011.
[
pdf |
bib
| abstract
]
Predictability effects in adult-directed and infant-directed speech: Does the listener matter?. John K. Pate, Sharon Goldwater.
In Proceedings of the 33rd Annual Conference of the Cognitive Science Society. 2011.
[
pdf |
bib
| abstract
]
Unsupervised syntactic chunking with acoustic cues: Computational models for prosodic bootstrapping. John K. Pate, Sharon Goldwater.
In Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics. 2011.
(Received Best Student Paper award.)
[
pdf |
bib
| abstract
]
Two decades of unsupervised POS induction: How far have we come?. Christos Christodoulopoulos, Sharon Goldwater, Mark Steedman.
In Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2010.
(This is an updated version that corrects a minor bug in the computation of the vmb measure in Figure 1 and Table 1. Thanks to Andreas Zollmann for pointing this out.)
[
pdf |
bib
| abstract
]
Inducing tree substitution grammars. Trevor Cohn, Phil Blunsom, Sharon Goldwater.
Journal of Machine Learning Research 11(Nov), pp. 3053--3096.
2010.
[
preprint pdf |
bib
| abstract
| online journal
]
Beyond transitional probabilities: Human learners impose a parsimony bias in statistical word segmentation. Michael C. Frank, Inbal Arnon, Harry Tily, Sharon Goldwater.
In Proceedings of the 32nd Annual Conference of the Cognitive Science Society. 2010.
[
pdf |
bib
| abstract
]
Modeling human performance in statistical word segmentation. Michael C. Frank, Sharon Goldwater, Thomas L. Griffiths, Joshua B. Tenenbaum.
Cognition 117 (2), pp. 107--125.
2010.
[
preprint pdf |
bib
| abstract
| online journal
]
Using sentence type information for syntactic category acquisition. Stella Frank, Sharon Goldwater, Frank Keller.
In Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics at ACL. 2010.
[
pdf |
bib
| abstract
]
Which words are hard to recognize? Prosodic, lexical, and disfluency factors that increase speech recognition error rates. Sharon Goldwater, Daniel Jurafsky, Christopher D. Manning.
Speech Communication 52 (3), pp. 181--200.
2010.
[
preprint pdf |
bib
| abstract
| online journal
]
Inducing probabilistic CCG grammars from logical form with higher-order unification. Tom Kwiatkowski, Luke Zettelmoyer, Sharon Goldwater, Mark Steedman.
In Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2010.
[
pdf |
bib
| abstract
]
How ideal are we? Incorporating human limitations into Bayesian models of word segmentation. Lisa Pearl, Sharon Goldwater, Mark Steyvers.
In Proceedings of the 34th annual Boston University Conference on Child Language Development, pp. 315--326. 2010.
[
pdf |
bib
]
Online Learning Mechanisms for Bayesian Models of Word Segmentation. Lisa Pearl, Sharon Goldwater, Mark Steyvers.
Research on Language and Computation 8 (2), pp. 107-132.
2010.
[
preprint pdf |
bib
| abstract
| online journal
]
A note on the implementation of Hierarchical Dirichlet Processes. Phil Blunsom, Trevor Cohn, Sharon Goldwater, Mark Johnson.
In Proceedings of the 47th Annual Meeting of the Association of Computational Linguistics. 2009.
(This is an updated version with corrected pseudocode for Algorithm 1: line 16 was previously missing. Thanks to Weng Wei for pointing this out.)
[
pdf |
bib
| abstract
]
Inducing compact but accurate tree-substitution grammars. Trevor Cohn, Sharon Goldwater, Phil Blunsom.
In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 548--556. 2009.
[
pdf |
bib
| abstract
]
Evaluating models of syntactic category acquisition without using a gold standard. Stella Frank, Sharon Goldwater, Frank Keller.
In Proceedings of the 31st Annual Conference of the Cognitive Science Society. 2009.
[
pdf |
bib
| abstract
]
A Bayesian framework for word segmentation: Exploring the effects of context. Sharon Goldwater, Thomas L. Griffiths, Mark Johnson.
Cognition 112 (1), pp. 21--54.
2009.
(Results in this paper are based on a newer version of the code used in the ACL06 and BUCLD07 word segmentation papers and chapter 5 of my thesis. The new version corrects a small bug in the implementation of the bigram (HDP) model. Please cite results from this paper in future publications.)
[
preprint pdf |
bib
| abstract
| online journal
]
Improving nonparametric Bayesian inference: Experiments on unsupervised word segmentation with adaptor grammars. Mark Johnson, Sharon Goldwater.
In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2009.
[
pdf |
bib
| abstract
]
Improving morphology induction by learning spelling rules. Jason Naradowsky, Sharon Goldwater.
In Proceedings of IJCAI. 2009.
[
pdf |
bib
| abstract
]
Which words are hard to recognize? Prosodic, lexical, and disfluency factors that increase ASR error rates. Sharon Goldwater, Daniel Jurafsky, Christopher D. Manning.
In Proceedings of ACL-08: HLT. 2008.
[
pdf |
bib
| abstract
]
Modeling Human Performance in Statistical Word Segmentation. Michael C. Frank, Sharon Goldwater, Vikash Mansinghka, Thomas L. Griffiths, Joshua B. Tenenbaum.
In Proceedings of the 29th Annual Conference of the Cognitive Science Society. 2007.
[
pdf |
bib
| abstract
]
Distributional cues to word segmentation: Context is important. Sharon Goldwater, Thomas L. Griffiths, Mark Johnson.
In Proceedings of the 31st Boston University Conference on Language Development. 2007.
(If you plan to cite results from this paper, see this note.)
[
pdf |
bib
]
A Fully Bayesian Approach to Unsupervised Part-of-Speech Tagging. Sharon Goldwater, Thomas L. Griffiths.
In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics. 2007.
[
pdf |
bib
| abstract
]
Adaptor Grammars: a Framework for Specifying Compositional Nonparametric Bayesian Models. Mark Johnson, Thomas L. Griffiths, Sharon Goldwater.
In Advances in Neural Information Processing Systems 19, pp. 641--648. 2007.
(This is an updated version that fixes a typo in equation 4. Thanks to Julia Hockenmaier for pointing this out.)
[
pdf |
bib
| abstract
]
Bayesian Inference for PCFGs via Markov chain Monte Carlo. Mark Johnson, Thomas L. Griffiths, Sharon Goldwater.
In Proceedings of Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics. 2007.
[
pdf |
bib
| abstract
]
Contextual Dependencies in Unsupervised Word Segmentation. Sharon Goldwater, Thomas L. Griffiths, Mark Johnson.
In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics. 2006.
[
pdf |
bib
| abstract
]
Interpolating Between Types and Tokens by Estimating Power-Law Generators. Sharon Goldwater, Thomas L. Griffiths, Mark Johnson.
In Advances in Neural Information Processing Systems 18, pp. 459--466. 2006.
(This is a corrected version.)
[
pdf |
bib
| abstract
]
Nonparametric Bayesian Models of Lexical Acquisition. Sharon Goldwater.
Ph.D. Dissertation, Brown University, 2006.
(This is the tree-saving version of my thesis, single-spaced with minimal front matter. The official version is double-spaced and contains more front matter. If you plan to cite results on word segmentation, see this note.)
[
pdf |
bib
]
A Non-Parametric Bayesian Approach to Spike Sorting. Frank Wood, Sharon Goldwater, Michael Black.
In Proceedings of the 28th IEEE Conference on Engineering in Medicine and Biologicial Systems. 2006.
[
pdf |
bib
| abstract
]
Representational Bias in Unsupervised Learning of Syllable Structure. Sharon Goldwater, Mark Johnson.
In Proceedings of the Ninth Conference on Computational Natural Language Learning (CONLL '05). 2005.
[
pdf |
bib
| abstract
]
Improving Statistical MT Through Morphological Analysis. Sharon Goldwater, David McClosky.
In Proceedings of the Conference on Empirical Methods in Natural Language Processing. 2005.
[
pdf |
bib
| abstract
]
Priors in Bayesian Learning of Phonological Rules. Sharon Goldwater, Mark Johnson.
In Proceedings of the Seventh Meeting of the ACL Special Interest Group in Computational Phonology (SIGPHON '04). 2004.
[
pdf |
bib
| abstract
]
Statically finding errors in spreadsheets. Yanif Ahmad, Tudor Antoniu, Sharon Goldwater, Shriram Krishnamurthi.
In Proceedings of the IEEE International Conference on Software Engineering. 2003.
[
pdf |
bib
| abstract
]
Learning OT Constraint Rankings Using a Maximum Entropy Model. Sharon Goldwater, Mark Johnson.
In Proceedings of the Workshop on Variation within Optimality Theory, pp. 113--122. 2003.
[
pdf |
bib
| abstract
]
Building a Robust Dialogue System with Limited Data. Sharon Goldwater, Elizabeth Owen Bratt, Jean-Mark Gawron, John Dowding.
In Proceedings of the NAACL Workshop on Conversational Systems. 2000.
[
pdf |
bib
]
Compiling language models from a linguistically motivated unification grammar. Rayner, Manny, Hockey, Beth Ann, James, Frankie, Bratt, Elizabeth Owen, Goldwater, Sharon, Gawron, Jean Mark.
In Proceedings of the 18th Conference on Computational linguistics, Volume 2, pp. 670--676. 2000.
[
pdf |
bib
| abstract
]
Interpreting language in context in CommandTalk. John Dowding, Elizabeth Owen Bratt, Sharon Goldwater.
In Proceedings of Communicative Agents: The Use of Natural Language in Embodied Systems. 1999.
[
pdf |
bib
]
Edge-based best-first chart parsing. Eugene Charniak, Sharon Goldwater, Mark Johnson.
In Proceedings of the Sixth Workshop on Very Large Corpora at COLING-ACL. 1998.
[
pdf |
bib
]
Thanks to Charles Sutton for the scripts used to generate this page automatically from my BibTeX file. You too can download the scripts here.