Philipp Koehn

University of Edinburgh
School of Informatics
10 Crichton Street, room 4.19
Edinburgh, EH8 9AB, United Kingdom

Email: pkoehn@inf.ed.ac.uk
Tel: +44 131 650-8287
Fax: +44 131 650-6626
Web: http://www.iccs.informatics.ed.ac.uk/~pkoehn/

Research Interests

Natural Language Processing, Machine Translation, Machine Learning.
My research focuses on developing and understanding data-driven methods to solve long-standing real-world problems such as machine translation.

Education

Doctor of Philosophy, Computer Science
Department of Computer Science, University of Southern California
Thesis title: Noun Phrase Translation
Thesis advisor: Prof. Kevin Knight
December 2003

Diplom, Computer Science
Department of Computer Science, Universität Erlangen-Nürnberg
Thesis title: Statistical and Model-Based Learning Methods for Extending Unification Grammar
Thesis advisor: Prof. Günther Görz
October 1996

Master of Science, Computer Science
Department of Computer Science, University of Tennessee, Knoxville
Thesis title: Combining Genetic Algorithms and Neural Networks
Thesis advisor: Prof. Bruce MacLennan
(on Fulbright Grant)
December 1994

Professional Experience

Professor
University of Edinburgh, United Kingdom
since August 2012

Reader
University of Edinburgh, United Kingdom
August 2010 - July 2012

Lecturer
University of Edinburgh, United Kingdom
January 2005 - July 2010

Postdoctoral Associate
Massachusetts Institute of Technology, Cambridge, MA
February 2004 - January 2005

Research Assistant
USC Information Sciences Institute, Los Angeles, CA
August 1997 - December 2003

Summer Research Intern
Whizbang! Labs, Provo, UT
Under guidance of Dallan Quass
May-August 2000

Summer Manager
AT&T Labs - Research, Florham Park, NJ
Under guidance of Steve Abney, Michael Collins and Julia Hirschberg
May-August 1999

Internet Consultant
Netzmarkt, Erlangen, Germany
May 1996 - June 1997 (full time)
July 1997 - December 2000 (part time)

Teaching Assistant
Computer Science Department
University of Tennessee, Knoxville, TN
Fall 1994

Teaching Assistant
Mathematics Department
Universität Erlangen-Nürnberg, Germany
Winter 1992-1993 and Summer 1993

Consulting

Book

  1. Philipp Koehn: "Statistical Machine Translation", textbook, Cambridge University Press, 2009.

Conference Papers

  1. Philipp Koehn and Barry Haddow: "Interpolated Backoff for Factored Translation Models", Meeting of the Association for Machine Translation in the Americas (AMTA), 2012.
  2. Kenneth Heafield, Philipp Koehn and Alon Lavie: "Language Model Rest Costs and Space-Efficient Storage", Proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2012.
  3. Kenji Sagae, Maider Lehr, Emily Prud'hommeaux, Puyang Xu, Nathan Glenn, Damianos Karakos, Sanjeev Khudanpur, Brian Roark, Murat Saraçlar, Izhak Shafran, Daniel Bikel, Chris Callison-Burch, Yuan Cao, Keith Hall, Eva Hasler, Philipp Koehn, Adam Lopez, Matt Post, Darcey Riley: "Hallucinated N-Best Lists for Discriminative Language Modeling", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012.
  4. Puyang Xu, Sanjeev Khudanpur, Maider Lehr, Emily Prud'hommeaux, Nathan Glenn, Damianos Karakos, Brian Roark, Kenji Sagae, Murat Saraçlar, Izhak Shafran, Dan Bikel, Chris Callison-Burch, Yuan Cao, Keith Hall, Eva Hasler, Philipp Koehn, Adam Lopez, Matt Post, Darcey Riley: "Continuous Space Discriminative Language Modeling", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012.
  5. Arda Çelebi, Haçim Sak, Erinç Dikici, Murat Saraçlar, Maider Lehr, Emily Tucker Prud'hommeaux, Puyang Xu, Nathan Glenn, Damianos Karakos, Sanjeev Khudanpur, Brian Roark, Kenji Sagae, Izhak Shafran, Dan Bikel, Chris Callison-Burch, Yuan Cao, Keith Hall, Eva Hasler, Philipp Koehn, Adam Lopez, Matt Post, Darcey Riley: "Semi-Supervised Discriminative Language Modeling for Turkish ASR", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2012.
  6. Damianos Karakos, Brian Roark, Izhak Shafran, Kenji Sagae, Maider Lehr, Emily Prud'hommeaux, Puyang Xu, Nathan Glenn, Sanjeev Khudanpur, Murat Saraçlar, Dan Bikel, Mark Dredze, Chris Callison-Burch, Yuan Cao, Keith Hall, Eva Hasler, Philipp Koehn, Adam Lopez, Matt Post and Darcey Riley: "Deriving conversation-based features from unlabeled speech for discriminative language modeling", InterSpeech, 2012.
  7. Yang Gao, Philipp Koehn and Alexandra Birch: "Soft Dependency Constraints for Reordering in Hierarchical Phrase-Based Translation", Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2011.
  8. Philipp Koehn: "What is a better translation? Reflections on six years of running evaluation campaigns", Tralogy, 2011.
  9. Philipp Koehn and Jean Senellart: "Fast Approximate String Matching with Suffix Arrays and A* Parsing", Meeting of the Association for Machine Translation in the Americas (AMTA), 2010.
  10. Philipp Koehn: "Enabling Monolingual Translators: Post-Editing vs. Options", NAACL, 2010.
  11. Philipp Koehn, Alexandra Birch and Ralf Steinberger: "462 Machine Translation Systems for Europe", Machine Translation Summit XII, p. 65-72, 2009.
  12. Philipp Koehn and Barry Haddow: "Interactive Assistance to Human Translators using Statistical Machine Translation Methods", Machine Translation Summit XII, p. 73-80, 2009.
  13. Loïc Dugast, Jean Senellart and Philipp Koehn: "Selective addition of corpus-extracted phrasal lexical rules to a rule-based machine translation system", Machine Translation Summit XII, p. 222-229, 2009.
  14. Philipp Koehn: "A Web-Based Interactive Computer Aided Translation Tool", ACL Software demonstration session, 2009.
  15. Abhishek Arun, Chris Dyer, Barry Haddow, Phil Blunsom, Adam Lopez, and Philipp Koehn: "Monte Carlo inference and maximization for phrase-based translation", Conference on Natural Language Learning, 2009.
  16. Josh Schroeder, Trevor Cohn and Philipp Koehn: "Word Lattices for Multi-Source Translation", Meeting of the European Chapter of the Association for Computational Linguistics (EACL), 2009.
  17. Hieu Hoang and Philipp Koehn: "Improving Mid-Range Re-Ordering using Templates of Factors", Meeting of the European Chapter of the Association for Computational Linguistics (EACL), 2009.
  18. Holger Schwenk and Philipp Koehn: "Large and Diverse Language Models for Statistical Machine Translation", International Joint Conference on Natural Language Processing (IJCNLP), p. 661-666, 2008.
  19. Eleftherios Avramidis and Philipp Koehn: "Enriching Morphologically Poor Languages for Statistical Machine Translation", 46th Annual Meeting of the Association for Computational Linguistics (ACL), p. 763-770, 2008.
  20. Alexandra Birch, Miles Osborne and Philipp Koehn: "Predicting Success in Machine Translation", Conference on Empirical Methods in Natural Language Processing (EMNLP), p. 745-749, 2008.
  21. Abhishek Arun and Philipp Koehn: "Online Learning Methods For Discriminative Training of Phrase Based Statistical Machine Translation", Machine Translation Summit XI, p. 15-20, 2007.
  22. Philipp Koehn and Hieu Hoang: "Factored Translation Models", Conference on Empirical Methods in Natural Language Processing (EMNLP), p. 868-876, 2007.
  23. Chao Wang, Michael Collins and Philipp Koehn: "Chinese Syntactic Reordering for Statistical Machine Translation", Conference on Empirical Methods in Natural Language Processing (EMNLP), p. 737-745, 2007.
  24. Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, Evan Herbst: "Moses: Open Source Toolkit for Statistical Machine Translation", ACL demonstration session, p. 177-180, 2007.
  25. Chris Callison-Burch, Philipp Koehn and Miles Osborne: "Improved Statistical Machine Translation Using Paraphrases", Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL), p. 17-24, 2006.
  26. Chris Callison-Burch, Miles Osborne and Philipp Koehn: "Re-evaluating the Role of Bleu in Machine Translation Research", Meeting of the European Chapter of the Association for Computational Linguistics (EACL), p. 249-256, 2006.
  27. Michael Collins, Philipp Koehn, and Ivona Kucerova: "Clause Restructuring for Statistical Machine Translation", 43rd Annual Meeting of the Association for Computational Linguistics (ACL), p. 531-540, 2005.
  28. Philipp Koehn: "Europarl: A Parallel Corpus for Statistical Machine Translation", Machine Translation Summit X, p. 79-86, 2005.
  29. Philipp Koehn: "Statistical Significance Tests for Machine Translation Evaluation", Conference on Empirical Methods in Natural Language Processing (EMNLP), p. 388-395, 2004.
  30. Philipp Koehn: "Pharaoh: a Beam Search Decoder for Phrase-Based Statistical Machine Translation Models", Meeting of the Association for Machine Translation in the Americas (AMTA), p. 115-124, 2004.
  31. Philipp Koehn and Kevin Knight: "Feature-Rich Statistical Translation of Noun Phrases", 41st Annual Meeting of the Association for Computational Linguistics (ACL), p. 311-318, 2003.
  32. Philipp Koehn and Kevin Knight: "Empirical Methods for Compound Splitting", 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL), p. 187-193, 2003.
  33. Philipp Koehn, Franz Josef Och, and Daniel Marcu: "Statistical Phrase-Based Translation", Human Language Technology Conference and Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), p. 127-133, 2003.
  34. Douglas W. Oard, David Doermann, Bonnie Dorr, Daqing He, Philip Resnik, Amy Weinberg, William Byrne, Sanjeev Khudanpur, David Yarowsky, Anton Leuski, Philipp Koehn, and Kevin Knight: "Desperately Seeking Cebuano", Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), p. 76-78, 2003.
  35. Philipp Koehn: "Combining Multiclass Maximum Entropy Text Classifiers with Neural Network Voting", Portugal for Natural Language Processing, Third International Conference (PorTAL), p. 125-136, 2002.
  36. Philipp Koehn and Kevin Knight: "Knowledge Sources for Word-Level Translation Models", Conference on Empirical Methods in Natural Language Processing (EMNLP), p. 27-35, 2001.
  37. Philipp Koehn and Kevin Knight: "Estimating Word Translation Probabilities from Unrelated Monolingual Corpora Using the EM Algorithm", Seventeenth National Conference on Artificial Intelligence (AAAI), p. 711-715, 2000.
  38. Yaser Al-Onaizan, Ulrich Germann, Ulf Hermjakob, Kevin Knight, Philipp Koehn, Daniel Marcu, and Kenji Yamada: "Translating with Scarce Resources", Seventeenth National Conference on Artificial Intelligence (AAAI), p. 672-678, 2000.
  39. Philipp Koehn, Steven Abney, Julia Hirschberg, and Michael Collins: "Improving Intonational Phrasing with Syntactic Information", International Conference on Acoustics, Speech, and Signal Processing (ICASSP), p. 1289-1290, 2000.
  40. Philipp Koehn: "Genetic Encoding Strategies for Neural Networks", Sixth International Conference on Information Processing and Management of Uncertainty (IPMU), p. 947-950, 1996.

Journal Articles

  1. Wenduan Xu and Philipp Koehn: "Extending Hiero Decoding in Moses with Cube Growing", The Prague Bulletin of Mathematical Linguistics, Volume 98, pages 133-142, October 2012.
  2. Eva Hasler, Barry Haddow and Philipp Koehn: "Margin Infused Relaxed Algorithm for Moses", The Prague Bulletin of Mathematical Linguistics, Volume 96, pages 69-78, October 2011.
  3. Abhishek Arun, Barry Haddow, Philipp Koehn, Adam Lopez, Chris Dyer and Miles Osborne: "Monte Carlo Techniques for Phrase-Based Translation", Machine Translation Journal, Volume 24, Number 2, pages 103-121, 2010.
  4. Philipp Koehn: "An Experimental Management System", The Prague Bulletin of Mathematical Linguistics, number 94, pages 87-96, September 2010.
  5. Philipp Koehn: "A Process Study of Computer Aided Translation", Machine Translation, volume 23, number 4, pages 241-263, 2009.
  6. Philipp Koehn: "Review of Cyril Goutte, Nicola Cancedda, Marc Dymetman, and George Foster (eds): Learning machine translation", Machine Translation Journal, volume 23, number 4, pages 269-271, 2009.
  7. Yaser Al-Onaizan, Ulrich Germann, Ulf Hermjakob, Kevin Knight, Philipp Koehn, Daniel Marcu, and Kenji Yamada: "Translation with Scarce Bilingual Resources", Machine Translation 17: 1-17, 2002.

Book Chapters

  1. Philipp Koehn and Hieu Hoang: "Factored Translation Models", Chapter in Handbook of Natural Language Processing and Machine Translation, editors Olive, Christianson, and McCary, Springer, 2011.
  2. Philipp Koehn: "Machine Translation", Chapter in Multilingual Natural Language Processing Applications: From Theory to Practice, editors Imed Zitouni and Daniel M. Bikel, IBM Press, 2012.

Refereed Workshop Papers

  1. Philipp Koehn: "Simulating Human Judgment in Machine Translation Evaluation Campaigns", International Workshop on Spoken Language Translation (IWSLT), 2012.
  2. Eva Hasler, Barry Haddow and Philipp Koehn: "Sparse Lexicalised Features and Topic Adaptation for SMT", International Workshop on Spoken Language Translation (IWSLT), 2012.
  3. Eva Hasler, Peter Bell, Arnab Ghoshal, Barry Haddow, Philipp Koehn, Fergus McInnes, Steve Renals and Pawel Swietojanski: "The UEDIN Systems for the IWSLT 2012 Evaluation", International Workshop on Spoken Language Translation (IWSLT), 2012.
  4. Philipp Koehn and Barry Haddow: "Towards Effective Use of Training Data in Statistical Machine Translation", Proceedings of the Seventh Workshop on Statistical Machine Translation (WMT), 2012.
  5. Philip Williams and Philipp Koehn: "GHKM Rule Extraction and Scope-3 Parsing in Moses", Proceedings of the Seventh Workshop on Statistical Machine Translation (WMT), 2012.
  6. Barry Haddow and Philipp Koehn: "Analysing the Effect of Out-of-Domain Data on SMT Systems", Proceedings of the Seventh Workshop on Statistical Machine Translation (WMT), 2012.
  7. Kenneth Heafield, Hieu Hoang, Philipp Koehn, Tetsuo Kiso and Marcello Federico: "Left Language Model State for Syntactic Machine Translation", International Workshop on Spoken Language Translation (IWSLT), 2011.
  8. Philip Williams and Philipp Koehn: "Agreement Constraints for Statistical Machine Translation into German", Proceedings of the Sixth Workshop on Statistical Machine Translation (WMT), 2011.
  9. Barry Haddow, Abhishek Arun and Philipp Koehn: "SampleRank Training for Phrase-Based Machine Translation", Proceedings of the Sixth Workshop on Statistical Machine Translation (WMT), 2011.
  10. Philipp Koehn and Jean Senellart: "Convergence of Translation Memory and Statistical Machine Translation", AMTA Workshop on MT Research and the Translation Industry, 2010.
  11. Philipp Koehn, Barry Haddow, Philip Williams and Hieu Hoang: "More Linguistic Annotation for Statistical Machine Translation", Fifth ACL Workshop on Statistical Machine Translation and MetricsMATR, 2010.
  12. Ronan Le Nagard and Philipp Koehn: "Aiding Pronoun Translation with Co-Reference Resolution", Fifth ACL Workshop on Statistical Machine Translation and MetricsMATR, 2010.
  13. Abhishek Arun, Barry Haddow and Philipp Koehn: "A Unified Approach to Minimum Risk Training and Decoding", Fifth ACL Workshop on Statistical Machine Translation and MetricsMATR, 2010.
  14. Hieu Hoang and Philipp Koehn: "Improved Translation with Source Syntax Labels", Fifth ACL Workshop on Statistical Machine Translation and MetricsMATR, 2010.
  15. Hieu Hoang, Philipp Koehn and Adam Lopez: "A uniform framework for phrase-based, hierarchical and syntax-based machine translation", International Workshop on Spoken Language Translation (IWSLT), 2009.
  16. Loïc Dugast, Jean Senellart and Philipp Koehn: "Statistical Post Editing and Dictionary Extraction: Systran/Edinburgh submissions for ACL-WMT2009", EACL Workshop on Statistical Machine Translation, 2009.
  17. Philipp Koehn and Barry Haddow: "Edinburgh’s Submission to all Tracks of the WMT2009 Shared Task with Reordering and Speed Improvements to Moses", EACL Workshop on Statistical Machine Translation, 2009.
  18. Michael Auli, Adam Lopez, Hieu Hoang and Philipp Koehn: "A Systematic Analysis of Translation Model Search Spaces", EACL Workshop on Statistical Machine Translation, 2009.
  19. Hieu Hoang and Philipp Koehn: "Design of the Moses Decoder for Statistical Machine Translation", ACL Workshop on Software engineering, testing, and quality assurance for NLP, 2008.
  20. Loic Dugast, Jean Senellart and Philipp Koehn: "Can we Relearn an RBMT System?", ACL Workshop on Statistical Machine Translation, 2008.
  21. Philipp Koehn, Abhishek Arun and Hieu Hoang: "Towards better Machine Translation Quality for the German-English Language Pairs", ACL Workshop on Statistical Machine Translation, 2008.
  22. Philipp Koehn and Josh Schroeder: "Experiments in Domain Adaptation for Statistical Machine Translation", ACL Workshop on Statistical Machine Translation, 2007.
  23. Alexandra Birch, Miles Osborne and Philipp Koehn: "CCG Supertags in Factored Statistical Machine Translation", ACL Workshop on Statistical Machine Translation, 2007.
  24. Loic Dugast, Jean Senellart, Michel Simard and Philipp Koehn: "Statistical Post-Edition on SYSTRAN Rule-Based Translation System", ACL Workshop on Statistical Machine Translation, 2007.
  25. Josh Schroeder and Philipp Koehn: "The University of Edinburgh System Description for IWSLT 2007", International Workshop on Spoken Language Translation (IWSLT), 2007.
  26. Alexandra Birch, Chris Callison-Burch, Miles Osborne and Philipp Koehn: "Constraining the Phrase-Based, Joint Probability Statistical Translation Model", NAACL Workshop on Statistical Machine Translation, 2006.
  27. Abhishek Arun, Amittai Axelrod, and Philipp Koehn: "Edinburgh System Description for the 2006 TC-STAR Spoken Language Translation Evaluation", TC-Star Evaluation Workshop, 2006.
  28. Philipp Koehn, Amittai Axelrod, Alexandra Birch Mayne, Chris Callison-Burch, Miles Osborne and David Talbot: "Edinburgh System Description for the 2005 IWSLT Speech Translation Evaluation", International Workshop on Spoken Language Translation (IWSLT), pages 78-85, 2005.
  29. Philipp Koehn and Kevin Knight: "Learning a Translation Lexicon from Monolingual Corpora", 40th Annual Meeting of the Association for Computational Linguistics (ACL), Workshop on Unsupervised Lexical Acquisition, 2002.

Other Workshop Papers

  1. Chris Callison-Burch, Philipp Koehn, Christof Monz, Matt Post, Radu Soricut and Lucia Specia: "Findings of the 2012 Workshop on Statistical Machine Translation", Proceedings of the Seventh Workshop on Statistical Machine Translation (WMT), 2012.
  2. Chris Callison-Burch, Philipp Koehn, Christof Monz and Omar Zaidan: "Findings of the 2011 Workshop on Statistical Machine Translation", Proceedings of the Sixth Workshop on Statistical Machine Translation (WMT), 2011.
  3. Chris Callison-Burch, Philipp Koehn, Christof Monz, Kay Peterson, Mark Przybocki and Omar Zaidan: "Findings of the 2010 Joint Workshop on Statistical Machine Translation and Metrics for Machine Translation", Fifth ACL Workshop on Statistical Machine Translation and MetricsMATR, 2010.
  4. Chris Callison-Burch, Philipp Koehn, Christof Monz and Josh Schroeder: "Findings of the 2009 Workshop on Statistical Machine Translation", EACL Workshop on Statistical Machine Translation, 2009.
  5. Chris Callison-Burch, Cameron Fordyce, Philipp Koehn, Christof Monz and Josh Schroeder: "Further Meta-Evaluation of Machine Translation", ACL Workshop on Statistical Machine Translation, 2008.
  6. Philipp Koehn, Josh Schroeder and Miles Osborne: "Edinburgh University System Description for the 2008 NIST Machine Translation Evaluation", NIST MT Evaluation Meeting, 2008.
  7. Chris Callison-Burch, Cameron Fordyce, Philipp Koehn, Christof Monz and Josh Schroeder: "(Meta-) Evaluation of Machine Translation", ACL Workshop on Statistical Machine Translation, 2007.
  8. Philipp Koehn and Christof Monz: "Manual and Automatic Evaluation of Machine Translation between European Languages", NAACL Workshop on Statistical Machine Translation, 2006.
  9. Philipp Koehn and Christof Monz: "Shared Task: Statistical Machine Translation between European Languages", ACL Workshop on Parallel Text, 2005.
  10. Philipp Koehn, Amittai Axelrod, Alexandra Birch Mayne, Chris Callison-Burch, Miles Osborne, David Talbot, and Michael White: "Edinburgh System Description for the 2005 NIST MT Evaluation", DARPA/NIST Machine Translation Evaluation Workshop, 2005.

Patents

  1. Dragos Munteanu, Daniel Marcu and Philipp Koehn: "Building a translation lexicon from comparable, non-parallel corpora", United States Patent 8234106, awarded 2012.
  2. Philipp Koehn: "Empirical Methods for Splitting Compound Words with Application to Machine Translation", United States Patent 7711545, awarded 2010.
  3. Philipp Koehn and Kevin Knight: "Statistical Machine Translation", United States Patent 7624005, awarded 2009.
  4. Daniel Marcu, Kevin Knight, Dragos Munteanu, Philipp Koehn: "Constructing a translation lexicon from comparable, non-parallel corpora", United States Patent 7620538, awarded 2009.
  5. Daniel Marcu, Kevin Knight, William Wong and Philipp Koehn: "Phrase-Based Joint Probability Model for Statistical Machine Translation", United States Patent 7454326, awarded 2008; European Patent EP1488338B1, awarded 2010.

Tutorials

  1. Philipp Koehn and Hieu Hoang: "Open Source Statistical Machine Translation", Conference of the Association for Machine Translation in the Americas (AMTA), 2012.
  2. Philipp Koehn and Hieu Hoang: "Statistical Machine Translation with Open Source Software", Conference of the Association for Machine Translation in the Americas (AMTA), 2010.
  3. Kevin Knight and Philipp Koehn: "Topics in Statistical Machine Translation", Meeting of the Association for Computational Linguistics (ACL), 2009.
  4. Philipp Koehn et al.: "Introduction to Statistical Machine Translation", MT Marathon, EuroMatrix project, 2007, 2008, 2009.
  5. Philipp Koehn and Dennis Mehay: "Introduction to Statistical Machine Translation", Conference of the Association for Machine Translation in the Americas (AMTA), 2008.
  6. Chris Callison-Burch and Philipp Koehn: "Statistical Machine Translation", European Summer School for Logic, Language and Information (ESSLLI), 2008.
  7. Philipp Koehn: "AERFAI Summer School Phrase-Based and Factored Statistical Machine Translation", AERFAI Summer School, Basque Country, Spain, 2008.
  8. Philipp Koehn and Ashish Venugopal: "Statistical Machine Translation", Nordic Doctoral School, Tartu, Estonia, 2007.
  9. Kevin Knight and Philipp Koehn: "Statistical Machine Translation", Machine Translation Summit, 2007.
  10. Philipp Koehn, Kevin Knight, Philip Resnik, and Lauri Gerber: "Statistical Machine Translation", Stanford Linguistic Summer School, 2007.
  11. Philipp Koehn: "Statistical Machine Translation: The Basic, the Novel, and the Speculative", Meeting of the European Chapter of the Association for Computational Linguistics (EACL), 2006.
  12. Chris Callison-Burch and Philipp Koehn: "Introduction to Statistical Machine Translation", European Summer School for Logic, Language and Information (ESSLLI), 2005.
  13. Kevin Knight and Philipp Koehn: "What's New in Statistical Machine Translation?", Human Language Technology Conference and Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), 2003.
  14. Kevin Knight and Philipp Koehn: "Introduction to Statistical Machine Translation", Machine Translation Summit IX, 2003; Human Language Technology Conference and Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), 2004; Conference of the Association for Machine Translation in the Americas (AMTA), 2004.

Invited Talks

  1. "Open Problems in Statistical Machine Translation", University of California at San Diego, 2012
  2. "Enabling Monolingual Translation", AMTA Workshop on Monolingual Machine Translation (MONOMT), 2012
  3. "Hybrid Machine Translation? It's complicated...", EACL ESIRMT-HyTra Workshop, 2012
  4. "Statistical Machine Translation and Language Barriers", WIPO Symposium of IP Authorities, Geneva, Switzerland, 2011
  5. "What is a better translation? Reflections on six years of running evaluation campaigns", Tralogy, Paris, France, 2011
  6. "Factored Models for Morphology", Workshop on Machine Translation and Morphologically-rich Languages, Haifa, Israel, 2011
  7. "New Frontiers in Statistical Machine Translation", IEEE Workshop on Spoken Language Technology, Berkeley, USA, 2010
  8. "EuroMatrixPlus - Bringing Machine Translation to the User", Conference of the Association for Machine Translation in the Americas, Denver, USA, 2010
  9. "Computer Aided Translation", NSF Workshop on Crowdsourcing and Translation, College Park, Maryland, USA, 2010
  10. "Human Translation and Machine Translation", Translation Forum: "Translatio ex machina", Luxembourg, 2010
  11. "EuroMatrixPlus: Bringing Machine Translation to the User", Language Technology Days, Luxembourg, 2010
  12. "Human Translation and Machine Translation", International Workshop on Spoken Language Translation, Tokyo, Japan, 2009
  13. "Learning from the Web: The Wealth of Data and its Consequences", JOIN Meeting, University of Minho, Braga, Portugal, 2009
  14. "Human Translation and Machine Translation", University of California at Berkeley, 2009
  15. "Linguistic Problems for Statistical Machine Translation", Stanford University, 2009
  16. "20 Years of Statistical Machine Translation", Translingual Europe, Prague, 2009
  17. "State of the Art in SMT and SMT Training", TAUS Executive Forum, Edinburgh, 2009
  18. "State of the Art in Statistical Machine Translation", workshop with EU translation services, Luxembourg, 2009
  19. "Statistical Machine Translation at the University of Edinburgh", Microsoft Research Asia, Beijing, 2008
  20. "Introduction to Statistical Machine Translation", Chinese Workshop for Machine Translation, 2008
  21. "The German Challenge to Statistical Machine Translation", Polytechnic University of Catalonia, Barcelona; Dublin City University, 2008
  22. "The Emerging Role of Grammars in Statistical Machine Translation", Workshop on Formal Grammars, Hamburg, 2008
  23. "State of the Art in Statistical Machine Translation", Systran, 2008
  24. "Statistical Machine Translation - Where are we now?", SMART project meeting, Bristol, United Kingdom, 2008
  25. "Results of the EuroMatrix Shared Task in Machine Translation", Translingual Europe, Berlin, 2008
  26. "State of the Art in Statistical Machine Translation", Translingual Europe, Berlin, 2008
  27. "Open Source Tools for Statistical Machine Translation", LinguaTech, Rome, Italy, 2008
  28. "Moving towards Linguistically Grounded Statistical Machine Translation with an Open Source Research Environment", MT Symposium, Tokyo, Japan, 2008
  29. "Moses: Moving Open Source MT towards Linguistically Richer Models", Workshop on Mixing Approaches to Machine Translation (MAMT), 2008
  30. "Statistical Machine Translation with an Eye on Linguistics", NLP Winter School 2008, Hyderabad, India, 2008
  31. "Linguistic Problems for Statistical Machine Translation", University of Amsterdam, 2007
  32. "Statistical Machine Translation - Where are we now?", Research Triangle Institute, North Carolina, USA, 2007
  33. "Machine Translation at Edinburgh - Factored Translation Models and Discriminative Training", SRI, San Jose, United States, 2007
  34. "EuroMatrix - Machine Translation for all European Languages", Machine Translation Summit, 2007
  35. "Evaluating Evaluation - Lessons from the WMT 2007 Shared Task", Machine Translation Summit Workshop on Machine Translation Evaluation, 2007
  36. "Statistical Machine Translation: What is Possible Now?", Kasetsart University, Bangkok, Thailand, 2006
  37. "EuroMatrix - Machine Translation for all EU Languages", IST, Helsinki, Finland, 2006
  38. "Statistical Machine Translation", CLSP Summer School, Johns Hopkins University, Baltimore, 2006
  39. "The Wonders of Statistical Machine Translation", University of Sheffield; University of Oxford; Systran, 2006
  40. "Challenges in Statistical Machine Translation", Google; University of Edinburgh, 2004
  41. "Advances in Statistical Machine Translation: Phrases, Noun Phrases and Beyond", IBM; University of Maryland; Massachusetts Institute of Technology, 2003
  42. "Estimating Word Translation Probabilities from Unrelated Monolingual Corpora Using the EM Algorithm", RWTH Aachen, 2000

Panel

AMTA Workshop on MT Research and the Translation Industry, 2010
"New Developments in Localization R&D", Localization and Translation Thailand, Bangkok, Thailand, 2009
Workshop on Patent Translation, Machine Translation Summit, 2009
Translingual Europe on "Will there Be a Winner? Competition of the MT Paradigms", 2009
EACL Student Research Workshop, 2009
Translingual Europe, 2008
International Workshop on Spoken Language Translation (IWSLT), chair, 2007
MT Summit Panel on Future trends in MT, 2007
IST Panel on "Multilinguism and Language Technology: a Challenge for Europe", 2006
AMTA Panel on Hybrid MT, 2006
ACL Student Research Workshop, 2005
AMTA Panel on MT Evaluation, 2004

Reviewing

Computational Linguistics Journal, 2006, 2007, 2009, 2010, 2011, 2012
ACM Transactions on Speech and Language Processing, 2012
Communications of the ACM, 2012
Journal of Machine Learning Research (JMLR), 2011
Journal of Intelligent Information Systems (JIIS), 2010
Journal of Computer Science and Technology, China, 2009, 2010
Computer Speech and Language Journal, Elsevier, 2006, 2008, 2010, 2011, 2012
Machine Translation Journal, 2007, 2008, 2009, 2010, 2011, 2012
Pattern Recognition Journal, 2011
Machine Learning Journal, 2011
IEEE Transactions on Audio, Speech and Language Processing, 2004, 2007, 2009, 2012
Transactions on Asian Language Information Processing, 2010
Journal on Natural Language Engineering, 2007, 2008, 2011

Annual Conference of the Association for Computational Linguistics (ACL), 2005, 2006, 2007, 2008, 2010, 2011, 2012
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2004, 2005, 2007, 2010, 2011, 2012
Conference of the American Association for Artificial Intelligence (AAAI), 2008
Computational Linguistics Conference (Coling), 2004, 2006, 2008, 2010
International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), 2012
International Conference on Natural Language Processing (ICON), 2009
International Joint Conference on Artificial Intelligence (IJCAI), 2006
International Joint Conference on Natural Language Processing (IJCNLP), 2003, 2011
International Conference on Asian Language Processing (IALP), 2011
Conference of the Spanish Association for Artificial Intelligence (CAEPIA), 2011
Artificial Intelligence and Simulation of Behaviour Conference (AISB), 2011
Conference on Human Language Technology and North American Chapter of the Association for Computational Linguistics Meeting (HLT/NAACL), 2006, 2007, 2012
Conference on Natural Language Learning (CoNLL), 2008, 2010, 2011
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2006, 2009, 2012
Machine Translation Summit, 2007, 2009, 2011
Conference of the Association for Machine Translation in the Americas (AMTA), 2006, 2008, 2010, 2012
Conference of the European Association for Machine Translation (EAMT), 2009, 2010, 2011
Iberian Conference on Pattern Recognition and Image Analysis, 2009
Information Retrieval Facility Conference (IRFC), 2010
Theoretical and Methodological Issues in Machine Translation (TMI), 2007
Language Resources and Evaluation Conference (LREC), 2008, 2010, 2012
International Conference on Machine Learning (ICML), 2008
European Conference on Artificial Intelligence (ECAI), 2008, 2010

International Workshop on Spoken Language Translation (IWSLT), 2005, 2006, 2007, 2009, 2010, 2011
Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST), 2011, 2012
Workshop on Monolingual Machine Translation (MONOMT), 2012
Workshop on Reordering for Statistical Machine Translation, 2012
Workshop on Creating Cross-language Resources for Disconnected Languages and Styles (CREDISLAS), 2012
International Workshop on Free/Open-source Rule-based Machine Translation (FreeRBMT), 2012
Symposium on Languages, Applications and Technologies (SLATE), 2012
Workshop on Algorithms and Resources for Modelling of Dialects and Language Varieties, 2011
International Workshop on Natural Language Processing and Cognitive Science (NLPCS), 2011
Workshop on Using Linguistic Information for Hybrid Machine Translation (LIHMT), 2011
Workshop on Domain Adaptation for Natural Language Processing (DANLP), 2010
TC-STAR Workshop on Speech-to-Speech Translation, 2006
European Summer School for Logic, Language and Information (ESSLLI), 2006, 2007
ACL Student Research Workshop, 2005, 2006, 2007
HLT/NAACL Workshop on Joint Inference for Natural Language Processing, 2006
HLT/NAACL Student Research Workshop, 2009
ACL Workshop on Evaluation Measures for MT and Summarization, 2005
Workshop on Mixing Approaches to Machine Translation (MAMT), 2008

Chairing

ACL Special Interest Group on Machine Translation (SIGMT), president, since 2009
ACL Special Interest Group on Linguistic Data and Corpus-Based Approaches to NLP (SIGDAT), vice president elect, 2012
Conference on Empirical Methods in Natural Language Processing (EMNLP), general chair, 2009
International Joint Conference on Artificial Intelligence (IJCAI), area co-chair natural language processing, 2013
Annual Meeting of the Association for Computational Linguistics (ACL-IJCNLP), area co-chair machine translation, 2009
Conference on Human Language Technology and North American Chapter of the Association for Computational Linguistics Meeting (HLT/NAACL), area chair machine translation, 2007, 2009, 2010
ACL Workshop on Statistical Machine Translation, co-chair, 2006, 2007, 2008, 2009, 2010, 2011, 2012
AMTA Workshop on Bringing MT to the User, co-chair, 2010
Machine Translation Marathon, chair, 2007, 2008, 2009, 2010, 2011, 2012
Joint EuroMatrixPlus/CNGL Workshop (JEC), co-chair/senior program committee, 2009-2011
International Joint Conference on Natural Language Processing (IJCNLP), area chair machine translation, 2008
Computational Linguistics Conference (Coling), area chair machine translation, 2012
Conference on Natural Language Processing (KONVENS), area chair machine translation, 2012
Computational Linguistics Conference (Coling), tutorial chair, 2008
Conference on Empirical Methods in Natural Language Processing (EMNLP), area chair machine translation, 2006
ACL Workshop on Parallel Text, co-chair, 2005
Association for Computational Linguistics (ACL), sponsorship co-chair, 2009
Annual Meeting of the Association for Computational Linguistics (ACL-IJCNLP), exhibition co-chair, 2009
Annual Meeting of the Association for Computational Linguistics (ACL), publication chair, 2005, 2010

Editorial Board

Computational Linguistics, 2010-2012
Machine Translation Journal, since 2007
ACM Transactions on Asian Language Information Processing, 2010-2013
Translation: Computation, Corpora, Cognition, since 2011
Transactions of the Association for Computational Linguistics, action editor, since 2012
The Prague Bulletin of Mathematical Linguistics, since 2010
ACM Transactions on Asian Language Information Processing, guest editor, 2008
The Prague Bulletin of Mathematical Linguistics, guest editor, 2009

PhD Students

Hieu Hoang: "Factored Translation Models", graduated 2010
Abhishek Arun: "Machine Learning Methods for Statistical Machine Translation", graduated 2010

Loic Dugast: "Hybrid Machine Translation", since 2006, expected graduation 2012
Michael Auli, since 2008, expected graduation 2011
Philip Williams, since 2009, expected graduation 2012
Eva Hasler, since 2010, expected graduation 2013
Christian Buck, since 2011, expected graduation 2014
Maria Nadejde, since 2012, expected graduation 2015

External Thesis Committee

Arianna Bisazza, University of Trento, Italy, 2013
Simon Carter: "Exploration and Exploitation of Multilingual Data for Statistical Machine Translation", University of Amsterdam, The Netherlands, 2012
Gennadi Lembersky: "The Effect of Translationese on Statistical Machine Translation", University of Haifa, Israel, 2012
Tsuyoshi Okita: "Word Alignment and Smoothing Method in Statistical Machine Translation: Noise, Prior Knowledge, and Overfitting", Dublin City University, Ireland, 2011
Daniel Ortiz Martinez: "Some Contributions to Fully Automatic and Interactive Phrase-Based Statistical Machine Translation", Polytechnic University of Valencia, Spain, 2011
Yvette Graham: "Deep Syntax in Statistical Machine Translation", Dublin City University, Ireland, 2010
James Brunning: "Statistical Machine Translation", Cambridge University, United Kingdom, 2010
Jesus Andres: "Statistical Approaches for Natural Language Modelling and Monotone Machine Translation", Polytechnic University of Valencia, Spain, 2010
Jorge Civera Saiz: "Novel statistical approaches to text classification, machine translation and computer-assisted translation", Polytechnic University of Valencia, Spain, 2008
Jesus Gimenez: "Exploiting current natural language processing technology for empirical machine translation and its evaluation", Polytechnic University of Catalonia, Barcelona, Spain, 2008
Marta Ruiz: "New reordering and modelling approaches for statistical machine translation", Polytechnic University of Catalonia, Barcelona, Spain, 2008
Jakob Elming: "Syntactic reordering in statistical machine translation", Copenhagen Business School, Denmark, 2008
Daniel Déchelotte: "Automatic speech translation by statistical methods", University of Paris-Sud 11, France, 2007

Funding

"CASMACAT - Cognitive Analysis and Statistical Methods for Advanced Computer Aided Translation", co-ordinator, €2.5 million, PI for University of Edinburgh's share (€760,000), FP7 STREP, European Commission, 2011-2014
"MateCat - Machine Translation Enhanced Computer Aided Translation", PI for University of Edinburgh's share (€450,000), FP7 STREP, European Commission, 2011-2014
"ACCEPT - Automated Community Content Editing Portal", PI for University of Edinburgh's share (€500,000), FP7 STREP, European Commission, 2012-2015
"MosesCore - Moses Open Source Evaluation and Support Co-ordination for Outreach and Exploitation", co-ordinator, €1.2 million, PI for University of Edinburgh's share (€520,000), FP7 CSA, European Commission, 2012-2015
"EU Bridge - Bridges across the Language Divide", PI for University of Edinburgh's share (€1 million), FP7 IP, European Commission, 2012-2015
"BOLT", subcontract, PI for University of Edinburgh, US$200,000 per year, DARPA, United States, 2011-
"TRAD", subcontract, PI for University of Edinburgh, €50,000, French Defense Department, 2011-
"LetsMT! - Platform for Online Sharing of Training Data and Building User Tailored MT", €1.67 million, PI for University of Edinburgh's share (€173,200), ICT PSP, European Commission, 2010-2013
"Machine Learning Techniques for Modeling of Language Varieties", Vienna Science Research and Technology Fund (WWTF), €75,000, 2010-2012
"EuroMatrixPlus - Bringing Machine Translation to the User", scientific co-ordinator, €3.5 million, PI for University of Edinburgh's share (€700,000), FP7 STREP, European Commission, 2009-2012
"EuroMatrix - Machine Translation for all European Languages", scientific co-ordinator, €2.5 million, PI for University of Edinburgh's share (€700,000), FP6 STREP, European Commission, 2006-2009
"Autonomous Global Integrated Language Exploitation (AGILE)", subcontract, PI for University of Edinburgh, US$200,000 per year, DARPA, United States, 2005-2011