Trevor Cohn and Mirella Lapata (2008) Sentence Compression Beyond Word Deletion. In Proceedings of the 22nd International Conference on Computational Linguistics, 137--144. Manchester, UK.

In this paper we generalise the sentence compression task. Rather than simply shorten a sentence by deleting words or constituents, as in previous work, we rewrite it using additional operations such as substitution, reordering, and insertion. We present a new corpus that is suited to our task and a discriminative tree-to-tree transduction model that can naturally account for structural and lexical mismatches. The model incorporates a novel grammar extraction method, uses a language model for coherent output, and can be easily tuned to a wide range of compression specific loss functions.


@InProceedings{cohn-lapata:2008:PAPERS,
  author    = {Cohn, Trevor  and  Lapata, Mirella},
  title     = {Sentence Compression Beyond Word Deletion},
  booktitle = {Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)},
  year      = {2008},
  address   = {Manchester, UK},
  pages     = {137--144}
}