James Clarke and Mirella Lapata. 2006. Models for Sentence Compression: A Comparison across Domains, Training Requirements and Evaluation Measures. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, 377-384. Sydney, Australia.

Sentence compression is the task of producing a summary at the sentence level. This paper focuses on three aspects of this task which have not received detailed treatment in the literature: training requirements, scalability, and automatic evaluation.We provide a novel comparison between a supervised constituentbased and an weakly supervised wordbased compression algorithm and examine how these models port to different domains (written vs. spoken text). To achieve this, a human-authored compression corpus has been created and our study highlights potential problems with the automatically gathered compression corpora currently used. Finally, we assess whether automatic evaluation measures can be used to determine compression quality.


@InProceedings{Clarke:Lapata:06a,
  author =       {James Clarke and Mirella Lapata},
  title =        {Models for Sentence Compression: A Comparison across
                  Domains, Training Requirements and Evaluation Measures},  
  crossref =     {ACL:COLING:06},
  pages =        {377--384}
}

@Proceedings{ACL:COLING:06,
  title =        {Proceedings of the 21st International Conference on
                  Computational Linguistics and 44th Annual Meeting of
                  the Association for Computational Linguistics},  
  booktitle =    {Proceedings of the 21st International Conference on
                  Computational Linguistics and 44th Annual Meeting of
                  the Association for Computational Linguistics},  
  address =      {Sydney, Australia},
  year =         2006
}