Mirella Lapata's resources

Mirella Lapata

Home Research Papers Projects Resources Students

CNN highlights dataset used in Woodsend and Lapata (2010, ACL); contains alignments of CNN highlights with document sentences.
phrase similarity dataset used in Mitchell and Lapata (2008, ACL) and Mitchell and Lapata (2010, Cognitive Science).
BBC news data set for image annotation experiments used in Feng and Lapata (2008, ACL) and Feng and Lapata (2010, NAACL).
Compression corpora used in Clarke and Lapata (2008, JAIR) ad Clarke and Lapata (2010, CL).
Paraphrase corpus used in Cohn et al. (2008, CL).
Bilingual corpus with semantic role annotations used in Padó and Lapata (2009, JAIR).
T3The Tree-Transducer Toolkit.
DependencyVectors software package to produce vector space models from dependency-parsed corpora.
Brown coherence toolkit.