Mirella Lapata
Home
Research
Papers
Projects
Resources
Students
CNN highlights dataset
used in Woodsend and Lapata (2010, ACL); contains alignments of CNN highlights with document sentences.
phrase similarity dataset used in
Mitchell and Lapata (2008, ACL)
and
Mitchell and Lapata (2010, Cognitive Science)
.
BBC news data set
for image annotation experiments used in Feng and Lapata (2008, ACL) and Feng and Lapata (2010, NAACL).
Compression corpora
used in Clarke and Lapata (2008, JAIR) ad Clarke and Lapata (2010, CL).
Paraphrase corpus
used in Cohn et al. (2008, CL).
Bilingual corpus
with semantic role annotations used in Padó and Lapata (2009, JAIR).
T3
The Tree-Transducer Toolkit.
DependencyVectors
software package to produce vector space models from dependency-parsed corpora.
Brown
coherence toolkit
.