Silvia Pareti

I am a PhD student at the Institute for Language, Cognition and Computation (ILCC) of the School of Informatics, University of Edinburgh, under the supervision of Prof. Bonnie Webber and Dr. Kees van Deemter. I am conducting studies in the field of Attribution, funded by the Scottish Informatics and Computer Science Alliance (SICSA).

PARC - Penn Attribution Relations Corpus

The Penn Attribution Relations Corpus (PARC) is a project at the Institute for Language, Cognition and Computation (ILCC), University of Edinburgh. The goal of the project is the development of a large-scale corpus of Attribution Relations (ARs) built on top of the Penn Discourse Treebank (PDTB). ARs are annotated at both the inter- and intra-sentential levels, including when they are nested or embedded in another AR. ARs in PARC are taken to be lexically anchored by an attributional cue, usually a verb, which provides the link between the source and the attributed content.
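As a rough illustration of the source / cue / content structure described above, an AR could be represented as follows. This is only a sketch: the class name, the field names, the token-offset spans and the example sentence are all assumptions made for illustration, and they do not reflect the actual PARC file format or annotation tool.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# Hypothetical span type: (start, end) token offsets, end exclusive.
Span = Tuple[int, int]


@dataclass
class AttributionRelation:
    """Illustrative AR: a cue links a source to an attributed content."""
    source: List[Span]
    cue: List[Span]
    content: List[Span]
    # ARs embedded inside this AR's content (nested attribution).
    nested: List["AttributionRelation"] = field(default_factory=list)

    def depth(self) -> int:
        """Nesting depth: 1 for a first-level AR with nothing embedded."""
        return 1 + max((ar.depth() for ar in self.nested), default=0)


def span_text(spans: List[Span], tokens: List[str]) -> str:
    """Join the tokens covered by a list of spans."""
    return " ".join(" ".join(tokens[start:end]) for start, end in spans)


# Invented example sentence, not taken from the corpus.
tokens = "Analysts said the company believes sales will rise .".split()

# Nested AR: "the company" (source) "believes" (cue) "sales will rise" (content).
inner = AttributionRelation(source=[(2, 4)], cue=[(4, 5)], content=[(5, 8)])

# First-level AR: "Analysts" (source) "said" (cue), with the nested AR in its content.
outer = AttributionRelation(
    source=[(0, 1)], cue=[(1, 2)], content=[(2, 8)], nested=[inner]
)

print(span_text(outer.source, tokens), "|", span_text(outer.cue, tokens))
print(span_text(inner.content, tokens))
```

Spans are kept as lists so that a discontinuous source or content could be stored as several pieces; whether that is needed depends on the annotation scheme in use.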

PARC is intended to support the training and evaluation of models for:

  • the extraction of quotations and other types of attributed content (e.g. beliefs, orders, knowledge)
  • the attribution of quotations to their source

PARC 2.0: A full description of the corpus, the annotation schema and the inter-annotator agreement study is given in (Pareti, 2012). Although not fully annotated, the corpus has already been used in a number of studies and is available for research purposes. If you are interested in this version of the corpus, please contact me.

Docs Tokens ARs
2,280 1,139k 9,882

PARC 3.0 (Ongoing): We are completing and extending the annotation of the corpus in order to release a complete resource. A team of annotators with a linguistics background is currently working on:

  • the completion of the annotation of first-level ARs
    Completed 02/2014
  • the annotation of nested ARs
We also plan:
  • the revision of the ARs in PARC 2.0
  • the annotation of the attribution type (Quotation/Other)

Studies Using PARC

  • Almeida, M., Almeida, M. B., Martins, A. F. T. (2014). A Joint Model for Quotation Attribution and Coreference Resolution. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2014, Gothenburg, Sweden.
  • Pareti, Silvia, O'Keefe, Tim, Konstas, Ioannis, Curran, James R. and Koprinska, Irena, Automatically Detecting and Attributing Indirect Quotations, In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Seattle, U.S., 2013.
  • O'Keefe, Tim, Pareti, Silvia, Curran, James R., Koprinska, Irena and Honnibal, Matthew, A Sequence Labelling Approach to Quote Attribution, In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Jeju, Korea, 2012.