Claire Grover

Senior Research Fellow in the Language Technology Group at the Human Communication Research Centre, which is part of the School of Informatics

 
  

Email: C.Grover@ed.ac.uk
Tel: +44 131 650 4441
Fax: +44 131 650 4587

 

Publications


Current and Recent Projects

Trading Consequences. Text mining of thousands of pages of historical documents related to commodity trading in the British Empire.

DEEP (The Digitisation of English Placenames). This project aims to digitise the 86 volumes of the Survey of English Place-Names, a county by county survey started in 1922 by the specialists of the English Place-Name Society (EPNS). Our role is to convert the semi-structured text into a full structured resource from which a historical gazetteer is derived for use with the Edinburgh Geoparser as embedded in Unlock Text.A browsable view of the gazetteer can be found at placenames.org.uk

SYNC3 (Synergetic Content Creation and Communication). The goal of SYNC3 is to create a framework for structuring, rendering more accessible and enabling collaborative creation of the extensive user-provided content that is located in personal blogs and refers to running news issues. Funded by the European Union's 7th Framework Programme: Information and Communication Technologies (ICT).

GeoDigRef. GeoDigRef is a short project investigating the advantages of metadata enrichment across three diverse resource collections funded under the JISC Digitisation programme.

TXM. Text Mining for Biomedical Content Curation

EASIE (Edinburgh And Stanford Information Extraction). Combining Shallow Semantics and Domain Knowledge: the project builds on existing techniques for information extraction (IE) in order to develop and implement improved methods for extracting semantic content from text.

SQUAD. Smart Qualitative Data: Methods and Community Tools for Data Mark-Up.

SEER (Stanford Edinburgh Entity Recognition) - Named Entity Recognition with emphasis on bootstrapping, machine learning and porting to new domains (Biomedicine, Astronomy, Archaeology)

SUM - Summarisation of Legal Texts

CROSSMARC (CROSS-lingual Multi-Agent retail Comparison) - Information Extraction from web pages

DISP (Data Intensive Semantics and Pragmatics - Corpus-based lexical semantics in the biomedical domain