Next: Introduction
Up: Download
Previous: Download
THE THISL BROADCAST NEWS RETRIEVAL SYSTEM
Date:
(1) University of Sheffield, Department of Computer Science, UK
(2) BBC, Research and Development, UK
(3) SoftSound, UK
Abstract:
This paper described the THISL spoken document retrieval system for
British and North American Broadcast News. The system is based on
the ABBOT large vocabulary speech recognizer, using a
recurrent network acoustic model, and a probabilistic text retrieval
system. We discuss the development of a realtime British English Broadcast
News system, and its integration into a spoken document retrieval
system. Detailed evaluation is performed using a similar North
American Broadcast News system, to take advantage of the TREC SDR
evaluation methodology. We report results on this evaluation, with
particular reference to the effect of query expansion and of
automatic segmentation algorithms.
Steve Renals
1999-04-30