
THE THISL BROADCAST NEWS RETRIEVAL SYSTEM




(1) University of Sheffield, Department of Computer Science, UK
(2) BBC, Research and Development, UK
(3) SoftSound, UK

Abstract:

This paper describes the THISL spoken document retrieval system for British and North American Broadcast News. The system is based on the ABBOT large vocabulary speech recognizer, which uses a recurrent network acoustic model, and on a probabilistic text retrieval system. We discuss the development of a real-time British English Broadcast News system and its integration into a spoken document retrieval system. Detailed evaluation is performed using a similar North American Broadcast News system, to take advantage of the TREC SDR evaluation methodology. We report results on this evaluation, with particular reference to the effects of query expansion and of automatic segmentation algorithms.



 

Steve Renals
1999-04-30