The query set used in this paper is here.

The annotation data (relevant questions to a query in the query set) used in our papers is here.

The format of the annotation data file is as follows:

__________________________________

ID Query Question

Relevant Historical Question 1

Relevant Historical Question 2

... ...

__________________________________
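
As a rough illustration of how a file in this layout might be read, here is a minimal Python sketch. It rests on assumptions that this page does not state: that each record in the actual file is a contiguous run of lines, that records are separated by a blank line, and that the ID is the first whitespace-delimited token on a record's first line. The names AnnotationRecord and parse_annotations are hypothetical, and the separator logic may need adjusting once you have inspected the real file.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class AnnotationRecord:
    """One query together with its annotated relevant historical questions."""
    query_id: str
    query: str
    relevant: List[str] = field(default_factory=list)


def parse_annotations(text: str) -> List[AnnotationRecord]:
    """Parse annotation text into records.

    Assumptions (not confirmed by the data description):
      * a blank line ends the current record;
      * the first line of a record is "<ID><whitespace><query question>";
      * every following non-blank line is one relevant historical question.
    """
    records: List[AnnotationRecord] = []
    current: Optional[AnnotationRecord] = None
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            # Blank line: close the current record (assumed separator).
            current = None
            continue
        if current is None:
            # First line of a new record: split the ID from the query text.
            parts = line.split(None, 1)
            query_id = parts[0]
            query = parts[1] if len(parts) > 1 else ""
            current = AnnotationRecord(query_id, query)
            records.append(current)
        else:
            # Any further line belongs to the current query.
            current.relevant.append(line)
    return records
```

To use it, read the annotation file into a string (for example with open(path, encoding="utf-8").read(), where path is wherever you saved the file) and pass that string to parse_annotations. Each returned record then exposes query_id, query, and the list relevant.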

If you need the question repository used in this paper, please email us and we will arrange to send you the whole repository, which was extracted from Yahoo! Answers.

If you use our annotation data, please cite our papers:

Xin Cao, Gao Cong, Bin Cui, Christian S. Jensen, and Ce Zhang: "The Use of Categorization Information in Retrieving Questions". The 18th ACM Conference on Information and Knowledge Management (CIKM), pp. 265-274, 2009.

Xin Cao, Gao Cong, Bin Cui, and Christian S. Jensen: "A Generalized Framework of Exploring Category Information for Question Retrieval in Community Question Answer Archives". The 19th International World Wide Web Conference (WWW), pp. 201-210, 2010.