The query set used in this paper is here.
The annotation data (the historical questions relevant to each query in the query set) used in our papers is here.
The format of the annotation data file is as follows:
__________________________________
ID Query Question
Relevant Historical Question 1
Relevant Historical Question 2
... ...
__________________________________
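As an illustration of how a file in this layout might be read, here is a minimal Python sketch. It rests on assumptions that the description above does not confirm: that each record starts with a line holding the ID and the query question separated by whitespace, that every following non-empty line is one relevant historical question, and that records are separated by blank lines. The names `AnnotationRecord` and `parse_annotations` are hypothetical and are not part of the released data; adjust the splitting logic to match the actual file.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class AnnotationRecord:
    """One query and the historical questions annotated as relevant to it."""
    query_id: str
    query: str
    relevant_questions: List[str] = field(default_factory=list)


def parse_annotations(text: str) -> List[AnnotationRecord]:
    """Parse annotation text into records.

    ASSUMPTIONS (not confirmed by the format description):
      * records are separated by one or more blank lines;
      * the first line of a record is "<ID><whitespace><query question>";
      * each remaining line of a record is one relevant question.
    """
    records: List[AnnotationRecord] = []
    current = None  # record being filled, or None between records
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            # A blank line closes the current record.
            current = None
            continue
        if current is None:
            # Header line: split the ID from the query at the first whitespace run.
            parts = line.split(None, 1)
            query = parts[1] if len(parts) > 1 else ""
            current = AnnotationRecord(query_id=parts[0], query=query)
            records.append(current)
        else:
            current.relevant_questions.append(line)
    return records


if __name__ == "__main__":
    # Made-up sample text, used only to exercise the parser.
    sample = (
        "1\tHow do I bake bread?\n"
        "What is the best bread recipe?\n"
        "How long should bread bake?\n"
        "\n"
        "2\tWhere can I learn Python?\n"
        "Best way to learn Python?\n"
    )
    for record in parse_annotations(sample):
        print(record.query_id, record.query, len(record.relevant_questions))
```

If the real file separates records differently (for example, with no blank lines between them), the record boundary test is the only part that needs to change.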
If you need the question repository used in this paper, please email us and we will arrange to send you the whole repository, which was extracted from Yahoo! Answers.
If you use our annotation data, please cite our papers:
Xin Cao, Gao Cong, Bin Cui, Christian S. Jensen and Ce Zhang: "The Use of Categorization Information in Retrieving Questions". The 18th ACM Conference on Information and Knowledge Management (CIKM), pp. 265-274, 2009.
Xin Cao, Gao Cong, Bin Cui and Christian S. Jensen: "A Generalized Framework of Exploring Category Information for Question Retrieval in Community Question Answer Archives". The 19th International World Wide Web Conference (WWW), pp. 201-210, 2010.