Publications (Back)


Text Mining, Information Retrieval

 

1.      Xin Cao, Gao Cong, Bin Cui, Christian S. Jensen: A Generalized Framework of Exploring Category Information for Question Retrieval in Community Question Answer Archives. The 17th International World Wide Web Conference (WWW) 2010

2.      Xin Cao, Gao Cong, Bin Cui, Christian S. Jensen, Ce Zhang: The Use of Categorization Information in Language Models for Question Retrieval. Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM) 2009 (supplementary materials on dataset)

3.      Yanhong ZhouGao CongBin CuiChristian S. Jensen,  Junjie Yao: Routing Questions to Right Users in Online Communities. Proceedings of the ICDE 2009

4.      Ce Ce Zhang, Cui Bin, Gao Cong, YuJing Wang: A Revisit of Query Expansion with Different Semantic Levels, Proceedings of the DASFAA 2009

5.      Gao Cong, Long Wang, Chin-Yew Lin, Y.I. Song, Y. Sun: Finding Question-Answer Pairs from Online Forums. Proceedings of the 31st Annual International ACM SIGIR Conference, 2008

6.      Shilin Ding, Gao Cong, Chin-Yew Lin and Xiaoyan Zhu:  Using Conditional Random Fields to Extract Contexts and Answers of Questions from Online Forums. The 46th Annual Meeting of the Association for Computational Linguistics. ACL 2008 supplementary materials

7.      Ce Zhang, YuJing Wang, Bin Cui, Gao Cong: Semantic Similarity Based on Compact Concept Ontology.   Proceedings of the WWW 2008 (Poster)

8.      Guihua Sun, Gao Cong, Xiaohua Liu, Chin-Yew Lin, Ming Zhou. Detecting Erroneous Sentences using labeled sequential Patterns and Tree Patterns. Proceedings of the AAAI 2007.

9.      Guihua Sun, Xiaohua Liu, Gao Cong, Ming Zhou, Zhongyang Xiong, Chin-Yew Lin, John Lee. Detecting Erroneous Sentences using Automatically Mined Sequential Patterns. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics.

10.  Bin Cui, Anirban Mondal, Jialie Shen, Gao Cong, Kian-Lee Tan: On effective E-mail Classification via Neural Networks. Proceedings of the 16th International Conference on Database and Expert Systems Applications (DEXA) 2005

11.  Gao Cong, Weesun Lee, Haoran Wu, Bing Liu. Semi-Supervised Text Classification Using Partitioned EM. Proceedings of the 9th International Conference on Database Systems for Advanced Applications (DASFAA) 2004


Database (Integration with IR, data cleaning, XML )

 

1.      Bin Cui, J. Zhao and Gao Cong: ISIS: A New Approach for Efficient Similarity Search in Sparse Databases. The 15th International Conference on Database Systems for Advanced Applications (DASFAA) 2010

2.      Gao Cong, Christian S. Jensen, Dingming Wu: Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects. Proceedings of 35th International Conference on Very Large Data Bases (VLDB 2009).2009

3.      Byron Choi, Gao Cong, Wenfei Fan, Stratis D. Viglas: Updating Recursive XML Views of Relations. Journal of Computer Science and Technology 2008

4.      Gao Cong, Wenfei Fan, Floris Geerts, Xibei Jia, Shuai Ma: Improving Data Quality: Consistency and Accuracy. Proceedings of the VLDB 2007

5.      Gao Cong: Query and Update Through XML Views. Proceedings of the DNIS 2007 (Invited paper)

6.      Wenfei Fan, Gao Cong, Philip Bohannon: Query XML with Update Syntax. Proceedings of ACM International Conference on Management of Data (SIGMOD) 2007

7.      Gao Cong, Wenfei Fan, Anastasios Kementsietsidis. Distributed Query Evaluation with Performance Guarantees. Proceedings of ACM International Conference on Management of Data (SIGMOD) 2007

8.      Byron Choi, Gao Cong, Wenfei Fan, Stratis D. Viglas Updating recursive XML views of relations. Proceedings of the 23rd International Conference on Database Engineering (ICDE), 2007.

9.       Gao Cong, Wenfei Fan, Floris Geerts . Annotation Propagation Revisited for Key Preserving Views . Proceedings of the 15th ACM Conference on Information and Knowledge Management (CIKM), 2006.

10.   Peter Buneman, Gao Cong, Wenfei Fan, Anastasios Kementsietsidis. Using Partial Evaluation in Distributed Query Evaluation. Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB), 2006

11.  Bin Cui, Jialie Shen, Gao Cong, Heng Tao Shen, Cui Yu. Composite Acoustic Features for Efficient Music Similarity Query. Proceedings of 14th ACM International Conference on Multimedia (ACM MM), 2006.

12.  Hanyu Li,  Mong-Li LeeWynne Hsu, and Gao Cong: An Estimation System for XPath Expressions. Proceedings of the 22th IEEE International Conference on Data Engineering (ICDE)2006

 

Data Mining and Bioinformatics

 

1.      Yuanzhe Cai, Gao Cong, Xu Jia, Hongyan Liu, Jun He, Jiaheng Lu, Xiaoyong Du. Efficient Algorithms for Computing Link-based Similarity in Real World Networks. Proceedings of the IEEE International Conference on Data Mining (ICDM). 2009 (Short Paper)

2.      Gao Cong, Bin Cui, Yingxin Li, Zonghong Zhang. Summarizing frequent patterns using profiles. Proceedings of the 9th International Conference on Database Systems for Advanced Applications (DASFAA) 2006

3.       Gao Cong, Kian-Lee Tan, Anthony K. H. Tung, Xin Xu. Mining Top-k Covering Rule Groups for Gene Expression Data. Proceedings of the ACM International Conference on Management of Data (SIGMOD) 2005

4.      Gao Cong, Kian-Lee Tan, Anthony K. H. Tung, Feng Pan: Mining Frequent Closed Patterns in Microarray Data. Proceedings of the IEEE International Conference on Data Mining, (ICDM). 2004

5.      Cuiping Li, Gao Cong, Anthony K. H. Tung Shan Wang. Large Incremental Maintenance of Quotient Cube for Sum and Median. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) 2004

6.      Xin Xu, Gao Cong, Beng Chin Ooi, Kian-Lee Tan, Anthony K. H. Tung: Semantic Mining and Analysis of Gene Expression Data (Demo). Proceedings of the 30th International Conference on Very Large Data Bases (VLDB) 2004

7.      Gao Cong, Anthony K. H. Tung, Xin Xu, Feng Pan, Jiong Yang. FARMER: Fining Interesting Association Rule Groups by Row Enumeration in Biological Datasets. Proceedings of the 23rd ACM International Conference on Management of Data (SIGMOD) 2004

8.      Feng Pan, Anthony K. H. Tung, Gao Cong, Xin Xu. COBBLER: Combining Column and Row Enumeration for Closed Pattern Discovery. Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM) 2004

9.      Gao Cong, Beng Chin Ooi, Kian-Lee Tan, Anthony K. H. Tung. Go Green: Recycle and Reuse Frequent Patterns. Proceedings of the 20th IEEE International Conference on Data Engineering (ICDE) 2004

10.  Feng Pan, Gao Cong, Anthony K. H. Tung, Jiong Yang, Mohammed J. Zaki. CARPENTER: Finding Closed Patterns in Long Biological Datasets. Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) 2003

11.  Gao Cong, Bing Liu: Speed-up Iterative Frequent Itemset Mining with Constraint Changes. Proceedings of the IEEE International Conference on Data Mining, (ICDM). 2002

12.  Gao Cong, Lan Yi, Bing Liu, Ke Wang: Discovering frequent substructures from hierarchical semi-structured data.  Proceedings of the Second SIAM International Conference on Data Mining, (SDM). 2002