|
Gao Cong (丛 高)
Department of Computer Science, Aalborg University, Denmark Center for Data-intensive
Systems (Daisy). Email gaocong
at cs.aau.dk |
2008- Assistant Professor in Aalborg University, Denmark
2007 - 2008: Researcher in Microsoft Research Asia, Beijing
2004 - 2006: Postdoc Research Fellow in the Database group within the Laboratory for Foundations of Computer Science within the School of Informatics of University of Edinburgh.
2000-2003: Ph.D. student in the Department of Computer Science, NUS, Singapore.
1999-2000: ShenZhen Huawei Company, China
1992-1999: M. Eng and B.Eng. in MIS from Tianjin University, Tianjin, China.
Database, Data Mining, Text Mining, Information Retrieval
My current research:
·
Mining forums (including Community Question Answering)
and social network
o Extracting Question
Answer pairs from forums to enrich the knowledge base of CQA services (related
papers: SIGIR08, ACL08 )
o Routing questions to
expert users (related paper: ICDE09)
o The use of
categorization information to improve question search (related paper: CIKM09,
WWW2010)
o Computing link-based
similarity (related paper: ICDM09)
·
Integrating information retrieval and database
systems
o
Efficient processing of spatial keyword queries (related paper: VLDB09)
My past research interests include classifying gene expression
data, mining frequent patterns, XML, etc.
1. Xin Cao, Gao Cong, Bin Cui, Christian S. Jensen: A Generalized Framework of Exploring Category Information for Question Retrieval in Community Question Answer Archives. The 17th International World Wide Web Conference (WWW) 2010
2. Bin
Cui, J. Zhao and Gao Cong: ISIS: A New Approach for Efficient Similarity Search in Sparse
Databases. The 15th International Conference on Database Systems for
Advanced Applications (DASFAA) 2010
3. Yuanzhe Cai, Gao Cong, Xu Jia,
Hongyan Liu, Jun He, Jiaheng Lu, Xiaoyong Du. Efficient Algorithms for Computing Link-based Similarity in Real World
Networks. Proceedings of the IEEE International Conference on Data Mining (ICDM). 2009 (Short Paper)
4. Xin Cao, Gao
Cong, Bin Cui, Christian S. Jensen,
Ce Zhang: The Use of Categorization
Information in Language Models for Question Retrieval. Proceedings of the
18th ACM Conference on Information and Knowledge Management (CIKM) 2009 (supplementary materials on
dataset)
5. Gao Cong, Christian S. Jensen, Dingming Wu: Efficient
Retrieval of the Top-k Most Relevant Spatial Web Objects. Proceedings of 35th International
Conference on Very Large Data Bases (VLDB).2009
6. Yanhong Zhou, Gao Cong, Bin Cui, Christian S. Jensen, Junjie Yao: Routing Questions to Right Users in Online Communities. Proceedings of the ICDE 2009
7. Ce Zhang, Cui Bin, Gao Cong, YuJing Wang: A Revisit of Query Expansion with Different Semantic Levels, Proceedings of the DASFAA 2009
8. Gao Cong, Long Wang, Chin-Yew Lin, Y.I. Song, Y. Sun: Finding Question-Answer Pairs from Online Forums. Proceedings of the 31st Annual International ACM SIGIR Conference, 2008
9. Shilin Ding, Gao Cong, Chin-Yew Lin and Xiaoyan Zhu: Using Conditional Random Fields to Extract Contexts and Answers of Questions from Online Forums. The 46th Annual Meeting of the Association for Computational Linguistics. ACL 2008 supplementary materials
10. Ce Zhang, YuJing Wang, Bin Cui, Gao Cong: Semantic Similarity Based on Compact Concept Ontology. Proceedings of the WWW 2008 (Poster)
11. Byron Choi, Gao Cong, Wenfei Fan, Stratis D. Viglas: Updating Recursive XML Views of Relations. Journal of Computer Science and Technology 2008
12. Gao Cong, Wenfei Fan, Floris Geerts, Xibei Jia, Shuai Ma: Improving Data Quality: Consistency and Accuracy. Proceedings of the VLDB 2007
13. Gao Cong: Query and Update Through XML Views. Proceedings of the DNIS 2007 (Invited paper)
14. Wenfei Fan, Gao Cong, Philip Bohannon: Query XML with Update Syntax. Proceedings of ACM International Conference on Management of Data (SIGMOD) 2007
15. Gao Cong, Wenfei Fan, Anastasios Kementsietsidis. Distributed Query Evaluation with Performance Guarantees. Proceedings of ACM International Conference on Management of Data (SIGMOD) 2007
16. Guihua Sun, Gao Cong, Xiaohua Liu, Chin-Yew Lin, Ming Zhou. Detecting Erroneous Sentences using labeled sequential Patterns and Tree Patterns. Proceedings of the AAAI 2007.
17. Guihua Sun, Xiaohua Liu, Gao Cong, Ming Zhou, Zhongyang Xiong, Chin-Yew Lin, John Lee. Detecting Erroneous Sentences using Automatically Mined Sequential Patterns. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. ACL 2007.
18. Byron Choi, Gao Cong, Wenfei Fan, Stratis D. Viglas Updating recursive XML views of relations. Proceedings of the 23rd International Conference on Database Engineering (ICDE), 2007.
19. Gao Cong, Wenfei Fan, Floris Geerts . Annotation Propagation Revisited for Key Preserving Views. Proceedings of the 15th ACM Conference on Information and Knowledge Management (CIKM), 2006.
20. Peter Buneman, Gao Cong, Wenfei Fan, Anastasios Kementsietsidis. Using Partial Evaluation in Distributed Query Evaluation. Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB), 2006
21. Bin Cui, Jialie Shen, Gao Cong, Heng Tao Shen, Cui Yu. Composite Acoustic Features for Efficient Music Similarity Query. Proceedings of 14th ACM International Conference on Multimedia (ACM MM), 2006.
22. Hanyu Li, Mong-Li Lee, Wynne Hsu, and Gao Cong: An Estimation System for XPath Expressions. Proceedings of the 22th IEEE International Conference on Data Engineering (ICDE)2006
23. Gao Cong, Bin Cui, Yingxin Li, Zonghong Zhang. Summarizing frequent patterns using
profiles. Proceedings of the 9th International
Conference on Database Systems for Advanced
Applications (DASFAA)
2006
24. Gao Cong, Kian-Lee Tan, Anthony K. H. Tung, Xin Xu. Mining Top-k Covering Rule Groups for Gene Expression Data. Proceedings of the ACM International Conference on Management of Data (SIGMOD) 2005
25. Bin Cui, Anirban Mondal, Jialie Shen, Gao Cong, Kian-Lee Tan: On effective E-mail Classification via Neural Networks. Proceedings of the 16th International Conference on Database and Expert Systems Applications (DEXA) 2005
26. Gao Cong, Kian-Lee Tan, Anthony K. H. Tung, Feng Pan: Mining Frequent Closed Patterns in Microarray Data. Proceedings of the IEEE International Conference on Data Mining, (ICDM). 2004
27. Cuiping Li, Gao Cong, Anthony K. H. Tung Shan Wang. Large Incremental Maintenance of Quotient Cube for Sum and Median. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) 2004
28. Xin Xu, Gao Cong, Beng Chin Ooi, Kian-Lee Tan, Anthony K. H. Tung: Semantic Mining and Analysis of Gene
Expression Data (Demo). Proceedings
of the 30th International Conference on Very Large Data Bases (VLDB)
2004
29. Gao Cong, Anthony
K. H. Tung, Xin Xu, Feng Pan, Jiong
Yang. FARMER: Fining Interesting
Association Rule Groups by Row Enumeration in Biological Datasets. Proceedings of the 23rd ACM International Conference on
Management of Data (SIGMOD) 2004
30. Feng Pan, Anthony K. H. Tung, Gao Cong, Xin Xu. COBBLER: Combining Column and Row Enumeration for Closed Pattern Discovery. Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM) 2004
31. Gao Cong, Beng Chin Ooi, Kian-Lee Tan, Anthony K. H. Tung. Go Green: Recycle and Reuse Frequent Patterns. Proceedings of the 20th IEEE International Conference on Data Engineering (ICDE) 2004
32. Gao Cong, Weesun Lee, Haoran Wu, Bing Liu. Semi-Supervised
Text Classification Using Partitioned EM. Proceedings of the 9th International Conference on Database Systems for Advanced Applications (DASFAA) 2004
33. Feng Pan, Gao Cong, Anthony K. H. Tung, Jiong Yang, Mohammed J. Zaki. CARPENTER: Finding Closed Patterns in Long
Biological Datasets. Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data
Mining (KDD) 2003
34. Gao Cong, Bing Liu: Speed-up Iterative Frequent Itemset Mining with Constraint Changes. Proceedings of the IEEE International Conference on Data Mining, (ICDM). 2002
35. Gao Cong, Lan Yi, Bing Liu, Ke Wang: Discovering frequent substructures from hierarchical semi-structured data. Proceedings of the Second SIAM International Conference on Data Mining, (SDM). 2002
Internet Technologies 2009
Database Technology 2009 (with Simonas, and Ira)
2 SW3 Groups 2009
Semester Coordinator for SW3 2009
1 DAT5 Group 2009
1 DAT4 Group 2009
Data warehouse and Data mining 2008 Fall (with Thomas and Manfred)
Introduction to Internet Technologies 2008 (with Ken, Man Lung Yiu)
Database Technology 2008 (with my colleagues)
1 DAT5 Group 2008 (Co-supervised with Torben)
Seminar coordinator for Edinburgh
Database Group's weekly seminar Feb. - Sept. 2006
Program Committee
2010:
KDD
WWW Poster track
International Conference on
Advances in Social Network Analysis and Data Mining (ASONAM)
DASFAA; DEXA; DNIS
2009:
VLDB
ICDE
The IEEE international conference on Data Mining
(ICDM).
The International Conference on Advances in Social Network
Analysis and Data Mining
The 1st International Workshop on Web-based Contents Management
Technologies
PAKDD; DEXA; APWeb
2008
and earlier:
SIGMOD 2008
KDD 2008
ICDM 2008
ICDE 2007
VLDB Demo track 2008
DEXA 2008
XANTEC 2008
WWW Poster track 2008
Apweb 2007, 2008
DNIS 2007
The International Conference on Web-Age Information Management
(WAIM) 2005, 2006, 2007, 2008
International Conference on Database Systems for Advanced
Applications (DASFAA) 2006, 2007, 2008
ICDE Text Mining Workshop 2007
International Workshop on High Performance Data Mining and
Applications (HPDMA'07) 2007
The 2nd International Conference on Availability, Reliability and
Security (AReS) 2007, 2008
18th IEEE International Conference on Tools with Artificial
Intelligent (ICTAI) 2006
Conference on Information and Knowledge Management (CIKM) 2006
DEXA workshop on Data Management in Global-Scale Data Repositories
(GRep) 2005 2006
Reviewer:
IEEE Transactions on Knowledge and Data Engineering (TKDE);
VLDB Journal;
Bioinformatics;
IEEE Intelligent Systems Special issue on Data Mining for
Bioinformatics;
Journal of Bioinformatics and Computational Biology (JBCB);
International Journal of Information Technology;
ICDE 2005, BNCOD 2005, EDBT 2006, COMAD 2006, WWW2006, VLDB 2006,
SSDBM2007, SIGMOD 2007
The Chinese webpage
that we created for the School of Informatics, University of Edinburgh