Gao Cong   (丛 高)

 

Department of Computer Science,  Aalborg University,  Denmark

Center for Data-intensive Systems (Daisy).

Email    gaocong at cs.aau.dk

 

 

Short Biography

2008- Assistant Professor in Aalborg University, Denmark

2007 - 2008: Researcher in Microsoft Research Asia, Beijing

2004 - 2006:  Postdoc Research Fellow in the Database group within the Laboratory for Foundations of Computer Science within the School of Informatics of University of Edinburgh.

2000-2003: Ph.D. student in the Department of Computer Science, NUS, Singapore.

1999-2000: ShenZhen Huawei Company, China

1992-1999: M. Eng and B.Eng. in MIS from Tianjin University, Tianjin, China. 

Research Interests

     

Database, Data Mining, Text Mining, Information Retrieval   

My current research:

·         Mining forums (including Community Question Answering) and social network

o   Extracting Question Answer pairs from forums to enrich the knowledge base of CQA services (related papers: SIGIR08, ACL08 )

o   Routing questions to expert users (related paper: ICDE09)

o   The use of categorization information to improve question search (related paper: CIKM09, WWW2010)

o   Computing link-based similarity (related paper: ICDM09)

·         Integrating information retrieval and database systems 

o   Efficient processing of spatial keyword queries (related paper: VLDB09)

My past research interests include classifying gene expression data, mining frequent patterns, XML, etc.

Publications  (By topics)

1.      Xin Cao, Gao Cong, Bin Cui, Christian S. Jensen: A Generalized Framework of Exploring Category Information for Question Retrieval in Community Question Answer Archives. The 17th International World Wide Web Conference (WWW) 2010

2.      Bin Cui, J. Zhao and Gao Cong: ISIS: A New Approach for Efficient Similarity Search in Sparse Databases. The 15th International Conference on Database Systems for Advanced Applications (DASFAA) 2010

3.      Yuanzhe Cai, Gao Cong, Xu Jia, Hongyan Liu, Jun He, Jiaheng Lu, Xiaoyong Du. Efficient Algorithms for Computing Link-based Similarity in Real World Networks. Proceedings of the IEEE International Conference on Data Mining (ICDM). 2009 (Short Paper)

4.      Xin Cao, Gao Cong, Bin Cui, Christian S. Jensen, Ce Zhang: The Use of Categorization Information in Language Models for Question Retrieval. Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM) 2009 (supplementary materials on dataset)

5.      Gao Cong, Christian S. Jensen, Dingming Wu: Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects. Proceedings of 35th International Conference on Very Large Data Bases (VLDB).2009

6.      Yanhong ZhouGao CongBin CuiChristian S. Jensen,  Junjie Yao: Routing Questions to Right Users in Online Communities. Proceedings of the ICDE 2009

7.      Ce Zhang, Cui Bin, Gao Cong, YuJing Wang: A Revisit of Query Expansion with Different Semantic Levels, Proceedings of the DASFAA 2009

8.      Gao Cong, Long Wang, Chin-Yew Lin, Y.I. Song, Y. Sun: Finding Question-Answer Pairs from Online Forums. Proceedings of the 31st Annual International ACM SIGIR Conference, 2008

9.      Shilin Ding, Gao Cong, Chin-Yew Lin and Xiaoyan Zhu:  Using Conditional Random Fields to Extract Contexts and Answers of Questions from Online Forums. The 46th Annual Meeting of the Association for Computational Linguistics. ACL 2008 supplementary materials

10.  Ce Zhang, YuJing Wang, Bin Cui, Gao Cong: Semantic Similarity Based on Compact Concept Ontology.   Proceedings of the WWW 2008 (Poster)

11.  Byron Choi, Gao Cong, Wenfei Fan, Stratis D. Viglas: Updating Recursive XML Views of Relations. Journal of Computer Science and Technology 2008

12.  Gao Cong, Wenfei Fan, Floris Geerts, Xibei Jia, Shuai Ma: Improving Data Quality: Consistency and Accuracy. Proceedings of the VLDB 2007

13.  Gao Cong: Query and Update Through XML Views. Proceedings of the DNIS 2007 (Invited paper)

14.  Wenfei Fan, Gao Cong, Philip Bohannon: Query XML with Update Syntax. Proceedings of ACM International Conference on Management of Data (SIGMOD) 2007

15.  Gao Cong, Wenfei Fan, Anastasios Kementsietsidis. Distributed Query Evaluation with Performance Guarantees. Proceedings of ACM International Conference on Management of Data (SIGMOD) 2007

16.  Guihua Sun, Gao Cong, Xiaohua Liu, Chin-Yew Lin, Ming Zhou. Detecting Erroneous Sentences using labeled sequential Patterns and Tree Patterns. Proceedings of the AAAI 2007.

17.  Guihua Sun, Xiaohua Liu, Gao Cong, Ming Zhou, Zhongyang Xiong, Chin-Yew Lin, John Lee. Detecting Erroneous Sentences using Automatically Mined Sequential Patterns. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. ACL 2007.

18.  Byron Choi, Gao Cong, Wenfei Fan, Stratis D. Viglas Updating recursive XML views of relations. Proceedings of the 23rd International Conference on Database Engineering (ICDE), 2007.

19.   Gao Cong, Wenfei Fan, Floris Geerts . Annotation Propagation Revisited for Key Preserving Views. Proceedings of the 15th ACM Conference on Information and Knowledge Management (CIKM), 2006.

20.   Peter Buneman, Gao Cong, Wenfei Fan, Anastasios Kementsietsidis. Using Partial Evaluation in Distributed Query Evaluation. Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB), 2006

21.  Bin Cui, Jialie Shen, Gao Cong, Heng Tao Shen, Cui Yu. Composite Acoustic Features for Efficient Music Similarity Query. Proceedings of 14th ACM International Conference on Multimedia (ACM MM), 2006.

22.  Hanyu Li,  Mong-Li LeeWynne Hsu, and Gao Cong: An Estimation System for XPath Expressions. Proceedings of the 22th IEEE International Conference on Data Engineering (ICDE)2006

23.  Gao Cong, Bin Cui, Yingxin Li, Zonghong Zhang. Summarizing frequent patterns using profiles. Proceedings of the 9th International Conference on Database Systems for Advanced Applications (DASFAA) 2006

24.   Gao Cong, Kian-Lee Tan, Anthony K. H. Tung, Xin Xu. Mining Top-k Covering Rule Groups for Gene Expression Data. Proceedings of the ACM International Conference on Management of Data (SIGMOD) 2005

25.  Bin Cui, Anirban Mondal, Jialie Shen, Gao Cong, Kian-Lee Tan: On effective E-mail Classification via Neural Networks. Proceedings of the 16th International Conference on Database and Expert Systems Applications (DEXA) 2005

26.  Gao Cong, Kian-Lee Tan, Anthony K. H. Tung, Feng Pan: Mining Frequent Closed Patterns in Microarray Data. Proceedings of the IEEE International Conference on Data Mining, (ICDM). 2004

27.  Cuiping Li, Gao Cong, Anthony K. H. Tung Shan Wang. Large Incremental Maintenance of Quotient Cube for Sum and Median. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) 2004

28.  Xin Xu, Gao Cong, Beng Chin Ooi, Kian-Lee Tan, Anthony K. H. Tung: Semantic Mining and Analysis of Gene Expression Data (Demo). Proceedings of the 30th International Conference on Very Large Data Bases (VLDB) 2004

29.  Gao Cong, Anthony K. H. Tung, Xin Xu, Feng Pan, Jiong Yang. FARMER: Fining Interesting Association Rule Groups by Row Enumeration in Biological Datasets. Proceedings of the 23rd ACM International Conference on Management of Data (SIGMOD) 2004

30.  Feng Pan, Anthony K. H. Tung, Gao Cong, Xin Xu. COBBLER: Combining Column and Row Enumeration for Closed Pattern Discovery. Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM) 2004

31.  Gao Cong, Beng Chin Ooi, Kian-Lee Tan, Anthony K. H. Tung. Go Green: Recycle and Reuse Frequent Patterns. Proceedings of the 20th IEEE International Conference on Data Engineering (ICDE) 2004

32.  Gao Cong, Weesun Lee, Haoran Wu, Bing Liu. Semi-Supervised Text Classification Using Partitioned EM. Proceedings of the 9th International Conference on Database Systems for Advanced Applications (DASFAA) 2004

33.  Feng Pan, Gao Cong, Anthony K. H. Tung, Jiong Yang, Mohammed J. Zaki. CARPENTER: Finding Closed Patterns in Long Biological Datasets. Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) 2003

34.  Gao Cong, Bing Liu: Speed-up Iterative Frequent Itemset Mining with Constraint Changes. Proceedings of the IEEE International Conference on Data Mining, (ICDM). 2002

35.  Gao Cong, Lan Yi, Bing Liu, Ke Wang: Discovering frequent substructures from hierarchical semi-structured data.  Proceedings of the Second SIAM International Conference on Data Mining, (SDM). 2002

 

Teaching

            Internet Technologies  2009

            Database Technology 2009 (with Simonas, and Ira)

            2 SW3 Groups 2009

            Semester Coordinator for SW3 2009

            1 DAT5 Group 2009

            1 DAT4 Group 2009

            Data warehouse and Data mining  2008 Fall (with Thomas and Manfred)

Introduction to Internet Technologies 2008 (with Ken, Man Lung Yiu)

Database Technology 2008 (with my colleagues)

1 DAT5 Group 2008 (Co-supervised with Torben)

 

Services

          Seminar coordinator for Edinburgh Database Group's weekly seminar           Feb. - Sept. 2006

Program Committee

2010:

KDD

WWW Poster track

International Conference on Advances in Social Network Analysis and Data Mining (ASONAM)

DASFAA; DEXA; DNIS

            2009:

VLDB

ICDE

The IEEE international conference on Data Mining (ICDM).

The International Conference on Advances in Social Network Analysis and Data Mining

The 1st International Workshop on Web-based Contents Management Technologies 

PAKDD; DEXA; APWeb

            2008 and earlier:

SIGMOD 2008

KDD 2008

ICDM 2008

ICDE 2007

VLDB Demo track 2008

DEXA 2008

XANTEC 2008

WWW Poster track 2008

Apweb 2007, 2008

DNIS 2007

The International Conference on Web-Age Information Management (WAIM) 2005, 2006, 2007, 2008

International Conference on Database Systems for Advanced Applications (DASFAA) 2006, 2007, 2008

ICDE Text Mining Workshop 2007

International Workshop on High Performance Data Mining and Applications (HPDMA'07) 2007

The 2nd International Conference on Availability, Reliability and Security (AReS) 2007, 2008

18th IEEE International Conference on Tools with Artificial Intelligent (ICTAI) 2006

Conference on Information and Knowledge Management (CIKM) 2006

DEXA workshop on Data Management in Global-Scale Data Repositories (GRep) 2005 2006

 

Reviewer:

IEEE Transactions on Knowledge and Data Engineering (TKDE);

VLDB Journal;

Bioinformatics;

IEEE Intelligent Systems Special issue on Data Mining for Bioinformatics;

Journal of Bioinformatics and Computational Biology (JBCB);

International Journal of Information Technology;

ICDE 2005, BNCOD 2005, EDBT 2006, COMAD 2006, WWW2006, VLDB 2006, SSDBM2007, SIGMOD 2007

 

Others

The Chinese webpage that we created for the School of Informatics, University of Edinburgh