Gao Cong   (丛 高)

 

Department of Computer Science,  Aalborg University,  Denmark

Center for Data-intensive Systems (Daisy).

Email    gaocong at cs.aau.dk

 

 

Short Biography

2008- Assistant Professor in Aalborg University, Denmark

2007 - 2008: Researcher in Microsoft Research Asia, Beijing

2004 - 2006:  Postdoc Research Fellow in the Database group within the Laboratory for Foundations of Computer Science within the School of Informatics of the University of Edinburgh.

2000-2003: Ph.D. student in the Department of Computer Science, NUS, Singapore.

1999-2000: ShenZhen Huawei Company, China

1992-1999: M. Eng and B.Eng. in MIS from Tianjin University, Tianjin, China. 

Research Interests

     

Database, Data Mining, Text Mining, Information Retrieval   

My current research:

·         Mining forums (including Community Question Answering) and social network

o   Extracting Question Answer pairs from forums to enrich the knowledge base of CQA services (related papers: SIGIR08, ACL08 )

o   Routing questions to expert users (related paper: ICDE09)

o   The use of categorization information to improve question search (related paper: CIKM09)

o   Computing link-based similarity (related paper: ICDM09)

·         Integrating information retrieval and database systems 

o   Efficient processing of spatial keyword queries (related paper: VLDB09)

My past research interests include classifying gene expression data, mining frequent patterns, XML, etc.

Publications  (By topics)

 

1.      Yuanzhe Cai, Gao Cong, Xu Jia, Hongyan Liu, Jun He, Jiaheng Lu, Xiaoyong Du. Efficient Algorithms for Computing Link-based Similarity in Real World Networks. Proceedings of the IEEE International Conference on Data Mining (ICDM). 2009 (Short Paper)

2.      Xin Cao, Gao Cong, Bin Cui, Christian S. Jensen, Ce Zhang: The Use of Categorization Information in Language Models for Question Retrieval. Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM) 2009 (supplementary materials on dataset)

3.      Gao Cong, Christian S. Jensen, Dingming Wu: Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects. Proceedings of 35th International Conference on Very Large Data Bases (VLDB).2009

4.      Yanhong ZhouGao CongBin CuiChristian S. Jensen,  Junjie Yao: Routing Questions to Right Users in Online Communities. Proceedings of the ICDE 2009

5.      Ce Zhang, Cui Bin, Gao Cong, YuJing Wang: A Revisit of Query Expansion with Different Semantic Levels, Proceedings of the DASFAA 2009

6.      Gao Cong, Long Wang, Chin-Yew Lin, Y.I. Song, Y. Sun: Finding Question-Answer Pairs from Online Forums. Proceedings of the 31st Annual International ACM SIGIR Conference, 2008

7.      Shilin Ding, Gao Cong, Chin-Yew Lin and Xiaoyan Zhu:  Using Conditional Random Fields to Extract Contexts and Answers of Questions from Online Forums. The 46th Annual Meeting of the Association for Computational Linguistics. ACL 2008 supplementary materials

8.      Ce Zhang, YuJing Wang, Bin Cui, Gao Cong: Semantic Similarity Based on Compact Concept Ontology.   Proceedings of the WWW 2008 (Poster)

9.      Byron Choi, Gao Cong, Wenfei Fan, Stratis D. Viglas: Updating Recursive XML Views of Relations. Journal of Computer Science and Technology 2008

10.  Gao Cong, Wenfei Fan, Floris Geerts, Xibei Jia, Shuai Ma: Improving Data Quality: Consistency and Accuracy. Proceedings of the VLDB 2007

11.  Gao Cong: Query and Update Through XML Views. Proceedings of the DNIS 2007 (Invited paper)

12.  Wenfei Fan, Gao Cong, Philip Bohannon: Query XML with Update Syntax. Proceedings of ACM International Conference on Management of Data (SIGMOD) 2007

13.  Gao Cong, Wenfei Fan, Anastasios Kementsietsidis. Distributed Query Evaluation with Performance Guarantees. Proceedings of ACM International Conference on Management of Data (SIGMOD) 2007

14.  Guihua Sun, Gao Cong, Xiaohua Liu, Chin-Yew Lin, Ming Zhou. Detecting Erroneous Sentences using labeled sequential Patterns and Tree Patterns. Proceedings of the AAAI 2007.

15.  Guihua Sun, Xiaohua Liu, Gao Cong, Ming Zhou, Zhongyang Xiong, Chin-Yew Lin, John Lee. Detecting Erroneous Sentences using Automatically Mined Sequential Patterns. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. ACL 2007.

16.  Byron Choi, Gao Cong, Wenfei Fan, Stratis D. Viglas Updating recursive XML views of relations. Proceedings of the 23rd International Conference on Database Engineering (ICDE), 2007.

17.   Gao Cong, Wenfei Fan, Floris Geerts . Annotation Propagation Revisited for Key Preserving Views . Proceedings of the 15th ACM Conference on Information and Knowledge Management (CIKM), 2006.

18.   Peter Buneman, Gao Cong, Wenfei Fan, Anastasios Kementsietsidis. Using Partial Evaluation in Distributed Query Evaluation. Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB), 2006

19.  Bin Cui, Jialie Shen, Gao Cong, Heng Tao Shen, Cui Yu. Composite Acoustic Features for Efficient Music Similarity Query. Proceedings of 14th ACM International Conference on Multimedia (ACM MM), 2006.

20.  Hanyu Li,  Mong-Li LeeWynne Hsu, and Gao Cong: An Estimation System for XPath Expressions. Proceedings of the 22th IEEE International Conference on Data Engineering (ICDE)2006

21.  Gao Cong, Bin Cui, Yingxin Li, Zonghong Zhang. Summarizing frequent patterns using profiles. Proceedings of the 9th International Conference on Database Systems for Advanced Applications (DASFAA) 2006

22.   Gao Cong, Kian-Lee Tan, Anthony K. H. Tung, Xin Xu. Mining Top-k Covering Rule Groups for Gene Expression Data. Proceedings of the ACM International Conference on Management of Data (SIGMOD) 2005

23.  Bin Cui, Anirban Mondal, Jialie Shen, Gao Cong, Kian-Lee Tan: On effective E-mail Classification via Neural Networks. Proceedings of the 16th International Conference on Database and Expert Systems Applications (DEXA) 2005

24.  Gao Cong, Kian-Lee Tan, Anthony K. H. Tung, Feng Pan: Mining Frequent Closed Patterns in Microarray Data. Proceedings of the IEEE International Conference on Data Mining, (ICDM). 2004

25.  Cuiping Li, Gao Cong, Anthony K. H. Tung Shan Wang. Large Incremental Maintenance of Quotient Cube for Sum and Median. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) 2004

26.  Xin Xu, Gao Cong, Beng Chin Ooi, Kian-Lee Tan, Anthony K. H. Tung: Semantic Mining and Analysis of Gene Expression Data (Demo). Proceedings of the 30th International Conference on Very Large Data Bases (VLDB) 2004

27.  Gao Cong, Anthony K. H. Tung, Xin Xu, Feng Pan, Jiong Yang. FARMER: Fining Interesting Association Rule Groups by Row Enumeration in Biological Datasets. Proceedings of the 23rd ACM International Conference on Management of Data (SIGMOD) 2004

28.  Feng Pan, Anthony K. H. Tung, Gao Cong, Xin Xu. COBBLER: Combining Column and Row Enumeration for Closed Pattern Discovery. Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM) 2004

29.  Gao Cong, Beng Chin Ooi, Kian-Lee Tan, Anthony K. H. Tung. Go Green: Recycle and Reuse Frequent Patterns. Proceedings of the 20th IEEE International Conference on Data Engineering (ICDE) 2004

30.  Gao Cong, Weesun Lee, Haoran Wu, Bing Liu. Semi-Supervised Text Classification Using Partitioned EM. Proceedings of the 9th International Conference on Database Systems for Advanced Applications (DASFAA) 2004

31.  Feng Pan, Gao Cong, Anthony K. H. Tung, Jiong Yang, Mohammed J. Zaki. CARPENTER: Finding Closed Patterns in Long Biological Datasets. Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) 2003

32.  Gao Cong, Bing Liu: Speed-up Iterative Frequent Itemset Mining with Constraint Changes. Proceedings of the IEEE International Conference on Data Mining, (ICDM). 2002

33.  Gao Cong, Lan Yi, Bing Liu, Ke Wang: Discovering frequent substructures from hierarchical semi-structured data.  Proceedings of the Second SIAM International Conference on Data Mining, (SDM). 2002

 

Teaching

            Internet Technologies  2009

            Database Technology 2009 (with Simonas, and Ira)

            2 SW3 Groups 2009

            Semester Coordinator for SW3 2009

            1 DAT5 Group 2009

            1 DAT4 Group 2009

            Data warehouse and Data mining  2008 Fall (with Thomas and Manfred)

Introduction to Internet Technologies 2008 (with Ken, Man Lung Yiu)

Database Technology 2008 (with my colleagues)

1 DAT5 Group 2008 (Co-supervised with Torben)

 

 

 

Services

          Seminar coordinator for Edinburgh Database Group's weekly seminar           Feb. - Sept. 2006

Program Committee

VLDB 2009

ICDE 2007, 2009

The IEEE international conference on Data Mining (ICDM). 2008, 2009.

The 2009 International Conference on Advanced in Social Network Analysis  and Data Mining

The 1st International Workshop on Web-based Contents Management Technologies  2009

PAKDD 2009

SIGMOD 2008

KDD 2008

VLDB Demo track 2008

DEXA 2008, 2009, 2010

XANTEC 2008

WWW Poster track 2008

Apweb 2007, 2008, 2009

DNIS 2007

The International Conference on Web-Age Information Management (WAIM) 2005, 2006, 2007, 2008

International Conference on Database Systems for Advanced Applications (DASFAA) 2006, 2007, 2008, 2010

ICDE Text Mining Workshop 2007

International Workshop on High Performance Data Mining and Applications (HPDMA'07) 2007

The 2nd International Conference on Availability, Reliability and Security (AReS) 2007, 2008

18th IEEE International Conference on Tools with Artificial Intelligent (ICTAI) 2006

Conference on Information and Knowledge Management (CIKM) 2006

DEXA workshop on Data Management in Global-Scale Data Repositories (GRep) 2005 2006

 

Reviewer:

IEEE Transactions on Knowledge and Data Engineering (TKDE);

VLDB Journal;

Bioinformatics;

IEEE Intelligent Systems Special issue on Data Mining for Bioinformatics;

Journal of Bioinformatics and Computational Biology (JBCB);

International Journal of Information Technology;

ICDE 2005, BNCOD 2005, EDBT 2006, COMAD 2006, WWW2006, VLDB 2006, SSDBM2007, SIGMOD 2007

 

Others

The Chinese webpage that we created for the School of Informatics, University of Edinburgh