Resources

Newsgroup

Past Class Presentation Schedule

Class meets every Wednesday, 4:00-5:00pm, 3403 SC

Two students present per class unit. Each research paper gets a 20-minute presentation and a 5-minute discussion, so two papers are covered per unit. When needed, we may occasionally extend the discussion to 30 minutes and cover only one paper in that unit.

You are encouraged to select papers you believe are interesting and of excellent quality.  We strongly recommend presenting papers published or to be published in 2004 or 2005.  Please discuss your selection with the course instructor before you finalize it.

Recommended conference proceedings: KDD, SIGMOD, VLDB, PODS, ICDE, WWW, ICDM, SDM, ICML, EDBT, CIKM, PKDD, PAKDD, SSDBM, etc.  You can access SIGMOD/PODS04 E-Proceedings, KDD04 E-Proceedings, VLDB04 E-Proceedings, ICDE05 E-Proceedings, SIGMOD/PODS05 E-Proceedings, KDD05 E-Proceedings, VLDB05 E-Proceedings, ICDE06 E-Proceedings, SIGMOD/PODS06 E-Proceedings, KDD06 E-Proceedings, VLDB06 E-Proceedings, ICDE07 E-Proceedings, SIGMOD/PODS07 E-Proceedings, KDD07 E-Proceedings, VLDB07 E-Proceedings, and ICDM07 E-Proceedings by clicking the corresponding links.  Use CiteSeer or other Web services to find the papers you want to select.

Recommended journals:  IEEE TKDE, DMKD (Data Mining and Knowledge Discovery), KDD Explorations, Machine Learning, ACM Trans. Database Systems, JIIS, Information Systems, VLDB Journal, Data and Knowledge Engineering, Knowledge and Information Systems (KAIS), etc.

A survey paper will usually be allocated one full slot.

For every paper to be presented, the presenter must distribute the e-paper (one week before the presentation) and the e-slides (4 hours before the presentation).


Presentations for CS591 (Data Mining) Fall 2007


Week 1.   (Aug. 29) 

Jiawei Han:  Class organization and recent research focus of our data mining research group

 

Week 2.   (Sept. 5) 

1.      Tianyi Wu: Tianyi Wu, Yuguo Chen and Jiawei Han, “Association Mining in Large Databases: A Re-Examination of Its Measures”, in Proc. 2007 Int. Conf. on Principles and Practice of Knowledge Discovery in Databases (PKDD'07), Warsaw, Poland, Sept. 2007.

2.      Chen Chen: Chen Chen, Xifeng Yan, Philip S. Yu, Jiawei Han, DongQing Zhang, and Xiaohui Gu, “Towards Graph Containment Search and Indexing”, in Proc. 2007 Int. Conf. on Very Large Data Bases (VLDB'07), Vienna, Austria, Sept. 2007.

 

Week 3.   (Sept. 12) 

1.      Deng Cai: Deng Cai, Xiaofei He, and Jiawei Han, “Efficient Kernel Discriminant Analysis via Spectral Regression”, Proc. 2007 Int. Conf. on Data Mining (ICDM'07), Omaha, NE, Oct. 2007.

 

Week 4.   (Sept. 19)   No Class (PKDD conference)

 

Week 5.   (Sept. 26)

1.      Peixiang Zhao: Peixiang Zhao, Jeffrey Xu Yu, Philip S. Yu. Graph Indexing: Tree + Delta >= Graph. VLDB 2007: 938-949

2.      Chen Chen: Chen Chen, Xifeng Yan, Feida Zhu, and Jiawei Han, “gApprox: Mining Frequent Approximate Patterns from a Massive Network”, Proc. 2007 Int. Conf. on Data Mining (ICDM'07), Omaha, NE, Oct. 2007.

 

Week 6.   (Oct. 3)

1.      Hector Gonzalez and Xiaolei Li: Report of VLDB’07 conference

2.      Deng Cai:  Deng Cai, Xiaofei He, and Jiawei Han, “A Unified Approach for Sparse Subspace Learning”, Proc. 2007 Int. Conf. on Data Mining (ICDM'07), Omaha, NE, Oct. 2007.

 

Week 7.   (Oct. 10) No Class (NSF Workshop)

 

Week 8.   (Oct. 17) 

1.      Feida Zhu: Feida Zhu, Xifeng Yan, Jiawei Han, and Philip S. Yu, “Efficient Discovery of Frequent Approximate Sequential Patterns”, Proc. 2007 Int. Conf. on Data Mining (ICDM'07), Omaha, NE, Oct. 2007.

2.      Jing Gao: Jing Gao, Wei Fan, and Jiawei Han, “On Appropriate Assumptions to Mine Data Streams: Analysis and Practice”, Proc. 2007 Int. Conf. on Data Mining (ICDM'07), Omaha, NE, Oct. 2007.

Note: Feida’s presentation will be replaced by one from Xide Lin on the following paper: Haoliang Jiang, Haixun Wang, Philip S. Yu, Shuigeng Zhou, “GString: A Novel Approach for Efficient Search in Graph Databases”, ICDE 07.

 

Week 9.   (Oct. 24) 

1.      Sangkyum Kim: Xiaowei Xu, Nurcan Yuruk, Zhidan Feng, and Thomas A. J. Schweiger, “SCAN: A Structural Clustering Algorithm for Networks”, The Thirteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Jose, CA, Aug. 12-15, 2007
paper: http://ifsc.ualr.edu/xwxu/publications/kdd07.pdf
slide: http://velblod.videolectures.net/kdd/2007/kdd07_sanjose/xu_xiaowei/xiaowei-xu-scan.ppt

2.      Boling Ding:  Cuiping Li, Anthony Tung, Wen Jin, and Martin Ester. On Dominating Your Neighborhood Profitably. VLDB 2007, www.cs.uiuc.edu/homes/hanj/refs/vldb07/papers/research/p818-li.pdf

Related reference papers:

Cuiping Li, Beng Chin Ooi, Anthony K. H. Tung, and Shan Wang. DADA: A Data Cube for Dominant Relationship Analysis. SIGMOD 2006 www.cs.uiuc.edu/homes/hanj/refs/sigmodpods06/sigmod/p659.pdf

Jon M. Kleinberg, Christos H. Papadimitriou, Prabhakar Raghavan. A Microeconomic View of Data Mining. Data Min. Knowl. Discov. 1998 http://www.cs.cornell.edu/home/kleinber/dmkd98-seg.pdf

Jon M. Kleinberg, Christos H. Papadimitriou, Prabhakar Raghavan. Segmentation Problems. STOC 1998 (Journal version: JACM 2004) http://www.cs.cornell.edu/home/kleinber/jacm04-seg.pdf

 

Week 10.   (Oct. 31)

1.      Yizhou Sun: Motoki Shiga, Ichigaku Takigawa, and Hiroshi Mamitsuka. “A Spectral Clustering Approach to Optimally Combining Numerical Vectors with a Modular Network”, KDD 2007, http://www.cs.uiuc.edu/homes/hanj/refs/kdd07/docs/p647.pdf

2.      Xin Jin: Junsong Yuan, Ying Wu, Ming Yang: "From frequent itemsets to semantically meaningful visual patterns". KDD 2007: 864-873  http://www.cs.uiuc.edu/homes/hanj/refs/kdd07/docs/p864.pdf

 

Week 11.   (Nov. 7)

1.      Deng Cai, Chen Chen, Jing Gao, and Feida Zhu:  Report of ICDM’07 conference

2.      Chandrasekar Ramachandran: Xin Dong, Alon Y. Halevy: "Indexing dataspaces". SIGMOD Conference 2007: 43-54  http://www.cs.uiuc.edu/homes/hanj/refs/sigmodpods07/sigmod/p43.pdf

                                                              

Week 12.   (Nov. 14)

1.      Zhenhui Li: Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. YAGO: A Core of Semantic Knowledge Unifying WordNet and Wikipedia, WWW 2007.

2.      Zhijun Yin:  Pedro DeRose, Warren Shen, Fei Chen, AnHai Doan (University of Wisconsin, USA), Raghu Ramakrishnan (Yahoo! Research, USA): Building Structured Web Community Portals: A Top-Down, Compositional, and Incremental Approach, VLDB07.

Week 13.   (Nov. 21)  No class (Thanksgiving break)

 

Week 14.   (Nov. 28)

1.      Jae-Gil Lee: Jae-Gil Lee, Jiawei Han, and Xiaolei Li, "Trajectory Outlier Detection: A Partition-and-Detect Framework", Proc. 2008 Int. Conf. on Data Engineering (ICDE'08), Cancun, Mexico, April 2008.

2.      Hong Cheng: Hong Cheng, Xifeng Yan, Jiawei Han, and Philip S. Yu, "Direct Discriminative Pattern Mining for Effective Classification", Proc. 2008 Int. Conf. on Data Engineering (ICDE'08), Cancun, Mexico, April 2008.

 

Week 15.   (Dec. 5) 

1.      Ok-Ran Jeong: Jun Zhu, Zaiqing Nie, Ji-Rong Wen, Bo Zhang and Hsiao-Wuen Hon, “Webpage Understanding: an Integrated Approach”, KDD 2007.