SEI(IIS): MotionEye: Querying and Mining Large Datasets of Moving
Objects
Contact Information
Jiawei
Han, PI
Department of Computer Science
University of Illinois, Urbana-Champaign
1304 West Springfield Ave. , Urbana, Illinois 61801 U.S.A.
Office: (217) 333-6903, Fax: (217) 265-6494
E-mail: hanj at cs.uiuc.edu, URL: http://www.cs.uiuc.edu/~hanj
List
of Supported Students and Staff
§ Hector Gonzalez, Ph.D.
student, Department of Computer Science, University of Illinois at Urbana-Champaign
(graduate 2008, now at Google Research)
§ Xiaolei Li, Ph.D. student, Department
of Computer Science, University of Illinois at Urbana-Champaign (graduate 2008,
now at Microsoft)
§ Tanyi Wu,
Ph.D. student, Department of Computer Science, University of Illinois at
Urbana-Champaign
§ Deng Cai, Ph.D. student, Department
of Computer Science, University of Illinois at Urbana-Champaign
§ Jae-Gil Lee, Postdoc and research scientist, Department of Computer
Science, University of Illinois at Urbana-Champaign (now at IBM Research, Almaden)
Project
Award Information
- Award Number: IIS-05-13678
- Duration: July 15, 2005―June 30, 2008
- Title: SEI(IIS): MotionEye:
Querying and Mining Large Datasets of Moving Objects
- Keywords: Spatiotemporal data mining, motion
mining, moving objects, scalable mining algorithms, data mining
applications
Project
Summary
The design
and development of powerful mechanisms for managing and mining large datasets
of moving objects information is an emerging direction in science and engineering
informatics. The subject becomes increasingly important as the world, as well
as national security threats, become more mobile. This project investigates the issues related
to the design and development of innovative methods for querying, analyzing,
and mining of spatiotemporal information to find typical characteristics of the
moving objects trajectories, and uncover suspicious motion in large datasets of
moving objects. The moving objects
datasets are in the form of either stored data or transient data streams. The project designs and implements a MotionEye system prototype which consists of four
subsystems: MotionQuest(DB) and MotionQuest(Stream) for
querying and hypothesis validation in moving objects databases and data streams
respectively, and MotionMine(DB) and MotionMine(Stream) for data mining in moving objects
databases and data streams respectively.
The project investigates efficient and effective approaches to the
implementation of these subsystems. The
project also strives to ensure that the developed technology will not sacrifice
individual privacy. The project will
enable the development of more advanced information systems in homeland
security, law enforcement, traffic control, and other domains that deal with
moving objects.
Publications and Products:
Journal articles (including accepted)
- Xiaoxin Yin, Jiawei Han and Philip S.
Yu, "Truth
Discovery with Multiple Conflicting Information Providers on the Web",
IEEE Transactions on Knowledge and Data Engineering, 20(6):796-808, 2008.
- Xiaofei He, Deng Cai,
Jiawei Han, Learning a
Maximum Margin Subspace for Image Retrieval, IEEE Transactions on
Knowledge and Data Engineering, 20(2):189-201, 2008.
- Deng Cai,
Xiaofei He, Jiawei Han, SRDA: An
Efficient Algorithm for Large Scale Discriminant
Analysis, IEEE Transactions on Knowledge and Data Engineering,
20(1):1-12, 2008.
- Jianyong
Wang, Jiawei Han, and Chun Li, Frequent Closed Sequence Mining without Candidate
Maintenance, IEEE Transactions on Knowledge and Data
Engineering, 19(8), 2007.
- Chulyun
Kim, Sangkyum Kim, Russell Dorer, Dan Xie, Jiawei Han, and Sheng
Zhong, TagSmart: Analysis and Visualization for Yeast Mutant
Fitness Data Measured by Tag Microarrays, BMC Bioinformatics,
8:128, April 2007.
- Jiawei
Han, Hong Cheng, Dong Xin, and Xifeng Yan, Frequent Pattern Mining: Current Status and Future
Directions, Data Mining and Knowledge Discovery, 15(1):
55-86, 2007. (Online version published on January 27, 2007, DOI
10.1007/s10618-006-0059-1 SpringerLink).
- Dong
Xin, Jiawei Han, Xiaolei Li, Zheng Shao, and Benjamin W. Wah, Computing Iceberg Cubes by Top-Down and Bottom-Up
Integration: The StarCubing Approach, IEEE Transactions
on Knowledge and Data Engineering, 19(1): 111-126, 2007.
8.
Chao
Liu, Long Fei, Xifeng Yan, Jiawei Han and Samuel P. Midkiff, Statistical Debugging: A Hypothesis Testing-based Approach,
IEEE Transaction on Software Engineering, 32(10): 831-848, 2006.
9.
Yixin
Chen, Guozhu Dong, Jiawei Han, Jian Pei, Benjamin W. Wah, and Jianyong Wang, Regression Cubes with Lossless Compression and Aggregation,
IEEE Transactions on Knowledge and Data Engineering, 18(12): 1585-1599, 2006.
10.
Jiangyong
Wang, Jiawei Han, and Jian Pei, Closed Constrained-Gradient Mining in Retail Databases,
IEEE Transactions on Knowledge and Data Engineering, 18(6): 764-769, 2006.
11.
Xiaoxin
Yin, Jiawei Han, Jiong Yang and Philip S. Yu, Efficient Classification across Multiple Database Relations:
A CrossMine Approach, IEEE Transactions on Knowledge and
Data Engineering, 18(6): 770-783, 2006.
12.
Charu
Aggarwal, Jiawei Han, Jianyong Wang, and Philip S. Yu, A Framework for On-Demand Classification of Evolving Data
Streams, IEEE Transactions on Knowledge and Data
Engineering, 18(5):577-789, 2006.
13.
Hwanjo
Yu, Jiong Yang, Jiawei Han, and Xiaolei Li, Making SVM Scalable to Large Data Sets Using Hierarchical
Indexing, Data Mining and Knowledge Discovery, 11(3):
295-321, 2005.
14.
Jiawei
Han, Yixin Chen, Guozhu Dong, Jian Pei, Benjamin W. Wah, Jianyong Wang, and Y.
Dora Cai, Stream Cube: An Architecture for Multi-Dimensional Analysis
of Data Streams, Distributed and Parallel Databases, 18(2):
173-197, 2005.
15.
Xifeng
Yan, Philip Yu, and Jiawei Han, Graph Indexing Based on Discriminative Frequent Structure
Analysis, ACM Transactions on Database Systems, 30(4):
960-993 2005.
16.
Deng
Cai, Xiaofei He and Jiawei Han, Document Clustering Using Locality Preserving Indexing,
IEEE Transactions on Knowledge and Data Engineering, 17(12):1624-1637, 2005.
17.
C.
Aggarwal, J. Han, J. Wang, and P. S. Yu, On Efficient Algorithms for High Dimensional Projected
Clustering of Data Streams, Data Mining and Knowledge
Discovery, 10:251-272, 2005.
18.
Petre
Tzvetkov, Xifeng Yan, Jiawei Han, TSP: Mining top-k closed sequential patterns,
Knowl. Inf. Syst., 7(4): 438-457, 2005.
19.
J.
Wang, J. Han, Y. Lu, and P. Tzvetkov, TFP: An Efficient Algorithm for Mining Top-K Frequent Closed
Itemsets, IEEE Transactions on Knowledge and Data
Engineering, 17(5):652-664, 2005.
20.
K.
Wang, Y. Jiang, J. X. Yu, G. Dong, and J. Han, Divide-and-Approximate: A Novel Constraint Push Strategy for
Iceberg Cube Mining, IEEE Transactions on Knowledge and Data
Engineering, 17(3):354-368, 2005.
21.
Xiaofei
He, Deng Cai, Haifeng Liu, Jiawei Han, Image clustering with tensor representation,
ACM Multimedia 2005:132-140
Book
and Book Chapters
- Xifeng
Yan and Jiawei Han, Discovery of Frequent Substructures, in D. Cook and
L. Holder (ed.), Mining Graph Data, John Wiley & Sons, pp. 99-115,
2007.
- Jiawei
Han and Micheline Kamber, Data Mining: Concepts and Techniques,
(Foreword by Jim Gray), 2nd ed., Morgan Kaufmann, 2006.
- Charu
C. Aggarwal, Jiawei Han, Jianyong Wang and Philip S. Yu, On Clustering
Massive Data Streams: A Summarization Paradigm, in C. C. Aggarwal (ed.),
Data Streams: Models and Algorithms, Kluwer Academic Publishers, pp. 9-38,
2006.
- Jiawei
Han, Y. Dora Cai, Yixin Chen, Guozhu Dong, Jian Pei, Benjamin W. Wah, and Jianyong
Wang, Multi-Dimensional Analysis of Data Streams Using Stream Cubes, in
C. C. Aggarwal (ed.), Data Streams: Models and Algorithms, Kluwer Academic
Publishers, pp. 103-126, 2006.
- Jiawei
Han, Hector Gonzalez, Xiaolei Li, and Diego Klabjan, Warehousing and Mining Massive RFID Data Sets
(an invited paper), in Xue Li, Osmar R. Zaiane, Zhanhuai Li (eds.), Proc.
2006 Int. Conf. Advanced Data Mining and Applications (ADMA'06), Xi'An,
China, August 2006, pp. 1-18. (Lecture Notes in Computer Science, Vol.
4093, Springer Berlin/Heidelberg, 2006).
- X.
Yin, J. Han, J. Yang and P. S. Yu, CrossMine: Efficient Classification
across Multiple Database Relations, in Jean-Francois Boulicaut, Luc de
Raedt, and Heikki Mannila (eds.), Constraint-Based Mining and Inductive
Databases, Springer-Verlag LNAI vol. 3848, pp. 172-195, 2006.
- Jiawei
Han, Benjamin W. Wah, Vijay Raghavan, Xindong Wu, and Rajeev Rastogi
(eds.), Proceedings of the Fifth Int. Conf. on Data Mining
(ICDM-2005), (Houston, Texas, Nov. 27--30, 2005) IEEE
Computer Society, New York, 2005. (846 + xxvii pages).
- J.
Yang, X. Yan, J. Han, and W. Wang, Discovering Evolutionary Classifier over High Speed
Non-Static Stream, in S. Bandyopadhyay et al.
(eds.), Advanced Methods for
Knowledge Discovery from Complex Data, Springer Verlag, 2005.
- J.
Han, J. Pei, and X. Yan, Sequential Pattern Mining by Pattern-Growth: Principles
and Extensions, in W. W. Chu and T. Y. Lin (eds.),
Recent Advances in Data Mining and Granular Computing (Mathematical
Aspects of Knowledge Discovery), Springer Verlag, 2005.
Refereed
Conference Publications (Refereed
Workshop Publications are omitted due to limited space)
1. Chen
Chen, Xifeng Yan, Feida Zhu, Jiawei Han, and Philip S. Yu, "Graph OLAP: Towards Online Analytical Processing on Graphs",
Proc. 2008 Int. Conf. on Data Mining (ICDM'08), Pisa, Italy, Dec. 2008.
2. Deng
Cai, Xiaofei He, Xiaoyun Wu, and Jiawei Han, Non-negative Matrix Factorization on Manifold,
Proc. 2008 Int. Conf. on Data Mining (ICDM'08), Pisa, Italy, Dec. 2008.
3. Cindy
Xide Lin, Bolin Ding, Jiawei Han, Feida Zhu, and Bo Zhao, "Text Cube: Computing IR Measures for Multidimensional Text
Database Analysis", Proc. 2008 Int. Conf. on Data
Mining (ICDM'08), Pisa, Italy, Dec. 2008.
4. Luiz
Mendes, Bolin Ding, and Jiawei Han, "Stream Sequential Pattern Mining with Precise Error Bounds",
Proc. 2008 Int. Conf. on Data Mining (ICDM'08), Pisa, Italy, Dec. 2008.
5. Mohammad
Masud, Jing Gao, Latifur Khan, Jiawei Han, and Bhavani Thuraisingham, "A Practical Approach to Classify Evolving Data Streams:
Training with Limited Amount of Labeled Data", Proc.
2008 Int. Conf. on Data Mining (ICDM'08), Pisa, Italy, Dec. 2008.
15.
Tianyi
Wu, Dong Xin, and Jiawei Han, ARCube: Supporting Ranking Aggregate Queries in Partially
Materialized Data Cubes,
Proc. 2008 ACM SIGMOD Int. Conf. on Management of Data (SIGMOD'08),
Vancouver, BC, Canada, June 2008.
16.
Xifeng
Yan, Hong Cheng, Jiawei Han, and Philip S. Yu, Mining Significant Graph Patterns by Scalable Leap Search, Proc. 2008 ACM SIGMOD Int. Conf. on
Management of Data (SIGMOD'08), Vancouver, BC, Canada, June 2008.
17.
Xiaolei
Li, Jiawei Han, Zhijun Yin, Jae-Gil Lee, and Yizhou Sun, Sampling Cube: A Framework for Statistical OLAP over
Sampling Data, Proc. 2008 ACM SIGMOD Int. Conf. on
Management of Data (SIGMOD'08), Vancouver, BC, Canada, June 2008.
18.
Yizhou
Sun, Tianyi Wu, Hong Cheng, Jiawei Han, Xiaoxin Yin, and Peixiang Zhao, BibNetMiner: Mining Bibliographic Information Networks,
(demo paper), Proc. 2008 ACM SIGMOD Int. Conf. on Management of Data
(SIGMOD'08), Vancouver, BC, Canada, June 2008.
19.
Sangkyum
Kim, Xin Jin and Jiawei Han, SpaRClus: Spatial Relationship Pattern-Based Hierarchical
Clustering, Proc. 2008 SIAM Int. Conf. on Data Mining
(SDM'08), Atlanta, GA, April 2008.
20.
Ding
Yuan, Kyuhyung Lee, Hong Cheng, Gopal Krishna, Zhenmin Li, Xiao Ma, Yuanyuan
Zhou and Jiawei Han, CISpan: Comprehensive Incremental Mining Algorithms of
Closed Sequential Patterns for Multi-Versional Software Mining,
Proc. 2008 SIAM Int. Conf. on Data Mining (SDM'08), Atlanta, GA, April 2008.
21.
Deng
Cai, Xiaofei He, and Jiawei Han, "Training Linear Discriminant Analysis in Linear Time",
Proc. 2008 Int. Conf. on Data Engineering (ICDE'08), Cancun, Mexico, April
2008.
22.
Hong
Cheng, Xifeng Yan, Jiawei Han, and Philip S. Yu, "Direct Discriminative Pattern Mining for Effective
Classification", Proc. 2008 Int. Conf. on Data
Engineering (ICDE'08), Cancun, Mexico, April 2008.
23. Jae-Gil
Lee, Jiawei Han, and Xiaolei Li, "Trajectory Outlier Detection: A Partition-and-Detect
Framework", Proc. 2008 Int. Conf. on Data Engineering
(ICDE'08), Cancun, Mexico, April 2008.
24.
Dong
Xin and Jiawei Han, "P-Cube: Answering Preference Queries in Multi-Dimensional
Space", Proc. 2008 Int. Conf. on Data Engineering
(ICDE'08), Cancun, Mexico, April 2008.
25. Chao
Liu, Xiangyu Zhang, Jiawei Han, Yu Zhang and Bharat K. Bhargava, Failure Indexing: A Dynamic Slicing Based Approach,
in Proc. 2007 IEEE Int. Conf. on Software Maintenance (ICSM'07), Paris, France,
Oct. 2007.
26. Deng
Cai, Xiaofei He, and Jiawei Han, A Unified Subspace Learning Framework for Content-Based
Image Retrieval, in Proc. 2007 Int. Conf. on ACM Multimedia
(ACM-MM'07), Augsburg, Germany, Sept. 2007.
27. Tianyi
Wu, Yuguo Chen and Jiawei Han, Association Mining in Large Databases: A Re-Examination of
Its Measures, in Proc. 2007 Int. Conf. on Principles and
Practice of Knowledge Discovery in Databases (PKDD'07), Warsaw, Poland, Sept.
2007.
28. Chen
Chen, Xifeng Yan, Philip S. Yu, Jiawei Han, DongQing Zhang, and Xiaohui Gu, Towards Graph Containment Search and Indexing,
in Proc. 2007 Int. Conf. on Very Large Data Bases (VLDB'07), Vienna, Austria,
Sept. 2007.
29. Hector
Gonzalez, Jiawei Han, Xiaolei Li, Margaret Myslinska, and John Paul Sondag, Adaptive Fastest Path Computation on a Road Network: A
Traffic Mining Approach, in Proc. 2007 Int. Conf. on Very
Large Data Bases (VLDB'07), Vienna, Austria, Sept. 2007.
30. Xiaolei
Li and Jiawei Han, Mining Approximate Top-K Subspace Anomalies in
Multi-Dimensional Time-Series Data, in Proc. 2007 Int. Conf.
on Very Large Data Bases (VLDB'07), Vienna, Austria, Sept. 2007.
31. Tainyi
Wu, Xiaolei Li, Dong Xin, Jiawei Han, Jacob Lee, and Ricardo Redder, DataScope: Viewing Database Contents in Google Maps' Way,
in Proc. 2007 Int. Conf. on Very Large Data Bases (VLDB'07), Vienna, Austria,
Sept. 2007 (system demo).
32. Xiaoxin
Yin, Jiawei Han, and Philip S. Yu, Truth Discovery with Multiple Conflicting Information
Providers on the Web, in Proc. 2007 ACM SIGKDD Int. Conf. on
Knowledge Discovery and Data Mining (KDD'07), San Jose, CA, Aug. 2007.
33. Xiaolei
Li, Jiawei Han, Jae-Gil Lee, and Hector Gonzalez, Traffic Density-based Discovery of Hot Routes in Road
Networks, in Proc. 2007 Int. Symp. on Spatial and Temporal
Databases (SSTD'07), Boston, MA, July 2007.
34. Deng
Cai, Xiaofei He and Jiawei Han, Isometric Projection, in Proc. 2007 AAAI
Conf. on Artificial Intelligence (AAAI-07), Vancouver, B. C., Canada, July
2007.
35. Wen
Jin, Anthony K.H. Tung, Martin Ester, and Jiawei Han, On Efficient Processing of Subspace Skyline Queries on High
Dimensional Data, in Proc. 2007 Int. Conf. on Scientific and
Statistical Database Management (SSDBM'07), Banff, Canada, July 2007.
36. Deng
Cai, Xiaofei He, Yuxiao Hu, Jiawei Han, and Thomas Huang, Learning a Spatially Smooth Subspace for Face Recognition,
in Proc. 2007 IEEE Conf. on Computer Vision and Pattern Recognition (CVPR'07),
Minneapolis, MN, June 2007.
37. Jae-Gil
Lee, Jiawei Han, and Kyu-Young Whang, Trajectory Clustering: A Partition-and-Group Framework,
in Proc. 2007 ACM SIGMOD Int. Conf. on Management of Data (SIGMOD'07), Beijing,
China, June 2007.
38. Dong
Xin, Jiawei Han, and Kevin C.-C. Chang, Progressive and Selective Merge: Computing Top-K with Ad-hoc
Ranking Functions, in Proc. 2007 ACM SIGMOD Int. Conf. on
Management of Data (SIGMOD'07), Beijing, China, June 2007.
39. Feida
Zhu, Xifeng Yan, Jiawei Han, and Philip S. Yu, gPrune: A Constraint Pushing Framework for Graph Pattern
Mining, in Proc. 2007 Pacific-Asia Conf. on Knowledge
Discovery and Data Mining (PAKDD'07), Nanjing, China, May 2007. (Best Student Paper Award)
40. Jiawei
Han, Hong Cheng, Dong Xin, and Xifeng Yan, Frequent Pattern Mining: Current Status and Future
Directions, Data Mining and Knowledge Discovery, 14(1),
2007. (Online version published on January 27, 2007, DOI 10.1007/s10618-006-0059-1
SpringerLink).
41. Jing
Gao, Wei Fan, and Jiawei Han, A General Framework for Mining Concept-Drifting Data Streams
with Skewed Distributions, in Proc. 2007 SIAM Int. Conf. on
Data Mining (SDM'07), Minneapolis, MN, April 2007.
42. Xiaolei
Li, Jiawei Han, Sangkyum Kim, and Hector Gonzalez, ROAM: Rule- and Motif-Based Anomaly Detection in Massive
Moving Object Data Sets, in Proc. 2007 SIAM Int. Conf. on
Data Mining (SDM'07), Minneapolis, MN, April 2007. (One of Best of
SDM07)
43. Hong
Cheng, Xifeng Yan, Jiawei Han, and Chih-Wei Hsu, Discriminative Frequent Pattern Analysis for Effective
Classification, in Proc. 2007 Int. Conf. on Data Engineering
(ICDE'07), Istanbul, Turkey, April 2007.
44. Feida
Zhu, Xifeng Yan, Jiawei Han, Philip S. Yu, and Hong Cheng, Mining Colossal Frequent Patterns by Core Pattern Fusion,
in Proc. 2007 Int. Conf. on Data Engineering (ICDE'07), Istanbul, Turkey, April
2007. (Best Student Paper Award)
45. Hector
Gonzalez, Jiawei Han, and Xuehua Shen, Cost-conscious Cleaning of Massive RFID Data Sets,
in Proc. 2007 Int. Conf. on Data Engineering (ICDE'07), Istanbul, Turkey, April
2007.
46. Xiaoxin
Yin, Jiawei Han, and Philip S. Yu, Object Distinction: Distinguishing Objects with Identical
Names by Link Analysis, in Proc. 2007 Int. Conf. on Data
Engineering (ICDE'07), Istanbul, Turkey, April 2007.
47. Wen
Jin, Martin Ester, Zengjian Hu, and Jiawei Han, The Multi-Relational Skyline Operator, in
Proc. 2007 Int. Conf. on Data Engineering (ICDE'07), Istanbul, Turkey, April
2007.
48. Deng
Cai, Xiaofei He, Kun Zhou, Jiawei Han and Hujun Bao, Locality Sensitive Discriminant Analysis,
in Proc. 2007 Int. Joint Conf. on Artificial Intelligence (IJCAI'07),
Hyderabad, India, Jan. 2007.
49. Chao
Liu, Zeng Lian, and Jiawei Han, How Bayesians Debug?, in Proc. 2006 Int.
Conf. on Data Mining (ICDM'06), Hong Kong, China, Dec. 2006.
50. Hong
Cheng, Philip S. Yu, and Jiawei Han, AC-Close: Efficiently Mining Approximate Closed Itemsets by
Core Pattern Recovery, in Proc. 2006 Int. Conf. on Data
Mining (ICDM'06), Hong Kong, China, Dec. 2006.
51. Chao
Liu and Jiawei Han, Failure Proximity: A Fault Localization-Based Approach,
in Proc. 14th ACM SIGSOFT Symposium on the Foundations of Software Engineering
(FSE'06), Portland, OR, Nov. 2006.
52. Hector
Gonzalez, Jiawei Han, and Xiaolei Li, Mining Compressed Commodity Workflows From Massive RFID Data
Sets, in Proc. 2006 Int. Conf. on Information and Knowledge
Management (CIKM'06), Arlington, VA, Nov. 2006.
53. Xiaoxin
Yin, Jiawei Han, and Philip Yu, LinkClus:
Efficient Clustering via Heterogeneous Semantic Links, in
Proc. 2006 Int. Conf. on Very Large Data Bases (VLDB'06), Seoul, Korea, Sept.
2006.
54. Hector
Gonzalez, Jiawei Han, and Xiaolei Li, FlowCube: Constructuing
RFID FlowCubes for Multi-Dimensional Analysis of Commodity Flows,
in Proc. 2006 Int. Conf. on Very Large Data Bases (VLDB'06), Seoul, Korea,
Sept. 2006.
55. Dong
Xin, Chen Chen, and Jiawei Han, Towards Robust
Indexing for Ranked Queries, in Proc. 2006 Int. Conf. on
Very Large Data Bases (VLDB'06), Seoul, Korea, Sept. 2006.
56. Dong
Xin, Jiawei Han, Hong Cheng, and Xiaolei Li, Answering Top-k
Queries with Multi-Dimensional Selections: The Ranking Cube Approach,
in Proc. 2006 Int. Conf. on Very Large Data Bases (VLDB'06), Seoul, Korea,
Sept. 2006.
57. Dong
Xin, Hong Cheng, Xifeng Yan, and Jiawei Han, Extracting Redundancy-Aware
Top-K Patterns, in Proc. 2006 ACM SIGKDD Int. Conf. on
Knowledge Discovery and Data Mining (KDD'06), Philadelphia, PA, Aug. 2006.
58. Qiaozhu
Mei, Dong Xin, Hong Cheng, ChengXiang Zhai, and Jiawei Han, Generating
Semantic Annotations for Frequent Patterns with Context Analysis,
in Proc. 2006 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining
(KDD'06), Philadelphia, PA, Aug. 2006. (Best Student
Paper Runner-Up Award)
59. Chao
Liu, Chen Chen, Jiawei Han, and Philip Yu, GPLAG: Detection
of Software Plagiarism by Procedure Dependency Graph Analysis,
in Proc. 2006 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining
(KDD'06), Philadelphia, PA, Aug. 2006.
60. Dong
Xin, Xuehua Shen, Qiaozhu Mei, and Jiawei Han, Discovering
Interesting Patterns Through User's Interactive Feedback, in
Proc. 2006 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining
(KDD'06), Philadelphia, PA, Aug. 2006.
61. Deng
Cai, Xiaofei He and Jiawei Han, Tensor Space Model
for Document Analysis, in Proc. 2006 Int. ACM SIGIR Conf. on
Research & Development on Information Retrieval (SIGIR'06), Seattle, WA,
Aug. 2006.
62. Hongyan
Liu, Ying Lu, Jiawei Han, and Jun He, Error-Adaptive and
Time-Aware Maintenance of Frequency Counts over Data Streams,
in Proc. 2006 Int. Conf. on Web-Age Information Management (WAIM'06), Hong
Kong, China, June, 2006.
63. Kaushik
Chakrabarti, Venkatesh Ganti, Jiawei Han, and Dong Xin, Ranking Objects
Based on Relationships, in Proc. 2006 ACM SIGMOD Int. Conf. on
Management of Data (SIGMOD'06), Chicago, IL, June 2006.
64. Xiaolei
Li, Jiawei Han, and Sangkyum Kim, Motion-Alert:
Automatic Anomaly Detection in Massive Moving Objects, Proc.
2006 IEEE Int. Conf. on Intelligence and Security Informatics (ISI'06), San
Diego, CA, May 2006.
65. Wen
Jin, Anthony K. H. Tung, Jiawei Han, and Wei Wang, Ranking Outliers
Using Symmetric Neighborhood Relationship, in Proc. 2006
Pacific-Asia Conf. on Knowledge Discovery and Data Mining (PAKDD'06),
Singapore, April 2006.
66. Hongyan
Liu, Jiawei Han, Dong Xin, and Zheng Shao, Mining Interesting
Patterns from Very High Dimensional Data: A Top-Down Row Enumeration Approach,
in Proc. 2006 SIAM Int. Conf. on Data Mining (SDM'06), Bethesda, MD, April
2006. (One of Best of SDM06)
67. Chao
Liu, Xifeng Yan, and Jiawei Han, Mining Control
Flow Abnormality for Logic Error Isolation, in Proc. 2006
SIAM Int. Conf. on Data Mining (SDM'06), Bethesda, MD, April 2006.
68. Charu
Aggarwal, Chen Chen, and Jiawei Han, On the Inverse
Classification Problem and its Applications, in Proc. 2006
Int. Conf. on Data Engineering (ICDE'06), Atlanta, Georgia, April 2006.
69. Hector
Gonzalez, Jiawei Han, Xiaolei Li, and
Diego Klabjan, Warehousing and
Analysis of Massive RFID Data Sets, in Proc. 2006 Int. Conf.
on Data Engineering (ICDE'06), Atlanta, Georgia, April 2006. (Best Student Paper Award)
70. Hongyan
Liu, Jiawei Han, Dong Xin, and Zheng Shao, Top-Down Mining of
Interesting Patterns from Very High Dimensional Data, in
Proc. 2006 Int. Conf. on Data Engineering (ICDE'06), Atlanta, Georgia, April
2006.
71. Dong
Xin, Jiawei Han, Zheng Shao, and Hongyan Liu, C-Cubing:
Efficient Computation of Closed Cubes by Aggregation-Based Checking,
in Proc. 2006 Int. Conf. on Data Engineering (ICDE'06), Atlanta, Georgia, April
2006.
72. Xifeng
Yan, Feida Zhu, Jiawei Han, and Philip Yu, Searching
Substructures with Superimposed Distance, in Proc. 2006 Int.
Conf. on Data Engineering (ICDE'06), Atlanta, Georgia, April 2006.
73. Deng
Cai, Zheng Shao, Xiaofei He, Xifeng Yan, Jiawei Han, Community Mining
from Multi-Relational Networks, in Proc. 2005 European Conf.
on Principles and Practice of Knowledge Discovery in Databases (PKDD'05),
Porto, Portugal, Oct., 2005.
74. Wen
Jin, Martin Ester and Jiawei Han, Efficient
Processing of Ranked Queries with Sweeping Selection, in Proc. 2005 European Conf. on Principles
and Practice of Knowledge Discovery in Databases (PKDD'05), Porto, Portugal,
Oct., 2005.
75. Xiaoxin
Yin and Jiawei Han, Efficient
Classification from Multiple Heterogeneous Databases, in
Proc. 2005 European Conf. on Principles and Practice of Knowledge Discovery in
Databases (PKDD'05), Porto, Portugal, Oct., 2005.
76. C.
Liu, X. Yan, L. Fei, J. Han, and S. Midkiff, SOBER: Statistical Model-based Bug Localization,
Proc. 2005 ACM SIGSOFT Symp. on the Foundations of Software Engineering (FSE
2005), Lisbon, Portugal, Sept. 2005.
77. D.
Xin, J. Han, X. Yan and H. Cheng, Mining Compressed Frequent-Pattern Sets, Proc.
2005 Int. Conf. on Very Large Data Bases (VLDB'05), Trondheim, Norway, Aug.
2005.
78. X.
Yan, H. Cheng, J. Han, and D. Xin, Summarizing Itemset Patterns: A Profile-Based Approach,
Proc. 2005 Int. Conf. on Knowledge Discovery and Data Mining (KDD'05), Chicago,
IL, Aug. 2005. (Best Student Paper Runner-Up Award)
79. X.
Yan, X. J. Zhou, and J. Han, Mining Closed Relational Graphs with Connectivity
Constraints, Proc. 2005 Int. Conf. on Knowledge Discovery
and Data Mining (KDD'05), Chicago, IL, Aug. 2005.
80. X.
Yin, J. Han, and P.S. Yu, Cross-Relational Clustering with User's Guidance,
Proc. 2005 Int. Conf. on Knowledge Discovery and Data Mining (KDD'05), Chicago,
IL, Aug. 2005.
81. S.
Cong, J. Han, and D. Padua, Parallel Mining of Closed Sequential Patterns,
Proc. 2005 Int. Conf. on Knowledge Discovery and Data Mining (KDD'05), Chicago,
IL, Aug. 2005.
82. D.
Cai and X. He. Orthogonal Locality Preserving Indexing,
Proc. 2005 Int. Conf. on Research and Development in Information Retrieval
(SIGIR'05), Salvador, Brazil, Aug. 2005.
Project
Impact
§ Education: Parts
of the new research results are used in Data Mining courses (CS412, CS512)
for both undergraduate and graduate students being taught in the Department of
Computer Science, the University of Illinois at Urbana-Champaign.
Moreover, the research results have been and will continuously be
published timely in international conferences and journals and be distributed
world-wide for education and research. The new progress will also be
integrated into the new edition of my data mining textbook and other research
collections.
§ Collaborations: For this project we have established collaborations
with IBM T.J. Watson Research Center, Microsoft Research, Boeing, Intel,
Google, and NCSA (National Center of Supercomputer Applications). Through
such collaborations we expect to have access to real datasets and applications
and produce more research results.
Current
and Future Activities
The
following are some of the highlights of our ongoing work. Please refer to the section: Publications and
Products section for related references
§ Development
of efficient and scalable mechanisms for mining moving object.
§ Development
of multi-dimensional stream data analysis techniques.
§ Development
of efficient and effective methods for RFID data warehousing and data mining.
§ Development
of efficient methods for mining frequent, sequential and structured patterns
This
project is based on the previous works on data mining, stream
data/query processing, and moving object databases.
There have been many research papers published on these themes.
Several textbooks provide good overviews of data mining principles and
algorithms, including (Han and Kamber, 2006), (Hand, Mannila, and
Smyth, 2001) and (Hastie, Tibshirani, and Friedman, 2001) and
moving object databases.
Area
References
1.
C.
Aggarwal, J. Han, J. Wang, and P. S. Yu. A Framework for Clustering Evolving Data Streams. VLDB 2003.
2.
B.
Babcock, S. Babu, M. Datar, R. Motwani, and J. Widom. Models and issues in data stream systems. POD 2002.
3.
Y.
Chen, G. Dong, J. Han, B. W. Wah, and J. Wang, Multi-Dimensional
Regression Analysis of Time-Series Data Streams, VLDB 2002.
4.
E.
Frentzos, K. Gratsias, N. Pelekis, and Y. Theodoridis. Nearest neighbor
search on moving object trajectories. Proc. 2005
Int. Symp. Spatial and Temporal Databases
(SSTD'05), 2005.
5.
H.
Gonzalez, J. Han, and X. Li, FlowCube: Constructuing RFID FlowCubes for
Multi-Dimensional Analysis of Commodity Flows, VLDB
2006.
6.
H.
Gonzalez, J. Han, X. Li, and D. Klabjan,
Warehousing and
Analysis of Massive RFID Data Sets, ICDE 2006.
7.
R.
H. Gueting and M. Schneider, Moving
Objects Databases, Morgan Kaufmann, 2005.
- Jiawei
Han and Micheline Kamber, Data Mining: Concepts and Techniques,
2nd ed., Morgan Kaufmann, 2006.
9.
T.
Hastie, R. Tibshirani, and J. Friedman. The Elements of Statistical Learning:
Data Mining, Inference, and Prediction, Springer-Verlag 2001.
10. X.
Li, J. Han, and S. Kim, Motion-Alert: Automatic Anomaly Detection in
Massive Moving Objects, Proc. 2006 IEEE Int. Conf. on
Intelligence and Security Informatics (ISI'06), San Diego, CA, May 2006.
11. T.
Zhang, R. Ramakrishnan, and M. Livny. BIRCH: An efficient data clustering method
for very large databases. SIGMOD 1996.
Potential
Related Projects
This
project is related to most of data mining and moving object data processing and
data mining. In particularly, it is related to P.I.'s NSF IIS 020-9199
(Mining Sequential and Structured Patterns: Scalability, Flexibility,
Extensibility and Applicability), and P.I.'s NSF IIS-03-08215 (Mining Dynamics
of Data Streams in Multi-Dimensional Space).
We wish to collaborate or exchange research ideas with most of the
research projects related to knowledge discovery in databases, querying and
mining moving object data sets, stream data processing, and their applications,
such as homeland security, etc.
Project
Web site URL: http://www.cs.uiuc.edu/~hanj/projs/motioneye.htm
Online
software: Online software related to this project can be
downloaded at www.illimine.cs.uiuc.edu
Online
resources: Research publications related to this project can be
downloaded at Selected Publications