Progressive and Interactive Analysis of Event Data Using Event Miner

Exploring large data sets typically involves activities that iterate between data selection and data analysis, in which insights obtained from analysis result in new data selection. Further, data analysis needs to use a combination of analysis techniques: data summarization, mining algorithms and visualization. This interweaving of functions arises both from the semantics of what the analyst hopes to achieve and from scalability requirements for dealing with large data volumes. We refer to such a process as a progressive analysis. herein is described a tool, Event Miner, that integrates data selection, mining and visualization tor progressive analysis of temporal, categorical data. We discuss a data model and architecture. We illustrate how our tool can be used to complex mining tasks such as finding patterns not occurring on Monday. Further, we discuss the novel visualization employed, such as visualizing categorical data and the results of data mining. Also, we discuss the extension of the existing mining framework needed to mine temporal events with multiple attributes. Throughout, we illustrate the capabilities of Even Miner by applying it to event data from large computer networks.

By: Sheng Ma, Joseph L. Hellerstein, Chang-shing Perng, Genady Grabarnik

Published in: Proceedings 2002 IEEE International Conference on Data Mining. Los Alamitos, CA, , IEEE Computer Society Press. , p.661-4 in 2002

Please obtain a copy of this paper from your local library. IBM cannot distribute this paper externally.

Questions about this service can be mailed to reports@us.ibm.com .