Sciweavers

IDA
2007
Springer
13 years 3 months ago
An overview of clustering methods
Mahamed G. Omran, Andries Petrus Engelbrecht, Ayed...
IDA
2007
Springer
13 years 3 months ago
Voting experts: An unsupervised algorithm for segmenting sequences
We describe a statistical signature of chunks and an algorithm for finding chunks. While there is no formal definition of chunks, they may be reliably identified as configurat...
Paul R. Cohen, Niall M. Adams, Brent Heeringa
IDA
2007
Springer
13 years 3 months ago
Anomaly detection in data represented as graphs
An important area of data mining is anomaly detection, particularly for fraud. However, little work has been done in terms of detecting anomalies in data that is represented as a g...
William Eberle, Lawrence B. Holder
IDA
2007
Springer
13 years 3 months ago
An evaluation of Naive Bayes variants in content-based learning for spam filtering
We describe an in-depth analysis of spam-filtering performance of a simple Naive Bayes learner and two extended variants. A set of seven mailboxes comprising about 65,000 mails f...
Alexander K. Seewald
IDA
2007
Springer
13 years 3 months ago
Inference of node replacement graph grammars
Graph grammars combine the relational aspect of graphs with the iterative and recursive aspects of string grammars, and thus represent an important next step in our ability to dis...
Jacek P. Kukluk, Lawrence B. Holder, Diane J. Cook
IDA
2007
Springer
13 years 3 months ago
Second-order uncertainty calculations by using the imprecise Dirichlet model
Natural extension is a powerful tool for combining the expert judgments in the framework of imprecise probability theory. However, it assumes that every judgment is “true” and...
Lev V. Utkin
IDA
2007
Springer
13 years 3 months ago
Removing biases in unsupervised learning of sequential patterns
Unsupervised sequence learning is important to many applications. A learner is presented with unlabeled sequential data, and must discover sequential patterns that characterize th...
Yoav Horman, Gal A. Kaminka
IDA
2007
Springer
13 years 3 months ago
Approximate mining of frequent patterns on streams
This paper introduces a new algorithm for approximate mining of frequent patterns from streams of transactions using a limited amount of memory. The proposed algorithm co...
Claudio Silvestri, Salvatore Orlando
IDA
2007
Springer
13 years 3 months ago
An unsupervised clustering approach for leukaemia classification based on DNA micro-arrays data
DNA micro-arrays provide thousands of genomic expressions on the same subject. A main issue is then to find the subset of genes whose degeneration is responsible for a certain typ...
Simone Garatti, Sergio Bittanti, Diego Liberati, A...
IDA
2007
Springer
13 years 3 months ago
WWW traffic measure and its properties
We present a method to extract a time series, the Number of Active Requests (NAR), from web cache logs, which serves as a transport-level measurement of internet traffic. This...
Marcus R. Keogh-Brown, Barbara Bogacka