Detection of Sequential Outliers Using a Variable Length Markov Model

15 years 4 months ago

Download www.lirmm.fr

Mining for outliers in sequential databases is crucial to forward appropriate analysis of data. Therefore, many approaches for the discovery of such anomalies have been proposed. However, most of them use a sample of known typical sequences to build the model. Besides, they remain greedy in terms of memory usage. In this paper we propose an extension of one such approach, based on a Probabilistic Suffix Tree and on a measure of similarity. We add a pruning criterion which reduces the size of the tree while improving the model, and a sharp inequality for the concentration of the measure of similarity, to better sort the outliers. We prove the feasability of our approach through a set of experiments over a protein database.

Cécile Low-Kam, Anne Laurent, Maguelonne Te

Real-time Traffic

ICMLA 2008 | Machine Learning | Probabilistic Suffix Tree | Sequential Databases | Typical Sequences |

claim paper

» Camera Motion Detection using Video Mosaicing

» Intrusion activity projection for cyber situational awareness

» Mocapy A toolkit for inference and learning in dynamic Bayesian networks

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	ICMLA
Authors	Cécile Low-Kam, Anne Laurent, Maguelonne Teisseire

Comments (0)

Sciweavers

Detection of Sequential Outliers Using a Variable Length Markov Model

ICMLA 2008 | Machine Learning | Probabilistic Suffix Tree | Sequential Databases | Typical Sequences |

Explore & Download

Productivity Tools

Sciweavers