We propose an unsupervised method for detecting spam documents from Web page data, based on equivalence relations on strings. We propose 3 measures for quantifying the alienness (...
In this paper we present a scalable protocol for conducting periodic probes of network performance in a way that minimizes collisions between separate probes. The goal of the prot...
Richard Wolski, Benjamin Gaidioz, Bernard Touranch...
Consistent and flawless communication between humans and machines is the precondition for a computer to process instructions correctly. While machines use well-defined languages an...
We consider a parsed text corpus as an instance of a labelled directed graph, where nodes represent words and weighted directed edges represent the syntactic relations between the...
In this paper, we present a new approach for hand-written character and digit recognitions based on shape descriptor and the Hausdorff Context. We start at finding the correspondi...