In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...
Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz
In order to return relevant search results, a search engine must keep its local repository synchronized to the Web, but it is usually impossible to attain perfect freshness. Hence...
Generalized association rules are rules that contain some background knowledge, therefore, giving a more general view of the domain. This knowledge is codified by a taxonomy set ...
Veronica Oliveira de Carvalho, Solange Oliveira Re...
We show that logical and behavioral equivalence for stochastic Kripke models over general measurable spaces are the same. Usually, this requires some topological assumptions and in...
Background: Network methods are increasingly used to represent the interactions of genes and/ or proteins. Genes or proteins that are directly linked may have a similar biological...