Sciweavers

1523 search results - page 178 / 305
» Generalized contextualization method for XML information ret...
Sort
View
SIGIR
2010
ACM
15 years 1 months ago
Adaptive near-duplicate detection via similarity learning
In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...
Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz
WWW
2005
ACM
15 years 10 months ago
Focused crawling by exploiting anchor text using decision tree
Focused crawlers are considered as a promising way to tackle the scalability problem of topic-oriented or personalized search engines. To design a focused crawler, the choice of s...
Jun Li, Kazutaka Furuse, Kazunori Yamaguchi
KCAP
2009
ACM
15 years 4 months ago
Modeling multiple-event situations across news articles
Readers interested in the context of an event covered in the news such as the dismissal of a lawsuit can benefit from easily finding out about the overall news situation, the lega...
Earl J. Wagner, Larry Birnbaum, Kenneth D. Forbus
KCAP
2009
ACM
15 years 4 months ago
A catalogue of OWL ontology antipatterns
Debugging inconsistent OWL ontologies is a timeconsuming task. Debugging services included in existing ontology engineering tools are still far from providing adequate support to ...
Catherine Roussey, Óscar Corcho, Luis Manue...
WWW
2008
ACM
15 years 10 months ago
Collaborative filtering on skewed datasets
Many real life datasets have skewed distributions of events when the probability of observing few events far exceeds the others. In this paper, we observed that in skewed datasets...
Somnath Banerjee, Krishnan Ramanathan