Sciweavers

» Using Document Dimensions for Enhanced Information Retrieval
SPIRE
2010
Springer
15 years 3 months ago
Dual-Sorted Inverted Lists
Several IR tasks rely, to achieve high efficiency, on a single pervasive data structure called the inverted index. This is a mapping from the terms in a text collection to the docu...
Gonzalo Navarro, Simon J. Puglisi
SIGIR
2010
ACM
15 years 8 months ago
Adaptive near-duplicate detection via similarity learning
In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...
Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz
ECIR
2003
Springer
15 years 6 months ago
Topic Detection and Tracking with Spatio-Temporal Evidence
Topic Detection and Tracking is an event-based information organization task where online news streams are monitored in order to spot new unreported events and link documents with ...
Juha Makkonen, Helena Ahonen-Myka, Marko Salmenkiv...
ICDIM
2008
IEEE
15 years 11 months ago
Unsupervised key-phrases extraction from scientific papers using domain and linguistic knowledge
The domain of Digital Libraries presents specific challenges for unsupervised information extraction to support both the automatic classification of documents and the enhancement ...
Mikalai Krapivin, Maurizio Marchese, Andrei Yadran...
WWW
2003
ACM
16 years 5 months ago
Peer-to-peer architecture for content-based music retrieval on acoustic data
In traditional peer-to-peer search networks, operations focus on properly labeled files such as music or video, and the actual search is often limited to text tags. The explosive ...
Cheng Yang