Sciweavers

241 search results - page 14 / 49
» Detecting Co-Derivative Documents in Large Text Collections
SIGIR
2011
ACM
14 years 14 days ago
Pseudo test collections for learning web search ranking functions
Test collections are the primary drivers of progress in information retrieval. They provide a yardstick for assessing the effectiveness of ranking functions in an automatic, rapi...
Nima Asadi, Donald Metzler, Tamer Elsayed, Jimmy L...
PDP
2008
IEEE
15 years 4 months ago
Distributed Sparse Spatial Selection Indexes
Searching for similar objects in metric-space databases can be efficiently solved by using index data structures. A number of alternative sequential indexes have been proposed in...
Veronica Gil Costa, Mauricio Marín
SCCC
1998
IEEE
15 years 1 month ago
Parallel Generation of Inverted Files for Distributed Text Collections
We present a scalable algorithm for the parallel computation of inverted files for large text collections. The algorithm takes into account an environment of a high bandwidth netw...
Berthier A. Ribeiro-Neto, Joao Paulo Kitajima, Gon...
ICDAR
2011
IEEE
13 years 9 months ago
Greek Polytonic OCR Based on Efficient Character Class Number Reduction
Recognition of document images having Greek polytonic (multi-accent) characters is a challenging task due to the large number of existing character classes (more than 270). In this...
Basilios Gatos, Georgios Louloudis, Nikolaos Stama...
SEKE
2010
Springer
14 years 8 months ago
Incremental Construction of Topic Hierarchies using Hierarchical Term Clustering
Topic hierarchies are very useful for managing, searching and browsing large repositories of text documents. The hierarchical clustering methods are used to support the constructi...
Ricardo M. Marcacini, Solange O. Rezende