Sciweavers

241 search results - page 25 / 49
» Detecting Co-Derivative Documents in Large Text Collections
Sort
View
CGA
2010
14 years 7 months ago
Context-Preserving, Dynamic Word Cloud Visualization
In this paper, we introduce a visualization method that couples a trend chart with word clouds to illustrate temporal content evolutions in a set of documents. Specifically, we us...
Weiwei Cui, Yingcai Wu, Shixia Liu, Furu Wei, Mich...
ADC
2004
Springer
116views Database» more  ADC 2004»
15 years 3 months ago
Index Compression Using Fixed Binary Codewords
Document retrieval and web search engines index large quantities of text. The static costs associated with storing the index can be traded against dynamic costs associated with us...
Vo Ngoc Anh, Alistair Moffat
100
Voted
APWEB
2006
Springer
15 years 1 months ago
The Case of the Duplicate Documents Measurement, Search, and Science
Many of the documents in large text collections are duplicates and versions of each other. In recent research, we developed new methods for finding such duplicates; however, as the...
Justin Zobel, Yaniv Bernstein
DOCENG
2004
ACM
15 years 3 months ago
Presenting the results of relevance-oriented search over XML documents
In this paper, we discuss how to present the result of searching elements of any type from XML documents relevant to some information need (relevance-oriented search). As the resu...
Alda Lopes Gançarski, Pedro Rangel Henrique...
87
Voted
ICTAI
2009
IEEE
15 years 4 months ago
Classifying Sentence-Based Summaries of Web Documents
Text classification categories Web documents in large collections into predefined classes based on their contents. Unfortunately, the classification process can be time-consumi...
Maria Soledad Pera, Yiu-Kai Ng