Sciweavers

» Retrieval of Partial Documents
IAJIT
2006
14 years 10 months ago
A Rule-Based Extensible Stemmer for Information Retrieval with Application to Arabic
This paper presents a new and extensible method for information retrieval and content analysis in natural languages (NL). The proposed method is stem-based; stems are extracted b...
Haidar Harmanani, Walid Keirouz, Saeed Raheel
WWW
2007
ACM
15 years 11 months ago
A new suffix tree similarity measure for document clustering
In this paper, we propose a new similarity measure to compute the pairwise similarity of text-based documents based on suffix tree document model. By applying the new suffix tree ...
Hung Chim, Xiaotie Deng
WSDM
2010
ACM
15 years 7 months ago
Leveraging Temporal Dynamics of Document Content in Relevance Ranking
Many web documents are dynamic, with content changing in varying amounts at varying frequencies. However, current document search algorithms have a static view of the document con...
Jonathan L. Elsas, Susan T. Dumais
DOCENG
2009
ACM
15 years 4 months ago
Object-level document analysis of PDF files
The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...
Tamir Hassan
AICCSA
2008
IEEE
15 years 4 months ago
Rapid and robust ranking of text documents in a dynamically changing corpus
Ranking documents in a selected corpus plays an important role in information retrieval systems. Despite notable advances in this direction, with continuously accumulating text do...
Byung-Hoon Park, Nagiza F. Samatova, Rajesh Munava...