Sciweavers

378 search results - page 38 / 76
» Finding document topics for improving topic segmentation
Sort
View
IPM
2007
143views more  IPM 2007»
15 years 4 months ago
QCS: A system for querying, clustering and summarizing documents
Information retrieval systems consist of many complicated components. Research and development of such systems is often hampered by the difficulty in evaluating how each particula...
Daniel M. Dunlavy, Dianne P. O'Leary, John M. Conr...
WWW
2007
ACM
16 years 5 months ago
A new suffix tree similarity measure for document clustering
In this paper, we propose a new similarity measure to compute the pairwise similarity of text-based documents based on suffix tree document model. By applying the new suffix tree ...
Hung Chim, Xiaotie Deng
HVEI
2010
14 years 11 months ago
Interest of perceptive vision for document structure analysis
This work addresses the problem of document image analysis, and more particularly the topic of document structure recognition in old, damaged and handwritten document. The goal of...
Aurélie Lemaitre, Jean Camillerapp, Bertran...
SIGMOD
2004
ACM
150views Database» more  SIGMOD 2004»
16 years 4 months ago
When one Sample is not Enough: Improving Text Database Selection Using Shrinkage
Database selection is an important step when searching over large numbers of distributed text databases. The database selection task relies on statistical summaries of the databas...
Panagiotis G. Ipeirotis, Luis Gravano
MMM
2007
Springer
129views Multimedia» more  MMM 2007»
15 years 10 months ago
A New Method to Improve Multi Font Farsi/Arabic Character Segmentation Results: Using Extra Classes of Some Character Combinatio
A new segmentation algorithm for multifont Farsi/Arabic texts based on conditional labeling of up and down contours was presented in [1]. A preprocessing technique was used to adju...
Mona Omidyeganeh, Reza Azmi, Kambiz Nayebi, Abbas ...