Sciweavers

367 search results - page 47 / 74
» Indexing Text Documents Based on Topic Identification
Sort
View
SIGIR
2004
ACM
15 years 5 months ago
Configurable indexing and ranking for XML information retrieval
Indexing and ranking are two key factors for efficient and effective XML information retrieval. Inappropriate indexing may result in false negatives and false positives, and impro...
Shaorong Liu, Qinghua Zou, Wesley W. Chu
WWW
2006
ACM
16 years 13 days ago
Towards practical genre classification of web documents
Classification of documents by genre is typically done either using linguistic analysis or term frequency based techniques. The former provides better classification accuracy than...
George Ferizis, Peter Bailey
HICSS
2002
IEEE
191views Biometrics» more  HICSS 2002»
15 years 4 months ago
Mindmap: Utilizing Multiple Taxonomies and Visualization to Understand a Document Collection
We present a novel system and methodology for browsing and exploring topics and concepts within a document collection. The process begins with the generation of multiple taxonomie...
W. Scott Spangler, Jeffrey T. Kreulen, Justin Less...
SIGIR
2002
ACM
14 years 11 months ago
Risk minimization and language modeling in text retrieval dissertation abstract
tion Abstract ChengXiang Zhai (Advisor: John Lafferty) Language Technologies Institute School of Computer Science Carnegie Mellon University With the dramatic increase in online in...
ChengXiang Zhai
ICDAR
2005
IEEE
15 years 5 months ago
Skew Estimation for Scanned Documents from "Noises"
The vast majority of the published skew estimation methods for scanned document images are for textual documents. These methods are based on the principle that the skew angles can...
Bo Yuan, Chew Lim Tan