Sciweavers

3180 search results - page 199 / 636
» Knowledge-based Document Analysis
Sort
View
SIGIR
1998
ACM
15 years 8 months ago
Improved Algorithms for Topic Distillation in a Hyperlinked Environment
This paper addresses the problem of topic distillation on the World Wide Web, namely, given a typical user query to find quality documents related to the query topic. Connectivity...
Krishna Bharat, Monika Rauch Henzinger
DOCENG
2008
ACM
15 years 6 months ago
Identifying and expanding titles in web texts
In this paper, we present an analysis based on linguistic and typographic features that allows for the identification of titles in web documents. We focus in particular on procedu...
Clémentine Adam, Estelle Delpech, Patrick S...
DATESO
2007
85views Database» more  DATESO 2007»
15 years 5 months ago
Improvement of Text Compression Parameters Using Cluster Analysis
Abstract. Several actions are usually performed when document is appended to textual database in information retrieval system. The most frequent actions are compression of the docu...
Jiri Dvorský, Jan Martinovic
EACL
2006
ACL Anthology
15 years 5 months ago
Computing Term Translation Probabilities with Generalized Latent Semantic Analysis
Term translation probabilities proved an effective method of semantic smoothing in the language modelling approach to information retrieval. We use Generalized Latent Semantic Ana...
Irina Matveeva, Gina-Anne Levow
ICDAR
2007
IEEE
15 years 10 months ago
On-Line Handwritten Text Line Detection Using Dynamic Programming
In this paper we propose a novel approach to the detection of on-line handwritten text lines based on dynamic programming. We try to find the paths with the minimum cost between ...
Marcus Liwicki, Emanuel Indermühle, Horst Bun...