Sciweavers

2926 search results - page 363 / 586
» Document Analysis
Sort
View
ICPR
2004
IEEE
16 years 5 months ago
Neural Network-Based Proper Names Extraction in Fax Images
In this paper, we are interested in the sender's name extraction in fax cover pages through a machine learning scheme. For this purpose, two analysis methods are implemented ...
Laurence Likforman-Sulem, Noura Azzabou
PAMI
2011
14 years 7 months ago
The Effect of Border Noise on the Performance of Projection-Based Page Segmentation Methods
— Projection methods have been used in the analysis of bi-tonal document images for different tasks like page segmentation and skew correction for over two decades. However, thes...
Faisal Shafait, Thomas M. Breuel
147
Voted
KDD
2007
ACM
169views Data Mining» more  KDD 2007»
16 years 5 months ago
Exploiting underrepresented query aspects for automatic query expansion
Users attempt to express their search goals through web search queries. When a search goal has multiple components or aspects, documents that represent all the aspects are likely ...
Daniel Crabtree, Peter Andreae, Xiaoying Gao
SMC
2010
IEEE
186views Control Systems» more  SMC 2010»
15 years 2 months ago
Semantic enrichment of text representation with wikipedia for text classification
—Text classification is a widely studied topic in the area of machine learning. A number of techniques have been developed to represent and classify text documents. Most of the t...
Hiroki Yamakawa, Jing Peng, Anna Feldman
144
Voted
AIRWEB
2005
Springer
15 years 10 months ago
Blocking Blog Spam with Language Model Disagreement
We present an approach for detecting link spam common in blog comments by comparing the language models used in the blog post, the comment, and pages linked by the comments. In co...
Gilad Mishne, David Carmel, Ronny Lempel