Sciweavers

2190 search results - page 255 / 438
» Large-scale extraction and use of knowledge from text
Sort
View
PAAMS
2010
Springer
14 years 11 months ago
A Case Study on Grammatical-Based Representation for Regular Expression Evolution
Abstract. Regular expressions, or simply regex, have been widely used as a powerful pattern matching and text extractor tool through decades. Although they provide a powerful and f...
Antonio González-Pardo, David F. Barrero, D...
CAIP
2007
Springer
134views Image Analysis» more  CAIP 2007»
15 years 5 months ago
An Efficient Method for Filtering Image-Based Spam E-mail
Spam e-mail with advertisement text embedded in images presents a great challenge to anti-spam filters. In this paper, we present a fast method to detect image-based spam e-mail. U...
Ngo Phuong Nhung, Tu Minh Phuong
NAACL
2003
15 years 3 months ago
A Generative Probabilistic OCR Model for NLP Applications
In this paper, we introduce a generative probabilistic optical character recognition (OCR) model that describes an end-to-end process in the noisy channel framework, progressing f...
Okan Kolak, William J. Byrne, Philip Resnik
CIKM
2010
Springer
14 years 11 months ago
A probabilistic topic-connection model for automatic image annotation
The explosive increase of image data on Internet has made it an important, yet very challenging task to index and automatically annotate image data. To achieve that end, sophistic...
Xin Chen, Xiaohua Hu, Zhongna Zhou, Caimei Lu, Gai...
SAC
2009
ACM
15 years 8 months ago
Combining statistics and semantics via ensemble model for document clustering
Incorporating background knowledge into data mining algorithms is an important but challenging problem. Current approaches in semi-supervised learning require explicit knowledge p...
Samah Jamal Fodeh, William F. Punch, Pang-Ning Tan