Sciweavers

2827 search results - page 382 / 566
» Marking Text Documents
Sort
View
ICPR
2004
IEEE
16 years 7 months ago
Italic Font Recognition Using Stroke Pattern Analysis on Wavelet Decomposed Word Images
This paper describes an italic font recognition method using stroke pattern analysis on wavelet decomposed word images. The word images are extracted from scanned text documents c...
Chew Lim Tan, Li Zhang, Yue Lu
WWW
2002
ACM
16 years 6 months ago
Searching with numbers
A large fraction of the useful web comprises of specification documents that largely consist of hattribute name, numeric valuei pairs embedded in text. Examples include product in...
Rakesh Agrawal, Ramakrishnan Srikant
EDBT
2009
ACM
123views Database» more  EDBT 2009»
16 years 28 days ago
High-performance information extraction with AliBaba
A wealth of information is available only in web pages, patents, publications etc. Extracting information from such sources is challenging, both due to the typically complex langu...
Peter Palaga, Long Nguyen, Ulf Leser, Jörg Ha...
PAKDD
2009
ACM
116views Data Mining» more  PAKDD 2009»
16 years 28 days ago
Scalable Web Mining with Newistic
Abstract. Newistic is a web mining platform that collects and analyses documents crawled from the Internet. Although it currently processes news articles, it can be easily adapted ...
Ovidiu Dan, Horatiu Mocian
ICDAR
2009
IEEE
16 years 27 days ago
Recognition of Degraded Handwritten Characters Using Local Features
The main problems of Optical Character Recognition (OCR) systems are solved if printed latin text is considered. Since OCR systems are based upon binary images, their results are ...
Markus Diem, Robert Sablatnig