Sciweavers

24 search results - page 3 / 5
» Text Mining for Medical Documents Using a Hidden Markov Mode...
Sort
View
FLAIRS
2003
13 years 6 months ago
Orthographic Case Restoration Using Supervised Learning Without Manual Annotation
One challenge in text processing is the treatment of case insensitive documents such as speech recognition results. The traditional approach is to re-train a language model exclud...
Cheng Niu, Wei Li 0003, Jihong Ding, Rohini K. Sri...
ICDAR
2009
IEEE
13 years 11 months ago
HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation
Amharic is the official language of Ethiopia and uses Ethiopic script for writing. In this paper, we present writer-independent HMM-based Amharic word recognition for offline hand...
Yaregal Assabie, Josef Bigün
KDD
2007
ACM
167views Data Mining» more  KDD 2007»
14 years 5 months ago
Generalized component analysis for text with heterogeneous attributes
We present a class of richly structured, undirected hidden variable models suitable for simultaneously modeling text along with other attributes encoded in different modalities. O...
Xuerui Wang, Chris Pal, Andrew McCallum
ICIP
1999
IEEE
14 years 6 months ago
Color Documents on the Web with DJVU
We present a new image compression technique called DjVu" that is speci cally geared towards the compression of scanned documents in color at high resolution. With DjVu, a ma...
Bill Riemers, Léon Bottou, Pascal Vincent, ...
SIGIR
2003
ACM
13 years 10 months ago
Table extraction using conditional random fields
The ability to find tables and extract information from them is a necessary component of data mining, question answering, and other information retrieval tasks. Documents often c...
David Pinto, Andrew McCallum, Xing Wei, W. Bruce C...