Sciweavers

965 search results - page 108 / 193
» Text Types in Hypermedia
Sort
View
DAS
2010
Springer
15 years 9 months ago
Binarization of historical document images using the local maximum and minimum
This paper presents a new document image binarization technique that segments the text from badly degraded historical document images. The proposed technique makes use of the imag...
Bolan Su, Shijian Lu, Chew Lim Tan
BMCBI
2008
185views more  BMCBI 2008»
15 years 4 months ago
Mining clinical relationships from patient narratives
Background: The Clinical E-Science Framework (CLEF) project has built a system to extract clinically significant information from the textual component of medical records in order...
Angus Roberts, Robert J. Gaizauskas, Mark Hepple, ...
SIGIR
2008
ACM
15 years 4 months ago
Learning from labeled features using generalized expectation criteria
It is difficult to apply machine learning to new domains because often we lack labeled problem instances. In this paper, we provide a solution to this problem that leverages domai...
Gregory Druck, Gideon S. Mann, Andrew McCallum
ICDE
2012
IEEE
227views Database» more  ICDE 2012»
13 years 7 months ago
Horizontal Reduction: Instance-Level Dimensionality Reduction for Similarity Search in Large Document Databases
—Dimensionality reduction is essential in text mining since the dimensionality of text documents could easily reach several tens of thousands. Most recent efforts on dimensionali...
Min-Soo Kim 0001, Kyu-Young Whang, Yang-Sae Moon
WWW
2007
ACM
16 years 5 months ago
Web page classification with heterogeneous data fusion
Web pages are more than text and they contain much contextual and structural information, e.g., the title, the meta data, the anchor text, etc., each of which can be seen as a dat...
Zenglin Xu, Irwin King, Michael R. Lyu