Search Sciweavers | Sciweavers

DAS
2010
Springer

303 views | Document Analysis

Binarization of historical document images using the local maximum and minimum

15 years 9 months ago

This paper presents a new document image binarization technique that segments the text from badly degraded historical document images. The proposed technique makes use of the imag...

Bolan Su, Shijian Lu, Chew Lim Tan

BMCBI
2008

185 views

Mining clinical relationships from patient narratives

15 years 4 months ago

Download www.biomedcentral.com

Background: The Clinical E-Science Framework (CLEF) project has built a system to extract clinically significant information from the textual component of medical records in order...

Angus Roberts, Robert J. Gaizauskas, Mark Hepple, ...

SIGIR
2008
ACM

150 views | Information Technology

Learning from labeled features using generalized expectation criteria

15 years 4 months ago

Download www.cs.umass.edu

It is difficult to apply machine learning to new domains because often we lack labeled problem instances. In this paper, we provide a solution to this problem that leverages domai...

Gregory Druck, Gideon S. Mann, Andrew McCallum

ICDE
2012
IEEE

227 views | Database

Horizontal Reduction: Instance-Level Dimensionality Reduction for Similarity Search in Large Document Databases

13 years 7 months ago

Download dblab.kaist.ac.kr

Dimensionality reduction is essential in text mining since the dimensionality of text documents could easily reach several tens of thousands. Most recent efforts on dimensionali...

Min-Soo Kim 0001, Kyu-Young Whang, Yang-Sae Moon

WWW
2007
ACM

138 views | Internet Technology

Web page classification with heterogeneous data fusion

16 years 5 months ago

Download www.cse.cuhk.edu.hk

Web pages are more than text and they contain much contextual and structural information, e.g., the title, the meta data, the anchor text, etc., each of which can be seen as a dat...

Zenglin Xu, Irwin King, Michael R. Lyu
