Search Sciweavers | Sciweavers

466 search results - page 21 / 94

» Scalable Feature Extraction from Noisy Documents

165

click to vote

ICDAR
2003
IEEE

265views Document Analysis» more ICDAR 2003»

Localization, Extraction and Recognition of Text in Telugu Document Images

15 years 11 months ago

Download www.hserus.net

In this paper we present a system to locate, extract and recognize Telugu text. The circular nature of Telugu script is exploited for segmenting text regions using the Hough Trans...

Atul Negi, K. Nikhil Shanker, Chandra Kanth Chered...

claim paper

Read More »

160

click to vote

SIGIR
2006
ACM

133views Information Technology» more SIGIR 2006»

Feature diversity in cluster ensembles for robust document clustering

15 years 12 months ago

Download serpens.salleurl.edu

The performance of document clustering systems depends on employing optimal text representations, which are not only diﬃcult to determine beforehand, but also may vary from one ...

Xavier Sevillano, Germán Cobo, Francesc Al&...

claim paper

Read More »

194

click to vote

COLING
2010

160views Computational Linguistics» more COLING 2010»

Shallow Information Extraction from Medical Forum Data

15 years 29 days ago

Download nlp.cs.illinois.edu

We study a novel shallow information extraction problem that involves extracting sentences of a given set of topic categories from medical forum data. Given a corpus of medical fo...

Parikshit Sondhi, Manish Gupta, ChengXiang Zhai, J...

claim paper

Read More »

191

click to vote

SIGIR
2003
ACM

147views Information Technology» more SIGIR 2003»

Text categorization by boosting automatically extracted concepts

15 years 11 months ago

Download www.cs.brown.edu

Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...

Lijuan Cai, Thomas Hofmann

claim paper

Read More »

186

click to vote

ANLP
1994

104views more ANLP 1994»

Language Determination: Natural Language Processing from Scanned Document Images

15 years 7 months ago

Download acl.ldc.upenn.edu

Many documents are available to a computer only as images from paper. However, most natural language processing systems expect their input as character-coded text, which may be di...

Penelope Sibun, A. Lawrence Spitz

claim paper

Read More »

« Prev « First page 21 / 94 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers