Sciweavers

» Marking Text Documents
TREC
2007
15 years 7 months ago
On Retrieving Legal Files: Shortening Documents and Weeding Out Garbage
This paper describes our participation in the TREC Legal experiments in 2007. We have applied novel normalization techniques that are designed to slightly favor longer documents i...
Scott Kulp, April Kontostathis
JMM2
2007
15 years 6 months ago
On Separation of English Numerals from Multilingual Document Images
For Optical Character Recognition (OCR) of bilingual or multilingual documents containing text words in regional language and numerals in English, it is necessary to identify di...
Basanna V. Dhandra, Mallikarjun Hangarge
ACL
2010
15 years 4 months ago
Unsupervised Discourse Segmentation of Documents with Inherently Parallel Structure
Documents often have inherently parallel structure: they may consist of a text and commentaries, or an abstract and a body, or parts presenting alternative views on the same problem. Reve...
Minwoo Jeong, Ivan Titov
ICFHR
2010
15 years 28 days ago
Alpha-Numerical Sequences Extraction in Handwritten Documents
In this paper, we introduce an alpha-numerical sequences extraction system (keywords, numerical fields or alpha-numerical sequences) in unconstrained handwritten documents. Contra...
Simon Thomas, Clément Chatelain, Laurent He...
COLING
2000
15 years 7 months ago
Text Genre Detection Using Common Word Frequencies
In this paper we present a method for detecting the text genre quickly and easily following an approach originally proposed in authorship attribution studies which uses as style m...
Efstathios Stamatatos, Nikos Fakotakis, George K. ...