Sciweavers

IJDAR
2007
82views more  IJDAR 2007»
13 years 4 months ago
Finding structure in noisy text: topic classification and unsupervised clustering
Premkumar Natarajan, Rohit Prasad, Krishna Subrama...
IJDAR
2007
106views more  IJDAR 2007»
13 years 4 months ago
Investigation and modeling of the structure of texting language
Language usage over computer mediated discourses, like chats, emails and SMS texts, significantly differs from the standard form of the language. An urge towards shorter message l...
Monojit Choudhury, Rahul Saraf, Vijit Jain, Animes...
IJDAR
2007
77views more  IJDAR 2007»
13 years 4 months ago
Genre as noise: noise in genre
Given a specific information need, documents of the wrong genre can be considered as noise. From this perspective, genre classification helps to separate relevant documents from...
Andrea Stubbe, Christoph Ringlstetter, Klaus U. Sc...
IJDAR
2007
100views more  IJDAR 2007»
13 years 4 months ago
Treebanks gone bad
This paper describes how a treebank of ungrammatical sentences can be created from a treebank of well-formed sentences. The treebank creation pro
Jennifer Foster
IJDAR
2007
62views more  IJDAR 2007»
13 years 4 months ago
Biblio: automatic meta-data extraction
Carl Staelin, Michael Elad, Darryl Greig, Oded Shm...
IJDAR
2007
52views more  IJDAR 2007»
13 years 4 months ago
Using colour information to understand censorship cards of film archives
Oronzo Altamura, Margherita Berardi, Michelangelo ...
IJDAR
2007
69views more  IJDAR 2007»
13 years 4 months ago
User-driven page layout analysis of historical printed books
In this paper, based on the study of the specificity of historical printed books, we first explain the main error sources in classical methods used for page layout analysis. We sho...
Jean-Yves Ramel, S. Leriche, M. L. Demonet, S. Bus...
IJDAR
2007
93views more  IJDAR 2007»
13 years 4 months ago
Word spotting for historical documents
Toni M. Rath, R. Manmatha
IJDAR
2007
127views more  IJDAR 2007»
13 years 4 months ago
Word matching using single closed contours for indexing handwritten historical documents
Abstract. Effective indexing is crucial for providing convenient access to scanned versions of large collections of handwritten historical manuscripts. Since traditional handwritin...
Tomasz Adamek, Noel E. O'Connor, Alan F. Smeaton