Sciweavers

47 search results - page 9 / 10
» Text Degradations and OCR Training
Sort
View
PROPOR
2010
Springer
278views Languages» more  PROPOR 2010»
15 years 4 months ago
Translating from Complex to Simplified Sentences
We address the problem of simplifying Portuguese texts at the sentence level treating it as a "translation task". We use the Statistical Machine Translation (SMT) framewo...
Lucia Specia
75
Voted
SIGIR
2005
ACM
15 years 3 months ago
Boosted decision trees for word recognition in handwritten document retrieval
Recognition and retrieval of historical handwritten material is an unsolved problem. We propose a novel approach to recognizing and retrieving handwritten manuscripts, based upon ...
Nicholas R. Howe, Toni M. Rath, R. Manmatha
80
Voted
TRECVID
2007
14 years 10 months ago
PicSOM Experiments in TRECVID 2007
Our experiments in TRECVID 2007 include participation in the high-level feature extraction, search, and video summarization tasks, using a common system framework based on multipl...
Markus Koskela, Mats Sjöberg, Ville Viitaniem...
64
Voted
FLAIRS
2003
14 years 11 months ago
Orthographic Case Restoration Using Supervised Learning Without Manual Annotation
One challenge in text processing is the treatment of case insensitive documents such as speech recognition results. The traditional approach is to re-train a language model exclud...
Cheng Niu, Wei Li 0003, Jihong Ding, Rohini K. Sri...
LREC
2008
110views Education» more  LREC 2008»
14 years 11 months ago
Creation of Learner Corpus and Its Application to Speech Recognition
Some big languages like English are spoken by a lot of people whose mother tongues are different from. Their second languages often have not only distinct accent but also differen...
Hiroki Yamazaki, Keisuke Kitamura, Takashi Harada,...