Sciweavers

1527 search results - page 128 / 306
» Hidden word statistics
COLING
1996
15 years 2 months ago
The Automatic Extraction of Open Compounds from Text Corpora
This paper describes a new method for extracting open compounds (uninterrupted sequences of words) from text corpora of languages such as Thai, Japanese, and Korean that exhibit un...
Virach Sornlertlamvanich, Hozumi Tanaka
ICASSP
2011
IEEE
14 years 5 months ago
Recent development of discriminative training using non-uniform criteria for cross-level acoustic modeling
In this paper, we extend our previous study on discriminative training using non-uniform criteria for speech recognition. The work will put emphasis on how the acoustic modeling i...
Chao Weng, Biing-Hwang Juang
GFKL
2006
Springer
15 years 5 months ago
Putting Successor Variety Stemming to Work
Stemming algorithms find canonical forms for inflected words, e.g., for declined nouns or conjugated verbs. Since such a unification of words with respect to gender, number, time, ...
Benno Stein, Martin Potthast
DAS
2006
Springer
15 years 5 months ago
Aligning Transcripts to Automatically Segmented Handwritten Manuscripts
Training and evaluation of techniques for handwriting recognition and retrieval is a challenge given that it is difficult to create large ground-truthed datasets. This is...
Jamie L. Rothfeder, R. Manmatha, Toni M. Rath
EMNLP
2008
15 years 3 months ago
Indirect-HMM-based Hypothesis Alignment for Combining Outputs from Machine Translation Systems
This paper presents a new hypothesis alignment method for combining outputs of multiple machine translation (MT) systems. An indirect hidden Markov model (IHMM) is proposed to add...
Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguye...