Sciweavers

» Hidden word statistics
ICDAR
2011
IEEE
13 years 11 months ago
Co-training for Handwritten Word Recognition
To cope with the tremendous variations of writing styles encountered between different individuals, unconstrained automatic handwriting recognition systems need to be trained on...
Volkmar Frinken, Andreas Fischer, Horst Bunke, Ali...
EMNLP
2009
14 years 9 months ago
Extending Statistical Machine Translation with Discriminative and Trigger-Based Lexicon Models
In this work, we propose two extensions of standard word lexicons in statistical machine translation: A discriminative word lexicon that uses sentence-level source information to ...
Arne Mauser, Sasa Hasan, Hermann Ney
EMNLP
2007
15 years 1 month ago
A Topic Model for Word Sense Disambiguation
We develop latent Dirichlet allocation with WORDNET (LDAWN), an unsupervised probabilistic topic model that includes word sense as a hidden variable. We develop a probabilistic po...
Jordan L. Boyd-Graber, David M. Blei, Xiaojin Zhu
ANLP
1994
15 years 1 month ago
Degraded Text Recognition Using Word Collocation and Visual Inter-Word Constraints
Given a noisy text page, a word recognizer can generate a set of candidates for each word image. A relaxation algorithm was proposed previously by the authors that uses word collo...
Tao Hong, Jonathan J. Hull
ICDM
2007
IEEE
15 years 6 months ago
Sparse Word Graphs: A Scalable Algorithm for Capturing Word Correlations in Topic Models
Statistical topic models such as the Latent Dirichlet Allocation (LDA) have emerged as an attractive framework to model, visualize and summarize large document collections in a co...
Ramesh Nallapati, Amr Ahmed, William W. Cohen, Eri...