Sciweavers

1820 search results - page 40 / 364
» Hierarchical Clustering of Words
Sort
View
ACL
2008
15 years 18 days ago
Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation
In statistical language modeling, one technique to reduce the problematic effects of data sparsity is to partition the vocabulary into equivalence classes. In this paper we invest...
Jakob Uszkoreit, Thorsten Brants
INTERSPEECH
2010
14 years 6 months ago
Decision tree state clustering with word and syllable features
In large vocabulary continuous speech recognition, decision trees are widely used to cluster triphone states. In addition to commonly used phonetically based questions, others hav...
Hank Liao, Christopher Alberti, Michiel Bacchiani,...
97
Voted
CORR
1998
Springer
87views Education» more  CORR 1998»
14 years 10 months ago
Word Clustering and Disambiguation Based on Co-occurrence Data
We address the problem of clustering words (or constructing a thesaurus) based on co-occurrence data, and using the acquired word classes to improve the accuracy of syntactic disa...
Hang Li, Naoki Abe
ICPR
2010
IEEE
14 years 9 months ago
Word Clustering Using PLSA Enhanced with Long Distance Bigrams
Probabilistic latent semantic analysis is enhanced with long distance bigram models in order to improve word clustering. The long distance bigram probabilities and the interpolate...
Nikoletta Bassiou, Constantine Kotropoulos
IJCAI
2001
15 years 15 days ago
Combining Statistics and Semantics for Word and Document Clustering
A new approach for constructing pseudo-keywords, referred to as Sense Units, is proposed. Sense Units are obtained by a word clustering process, where the underlying similarity re...
Alexandre Termier, Michèle Sebag, Marie-Chr...