Search Sciweavers | Sciweavers

98

ACL
2008

168views Computational Linguistics» more ACL 2008»

Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation

15 years 18 days ago

In statistical language modeling, one technique to reduce the problematic effects of data sparsity is to partition the vocabulary into equivalence classes. In this paper we invest...

Jakob Uszkoreit, Thorsten Brants

claim paper

Read More »

103

click to vote

INTERSPEECH
2010

137views Signal Processing» more INTERSPEECH 2010»

Decision tree state clustering with word and syllable features

14 years 6 months ago

Download static.googleusercontent.com

In large vocabulary continuous speech recognition, decision trees are widely used to cluster triphone states. In addition to commonly used phonetically based questions, others hav...

Hank Liao, Christopher Alberti, Michiel Bacchiani,...

claim paper

Read More »

97

Voted

CORR
1998
Springer

87views Education» more CORR 1998»

Word Clustering and Disambiguation Based on Co-occurrence Data

14 years 10 months ago

Download acl.ldc.upenn.edu

We address the problem of clustering words (or constructing a thesaurus) based on co-occurrence data, and using the acquired word classes to improve the accuracy of syntactic disa...

Hang Li, Naoki Abe

claim paper

Read More »

69

click to vote

ICPR
2010
IEEE

145views Computer Vision» more ICPR 2010»

Word Clustering Using PLSA Enhanced with Long Distance Bigrams

14 years 9 months ago

Download www.icpr2010.org

Probabilistic latent semantic analysis is enhanced with long distance bigram models in order to improve word clustering. The long distance bigram probabilities and the interpolate...

Nikoletta Bassiou, Constantine Kotropoulos

claim paper

Read More »

95

click to vote

IJCAI
2001

113views Artificial Intelligence» more IJCAI 2001»

Combining Statistics and Semantics for Word and Document Clustering

15 years 15 days ago

Download sunsite.informatik.rwth-aachen.de

A new approach for constructing pseudo-keywords, referred to as Sense Units, is proposed. Sense Units are obtained by a word clustering process, where the underlying similarity re...

Alexandre Termier, Michèle Sebag, Marie-Chr...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers