Sciweavers

» Learning Bigrams from Unigrams
ACL
2008
14 years 11 months ago
Learning Bigrams from Unigrams
Traditional wisdom holds that once documents are turned into bag-of-words (unigram count) vectors, word orders are completely lost. We introduce an approach that, perhaps surprisi...
Xiaojin Zhu, Andrew B. Goldberg, Michael Rabbat, R...
NIPS
2008
14 years 11 months ago
Correlated Bigram LSA for Unsupervised Language Model Adaptation
We present a correlated bigram LSA approach for unsupervised LM adaptation for automatic speech recognition. The model is trained using efficient variational EM and smoothed using...
Yik-Cheung Tam, Tanja Schultz
TREC
2003
14 years 11 months ago
SVM Approach to GeneRIF Annotation
In the biological domain, extracting newly discovered functional features from the massive literature is a major challenge. To automatically annotate GeneRIF in a new lite...
Wen-Juan Hou, Chun-Yuan Teng, Chih Lee, Hsin-Hsi C...
ACL
2006
14 years 11 months ago
Contextual Dependencies in Unsupervised Word Segmentation
Developing better methods for segmenting continuous text into words is important for improving the processing of Asian languages, and may shed light on how humans learn to segment...
Sharon Goldwater, Thomas L. Griffiths, Mark Johnso...
ICML
2006
IEEE
15 years 10 months ago
Topic modeling: beyond bag-of-words
Some models of textual corpora employ text generation methods involving n-gram statistics, while others use latent topic variables inferred using the "bag-of-words" assu...
Hanna M. Wallach