Sciweavers

35 search results - page 1 / 7
» Learning Bigrams from Unigrams
ACL
2008
15 years 20 days ago
Learning Bigrams from Unigrams
Traditional wisdom holds that once documents are turned into bag-of-words (unigram count) vectors, word orders are completely lost. We introduce an approach that, perhaps surprisi...
Xiaojin Zhu, Andrew B. Goldberg, Michael Rabbat, R...
NIPS
2008
15 years 19 days ago
Correlated Bigram LSA for Unsupervised Language Model Adaptation
We present a correlated bigram LSA approach for unsupervised LM adaptation for automatic speech recognition. The model is trained using efficient variational EM and smoothed using...
Yik-Cheung Tam, Tanja Schultz
TREC
2003
15 years 17 days ago
SVM Approach to GeneRIF Annotation
In the biological domain, to extract the newly discovered functional features from massive literature is a major challenging issue. To automatically annotate GeneRIF in a new lite...
Wen-Juan Hou, Chun-Yuan Teng, Chih Lee, Hsin-Hsi C...
ACL
2006
15 years 18 days ago
Contextual Dependencies in Unsupervised Word Segmentation
Developing better methods for segmenting continuous text into words is important for improving the processing of Asian languages, and may shed light on how humans learn to segment...
Sharon Goldwater, Thomas L. Griffiths, Mark Johnso...
ICML
2006
IEEE
16 years 11 hours ago
Topic modeling: beyond bag-of-words
Some models of textual corpora employ text generation methods involving n-gram statistics, while others use latent topic variables inferred using the "bag-of-words" assu...
Hanna M. Wallach