Sciweavers

35 search results - page 1 / 7
» Learning Bigrams from Unigrams
Sort
View
60
Voted
ACL
2008
15 years 1 months ago
Learning Bigrams from Unigrams
Traditional wisdom holds that once documents are turned into bag-of-words (unigram count) vectors, word orders are completely lost. We introduce an approach that, perhaps surprisi...
Xiaojin Zhu, Andrew B. Goldberg, Michael Rabbat, R...
94
Voted
NIPS
2008
15 years 1 months ago
Correlated Bigram LSA for Unsupervised Language Model Adaptation
We present a correlated bigram LSA approach for unsupervised LM adaptation for automatic speech recognition. The model is trained using efficient variational EM and smoothed using...
Yik-Cheung Tam, Tanja Schultz
TREC
2003
15 years 1 months ago
SVM Approach to GeneRIF Annotation
In the biological domain, to extract the newly discovered functional features from massive literature is a major challenging issue. To automatically annotate GeneRIF in a new lite...
Wen-Juan Hou, Chun-Yuan Teng, Chih Lee, Hsin-Hsi C...
ACL
2006
15 years 1 months ago
Contextual Dependencies in Unsupervised Word Segmentation
Developing better methods for segmenting continuous text into words is important for improving the processing of Asian languages, and may shed light on how humans learn to segment...
Sharon Goldwater, Thomas L. Griffiths, Mark Johnso...
71
Voted
ICML
2006
IEEE
16 years 20 days ago
Topic modeling: beyond bag-of-words
Some models of textual corpora employ text generation methods involving n-gram statistics, while others use latent topic variables inferred using the "bag-of-words" assu...
Hanna M. Wallach