Sciweavers

35 search results - page 1 / 7
» Learning Bigrams from Unigrams
Sort
View
ACL
2008
13 years 6 months ago
Learning Bigrams from Unigrams
Traditional wisdom holds that once documents are turned into bag-of-words (unigram count) vectors, word orders are completely lost. We introduce an approach that, perhaps surprisi...
Xiaojin Zhu, Andrew B. Goldberg, Michael Rabbat, R...
NIPS
2008
13 years 6 months ago
Correlated Bigram LSA for Unsupervised Language Model Adaptation
We present a correlated bigram LSA approach for unsupervised LM adaptation for automatic speech recognition. The model is trained using efficient variational EM and smoothed using...
Yik-Cheung Tam, Tanja Schultz
TREC
2003
13 years 6 months ago
SVM Approach to GeneRIF Annotation
In the biological domain, to extract the newly discovered functional features from massive literature is a major challenging issue. To automatically annotate GeneRIF in a new lite...
Wen-Juan Hou, Chun-Yuan Teng, Chih Lee, Hsin-Hsi C...
ACL
2006
13 years 6 months ago
Contextual Dependencies in Unsupervised Word Segmentation
Developing better methods for segmenting continuous text into words is important for improving the processing of Asian languages, and may shed light on how humans learn to segment...
Sharon Goldwater, Thomas L. Griffiths, Mark Johnso...
ICML
2006
IEEE
14 years 5 months ago
Topic modeling: beyond bag-of-words
Some models of textual corpora employ text generation methods involving n-gram statistics, while others use latent topic variables inferred using the "bag-of-words" assu...
Hanna M. Wallach