Sciweavers

ICASSP
2011
IEEE
12 years 8 months ago
Generating compound words with high order n-gram information in large vocabulary speech recognition systems
In this work we concentrate on generating compound words with high order n-gram information for speech recognition. In most existing compound words generation methods, only bi-gra...
Jie Zhou, Qin Shi, Yong Qin
ICASSP
2011
IEEE
12 years 8 months ago
Speaker diarization of meetings based on speaker role n-gram models
Speaker diarization of meeting recordings is generally based on acoustic information ignoring that meetings are instances of conversations. Several recent works have shown that th...
Fabio Valente, Deepu Vijayasenan, Petr Motlí...
SIGIR
2010
ACM
12 years 11 months ago
Web N-gram workshop 2010
The Web N-gram Workshop was held on July 23, 2010 in Geneva, Switzerland, in conjunction with the 33rd Annual ACM SIGIR Conference. The workshop brought together leaders in inform...
Chengxiang Zhai, Kuansan Wang, David Yarowsky, Ste...
EMNLP
2010
13 years 2 months ago
Storing the Web in Memory: Space Efficient Language Models with Constant Time Retrieval
We present three novel methods of compactly storing very large n-gram language models. These methods use substantially less space than all known approaches and allow n-gram probab...
David Guthrie, Mark Hepple
ACL
2010
13 years 2 months ago
Creating Robust Supervised Classifiers via Web-Scale N-Gram Data
In this paper, we systematically assess the value of using web-scale N-gram data in state-of-the-art supervised NLP classifiers. We compare classifiers that include or exclude fea...
Shane Bergsma, Emily Pitler, Dekang Lin
IPM
2007
13 years 4 months ago
s-grams: Defining generalized n-grams for information retrieval
For European languages, the n-gram has proved to be a cost-effective alternative to morphological processing during indexing, and it has been studied and analyzed extensively us...
Anni Järvelin, Antti Järvelin, Kalervo J...
ICMCS
2000
IEEE
13 years 9 months ago
A Study on N-Gram Indexing of Musical Features
Since only simple symbol-based manipulations are needed, n-gram indexing is used for natural languages where syntactic or semantic analyses are often difficult. Music, whose automatic...
Chi Lap Yip, Ben Kao
SIGIR
2003
ACM
13 years 9 months ago
Single n-gram stemming
Stemming can improve retrieval accuracy, but stemmers are language-specific. Character n-gram tokenization achieves many of the benefits of stemming in a language-independent way,...
James Mayfield, Paul McNamee
CLEF
2004
Springer
13 years 10 months ago
Application of Variable Length N-Gram Vectors to Monolingual and Bilingual Information Retrieval
Our group in the Department of Informatics at the University of Oviedo has participated, for the first time, in two tasks at CLEF: monolingual (Russian) and bilingual (Spanish-to-E...
Daniel Gayo-Avello, Darío Álvarez Gu...
ICASSP
2008
IEEE
13 years 11 months ago
Language recognition with discriminative keyword selection
One commonly used approach for language recognition is to convert the input speech into a sequence of tokens such as words or phones and then to use these token sequences to deter...
Fred S. Richardson, William M. Campbell