Sciweavers

910 search results - page 164 / 182
» Standardization of Speech Corpus
Sort
View
INFORMATICALT
2006
116views more  INFORMATICALT 2006»
14 years 11 months ago
Cache-based Statistical Language Models of English and Highly Inflected Lithuanian
This paper investigates a variety of statistical cache-based language models built upon three corpora: English, Lithuanian, and Lithuanian base forms. The impact of the cache size,...
Airenas Vaiciunas, Gailius Raskinis
90
Voted
JAIR
2006
137views more  JAIR 2006»
14 years 11 months ago
Learning Sentence-internal Temporal Relations
In this paper we propose a data intensive approach for inferring sentence-internal temporal relations. Temporal inference is relevant for practical NLP applications which either e...
Maria Lapata, Alex Lascarides
89
Voted
CORR
2000
Springer
126views Education» more  CORR 2000»
14 years 10 months ago
Learning to Filter Spam E-Mail: A Comparison of a Naive Bayesian and a Memory-Based Approach
We investigate the performance of two machine learning algorithms in the context of antispam filtering. The increasing volume of unsolicited bulk e-mail (spam) has generated a nee...
Ion Androutsopoulos, Georgios Paliouras, Vangelis ...
69
Voted
TASLP
2002
99views more  TASLP 2002»
14 years 10 months ago
A system for spoken query information retrieval on mobile devices
Abstract--With the proliferation of handheld devices, information access on mobile devices is a topic of growing relevance. This paper presents a system that allows the user to sea...
E. Chang, Frank Seide, Helen M. Meng, Zhuoran Chen...
CIKM
2010
Springer
14 years 9 months ago
Fast query expansion using approximations of relevance models
Pseudo-relevance feedback (PRF) improves search quality by expanding the query using terms from high-ranking documents from an initial retrieval. Although PRF can often result in ...
Marc-Allen Cartright, James Allan, Victor Lavrenko...