Sciweavers

3371 search results - page 24 / 675
» Using parsimonious language models on web data
Sort
View
ACL
2001
14 years 11 months ago
Multi-Class Composite N-gram Language Model for Spoken Language Processing Using Multiple Word Clusters
In this paper, a new language model, the Multi-Class Composite N-gram, is proposed to avoid a data sparseness problem for spoken language in that it is difficult to collect traini...
Hirofumi Yamamoto, Shuntaro Isogai, Yoshinori Sagi...
LREC
2008
146views Education» more  LREC 2008»
14 years 11 months ago
On the Use of Web Resources and Natural Language Processing Techniques to Improve Automatic Speech Recognition Systems
Language models used in current automatic speech recognition systems are trained on general-purpose corpora and are therefore not relevant to transcribe spoken documents dealing w...
Gwénolé Lecorvé, Guillaume Gr...
ESCIENCE
2006
IEEE
15 years 1 months ago
ODIN: A Model for Adapting and Enriching Legacy Infrastructure
The Online Database of Interlinear Text (ODIN)1 is a database of interlinear text "snippets", harvested mostly from scholarly documents posted to the Web. Although large...
William D. Lewis
BMCBI
2007
167views more  BMCBI 2007»
14 years 9 months ago
AlzPharm: integration of neurodegeneration data using RDF
Background: Neuroscientists often need to access a wide range of data sets distributed over the Internet. These data sets, however, are typically neither integrated nor interopera...
Hugo Y. K. Lam, Luis N. Marenco, Tim Clark, Yong G...
NIPS
2001
14 years 11 months ago
Model Based Population Tracking and Automatic Detection of Distribution Changes
Probabilistic mixture models are used for a broad range of data analysis tasks such as clustering, classification, predictive modeling, etc. Due to their inherent probabilistic na...
Igor V. Cadez, Paul S. Bradley