Search Sciweavers | Sciweavers

19

ICGI
2010
Springer

161views Natural Language Processing» more ICGI 2010»

Enhanced Suffix Arrays as Language Models: Virtual k-Testable Languages

13 years 6 months ago

Abstract. In this article, we propose the use of suffix arrays to efficiently implement n-gram language models with practically unlimited size n. This approach, which is used with ...

Herman Stehouwer, Menno van Zaanen

claim paper

Read More »

13

click to vote

NAACL
1994

87views Computational Linguistics» more NAACL 1994»

On Using Written Language Training Data for Spoken Language Modeling

13 years 6 months ago

Download acl.ldc.upenn.edu

We attemped to improve recognition accuracy by reducing the inadequacies of the lexicon and language model. Specifically we address the following three problems: (1) the best size...

Richard M. Schwartz, Long Nguyen, Francis Kubala, ...

claim paper

Read More »

16

click to vote

CLEF
2004
Springer

143views Information Technology» more CLEF 2004»

UB at CLEF2004: Cross Language Information Retrieval Using Statistical Language Models

13 years 9 months ago

Download courses.unt.edu

This paper presents the results of the State University of New York at Buffalo (UB) in the Mono-lingual and Multi-lingual tasks at CLEF 2004. For these tasks we used an approach ba...

Miguel E. Ruiz, Munirathnam Srikanth

claim paper

Read More »

15

click to vote

ACL
2012

181views Computational Linguistics» more ACL 2012»

Deciphering Foreign Language by Combining Language Models and Context Vectors

11 years 7 months ago

Download www-i6.informatik.rwth-aachen.de

In this paper we show how to train statistical machine translation systems on reallife tasks using only non-parallel monolingual data from two languages. We present a modiﬁcatio...

Malte Nuhn, Arne Mauser, Hermann Ney

claim paper

Read More »

15

click to vote

ACL
2001

188views Computational Linguistics» more ACL 2001»

Multi-Class Composite N-gram Language Model for Spoken Language Processing Using Multiple Word Clusters

13 years 6 months ago

Download acl.ldc.upenn.edu

In this paper, a new language model, the Multi-Class Composite N-gram, is proposed to avoid a data sparseness problem for spoken language in that it is difficult to collect traini...

Hirofumi Yamamoto, Shuntaro Isogai, Yoshinori Sagi...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers