Sciweavers

ACL
2011
12 years 8 months ago
Using Derivation Trees for Treebank Error Detection
This work introduces a new approach to checking treebank consistency. Derivation trees based on a variant of Tree Adjoining Grammar are used to compare the annotation of word sequ...
Seth Kulick, Ann Bies, Justin Mott
ICASSP
2011
IEEE
12 years 8 months ago
Automatically finding semantically consistent n-grams to add new words in LVCSR systems
This paper presents a new method to automatically add n-grams containing out-of-vocabulary (OOV) words to a baseline language model (LM), where these n-grams are sought to be gram...
Gwénolé Lecorvé, Guillaume Gr...
ACL
1996
13 years 5 months ago
The Rhythm of Lexical Stress in Prose
\Prose rhythm" is a widely observed but scarcely quanti ed phenomenon. We describe an information-theoretic model for measuring the regularity of lexical stress in English te...
Doug Beeferman
NIPS
2000
13 years 5 months ago
A Neural Probabilistic Language Model
A goal of statistical language modeling is to learn the joint probability function of sequences of words in a language. This is intrinsically difficult because of the curse of dim...
Yoshua Bengio, Réjean Ducharme, Pascal Vinc...
EACL
2003
ACL Anthology
13 years 5 months ago
Detecting Novel Compounds: The Role of Distributional Evidence
Research on the discovery of terms from corpora has focused on word sequences whose recurrent occurrence in a corpus is indicative of their terminological status, and has not addr...
Mirella Lapata, Alex Lascarides
INEX
2007
Springer
13 years 10 months ago
Phrase Detection in the Wikipedia
The Wikipedia XML collection turned out to be rich of marked-up phrases as we carried out our INEX 2007 experiments. Assuming that a phrase occurs at the inline level of the markup...
Miro Lehtonen, Antoine Doucet