Sciweavers

78 search results - page 4 / 16
» Learning Common Grammar from Multilingual Corpus
Sort
View
LREC
2008
88views Education» more  LREC 2008»
14 years 11 months ago
A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization
Tokenization is one of the initial steps done for almost any text processing task. It is not particularly recognized as a challenging task for English monolingual systems but it r...
Oana Frunza
JMLR
2010
192views more  JMLR 2010»
14 years 4 months ago
Inducing Tree-Substitution Grammars
Inducing a grammar from text has proven to be a notoriously challenging learning task despite decades of research. The primary reason for its difficulty is that in order to induce...
Trevor Cohn, Phil Blunsom, Sharon Goldwater
ACL
1998
14 years 11 months ago
Automatic Acquisition of Language Model based on Head-Dependent Relation between Words
Language modeling is to associate a sequence of words with a priori probability, which is a key part of many natural language applications such as speech recognition and statistic...
Seungmi Lee, Key-Sun Choi
LREC
2008
120views Education» more  LREC 2008»
14 years 11 months ago
The U.S. Policy Agenda Legislation Corpus Volume 1 - a Language Resource from 1947 - 1998
We introduce the corpus of United States Congressional bills from 1947 to 1998 for use by language research communities. The U.S. Policy Agenda Legislation Corpus Volume 1 (USPALC...
Stephen Purpura, John Wilkerson, Dustin Hillard
BIRTHDAY
2009
Springer
15 years 4 months ago
Formal Grammars of Early Language
We propose to model the development of language by a series of formal grammars, accounting for the linguistic capacity of children at the very early stages of mastering language. T...
Shuly Wintner, Alon Lavie, Brian MacWhinney