Sciweavers

3 search results for: Subdomain Sensitive Statistical Parsing using Raw Corpora
LREC
2008
Subdomain Sensitive Statistical Parsing using Raw Corpora
Modern statistical parsers are trained on large annotated corpora (treebanks). These treebanks usually consist of sentences addressing different subdomains (e.g. sports, politics,...
Barbara Plank, Khalil Sima'an
EMNLP
2007
Bootstrapping Feature-Rich Dependency Parsers with Entropic Priors
One may need to build a statistical parser for a new language, using only a very small labeled treebank together with raw text. We argue that bootstrapping a parser is most promis...
David A. Smith, Jason Eisner
FLAIRS
2003
Orthographic Case Restoration Using Supervised Learning Without Manual Annotation
One challenge in text processing is the treatment of case insensitive documents such as speech recognition results. The traditional approach is to re-train a language model exclud...
Cheng Niu, Wei Li 0003, Jihong Ding, Rohini K. Sri...