Sciweavers

» Corpora and data preparation
COLING
2002
14 years 9 months ago
Recovering Latent Information in Treebanks
Many recent statistical parsers rely on a preprocessing step which uses hand-written, corpus-specific rules to augment the training data with extra information. For example, head-...
David Chiang, Daniel M. Bikel
COLING
2002
14 years 9 months ago
Extraposition: A Case Study in German Sentence Realization
We profile the occurrence of clausal extraposition in corpora from different domains and demonstrate that extraposition is a pervasive phenomenon in German that must be addressed ...
Michael Gamon, Eric K. Ringger, Zhu Zhang, Robert ...
COLING
2002
14 years 9 months ago
Hierarchical Orderings of Textual Units
Text representation is a central task for any approach to automatic learning from texts. It requires a format that allows texts to be interrelated even if they do not share content w...
Alexander Mehler
CORR
2000
Springer
14 years 9 months ago
Prosody-Based Automatic Segmentation of Speech into Sentences and Topics
A crucial step in processing speech audio data for information extraction, topic detection, or browsing/playback is to segment the input into sentence and topic units. Speech segm...
Elizabeth Shriberg, Andreas Stolcke, Dilek Z. Hakk...
CORR
1998
Springer
14 years 9 months ago
Can Subcategorisation Probabilities Help a Statistical Parser?
Research into the automatic acquisition of lexical information from corpora is starting to produce large-scale computational lexicons containing data on the relative frequencies o...
John Carroll, Guido Minnen, Ted Briscoe