Sciweavers

735 search results - page 35 / 147
» Corpora and data preparation
Sort
View
FCSC
2010
238views more  FCSC 2010»
14 years 7 months ago
Knowledge discovery through directed probabilistic topic models: a survey
Graphical models have become the basic framework for topic based probabilistic modeling. Especially models with latent variables have proved to be effective in capturing hidden str...
Ali Daud, Juanzi Li, Lizhu Zhou, Faqir Muhammad
ACL
1998
14 years 11 months ago
How Verb Subcategorization Frequencies Are Affected By Corpus Choice
The probabilistic relation between verbs and their arguments plays an important role in modern statistical parsers and supertaggers, and in psychological theories of language proc...
Douglas Roland, Daniel Jurafsky
ACL
2009
14 years 7 months ago
Unsupervised Multilingual Grammar Induction
We investigate the task of unsupervised constituency parsing from bilingual parallel corpora. Our goal is to use bilingual cues to learn improved parsing models for each language ...
Benjamin Snyder, Tahira Naseem, Regina Barzilay
INTERSPEECH
2010
14 years 4 months ago
Using dependency parsing and machine learning for factoid question answering on spoken documents
This paper presents our experiments in question answering for speech corpora. These experiments focus on improving the answer extraction step of the QA process. We present two app...
Pere Comas, Jordi Turmo, Lluís Màrqu...
HIS
2001
14 years 11 months ago
An Automated Report Generation Tool for the Data Understanding Phase
To successfully prepare and model data, the data miner needs to be aware of the properties of the data manifold. In this chapter, the outline of a tool for automatically generating...
Juha Vesanto, Jaakko Hollmén