Sciweavers

735 search results - page 33 / 147
» Corpora and data preparation
PRL
2002
A pseudo-nearest-neighbor approach for missing data recovery on Gaussian random data sets
Missing data handling is an important preparation step for most data discrimination or mining tasks. Inappropriate treatment of missing data may cause large errors or false result...
Xiaolu Huang, Qiuming Zhu
CORR
2007
Springer
Web data modeling for integration in data warehouses
In a data warehousing process, the data preparation phase is crucial. Mastering this phase allows substantial gains in terms of time and performance when performing a multidimensio...
Sami Miniaoui, Jérôme Darmont, Omar B...
SIGMOD
2009
ACM
A grammar-based entity representation framework for data cleaning
Fundamental to data cleaning is the need to account for multiple data representations. We propose a formal framework that can be used to reason about and manipulate data represent...
Arvind Arasu, Raghav Kaushik
CLEAR
2007
Springer
The LIMSI RT07 Lecture Transcription System
A system to automatically transcribe lectures and presentations has been developed in the context of the FP6 Integrated Project CHIL. In addition to the seminar data recorded by th...
Lori Lamel, Eric Bilinski, Jean-Luc Gauvain, Gille...
LREC
2010
A Comprehensive Resource to Evaluate Complex Open Domain Question Answering
We describe two corpora of question and answer pairs collected for complex, open-domain Question Answering (QA) to enable answer classification and re-ranking experiments. We deli...
Silvia Quarteroni, Alessandro Moschitti