Sciweavers

1526 search results - page 202 / 306
» Low-cost and robust evaluation of information retrieval syst...
Sort
View
CAISE
2009
Springer
15 years 8 months ago
Measuring and Comparing Effectiveness of Data Quality Techniques
Abstract. Poor quality data may be detected and corrected by performing various quality assurance activities that rely on techniques with different efficacy and cost. In this pape...
Lei Jiang, Daniele Barone, Alexander Borgida, John...
SIGIR
2010
ACM
15 years 5 months ago
Crowdsourcing a wikipedia vandalism corpus
We report on the construction of the PAN Wikipedia vandalism corpus, PAN-WVC-10, using Amazon’s Mechanical Turk. The corpus compiles 32 452 edits on 28 468 Wikipedia articles, a...
Martin Potthast
KDD
2004
ACM
114views Data Mining» more  KDD 2004»
16 years 1 months ago
Mining reference tables for automatic text segmentation
Automatically segmenting unstructured text strings into structured records is necessary for importing the information contained in legacy sources and text collections into a data ...
Eugene Agichtein, Venkatesh Ganti
TREC
2004
15 years 2 months ago
Overview of the TREC 2004 Question Answering Track
The TREC 2004 Question Answering track contained a single task in which question series were used to define a set of targets. Each series contained factoid and list questions and ...
Ellen M. Voorhees
ICCCI
2009
Springer
15 years 6 months ago
On Deriving Tagsonomies: Keyword Relations Coming from Crowd
Abstract. Many keyword-based approaches to text classification, information retrieval or even user modeling for adaptive web-based system could benefit from knowledge on relation...
Michal Barla, Mária Bieliková