Sciweavers

492 search results - page 92 / 99
» trec 2008
KDD
2009
ACM
170 views, Data Mining
15 years 10 months ago
Genre-based decomposition of email class noise
Corruption of data by class-label noise is an important practical concern impacting many classification problems. Studies of data cleaning techniques often assume a uniform label ...
Aleksander Kolcz, Gordon V. Cormack
CEAS
2008
Springer
14 years 11 months ago
A Mail Client Plugin for Privacy-Preserving Spam Filter Evaluation
We describe a plugin extension to the Thunderbird Mail Client to support standardized evaluation of multiple spam filters on private mail streams. Researchers need not view or han...
Mona Mojdeh, Gordon V. Cormack
ADC
2009
Springer
140 views, Database
15 years 4 months ago
Score Aggregation Techniques in Retrieval Experimentation
Comparative evaluations of information retrieval systems are based on a number of key premises, including that representative topic sets can be created, that suitable relevance ju...
Sri Devi Ravana, Alistair Moffat
AAAI
2008
14 years 11 months ago
Concept-Based Feature Generation and Selection for Information Retrieval
Traditional information retrieval systems use query words to identify relevant documents. In difficult retrieval tasks, however, one needs access to a wealth of background knowled...
Ofer Egozi, Evgeniy Gabrilovich, Shaul Markovitch
AUSDM
2008
Springer
243 views, Data Mining
14 years 11 months ago
Structure-Based Document Model with Discrete Wavelet Transforms and Its Application to Document Classification
Term signal is an existing text representation that depicts a term as a vector of frequencies of occurrences in a number of user-defined partitions of a document. Although term si...
Supphachai Thaicharoen, Tom Altman, Krzysztof J. C...