Unsupervised linear score normalization revisited

11 years 7 months ago

Download www.aviarampatzis.com

We give a fresh look into score normalization for merging result-lists, isolating the problem from other components. We focus on three of the simplest, practical, and widelyused linear methods which do not require any training data, i.e. MinMax, Sum, and Z-Score. We provide theoretical arguments on why and when the methods work, and evaluate them experimentally. We ﬁnd that MinMax is the most robust under many circumstances, and that Sum is— in contrast to previous literature—the worst. Based on the insights gained, we propose another three simple methods which work as good or better than the baselines. Categories and Subject Descriptors H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval Keywords Score Normalization, Distributed Retrieval

Ilya Markov, Avi Arampatzis, Fabio Crestani

Real-time Traffic

Information Storage And Retrieval | Information Technology | Linear Methods | SIGIR 2012 | Z Score |

claim paper

Post Info
More Details (n/a)

Added	28 Sep 2012
Updated	28 Sep 2012
Type	Journal
Year	2012
Where	SIGIR
Authors	Ilya Markov, Avi Arampatzis, Fabio Crestani

Comments (0)

Sciweavers

Unsupervised linear score normalization revisited

Information Storage And Retrieval | Information Technology | Linear Methods | SIGIR 2012 | Z Score |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers