Sciweavers

WWW
2009
ACM

Large scale multi-label classification via metalabeler

14 years 5 months ago
Large scale multi-label classification via metalabeler
The explosion of online content has made the management of such content non-trivial. Web-related tasks such as web page categorization, news filtering, query categorization, tag recommendation, etc. often involve the construction of multilabel categorization systems on a large scale. Existing multilabel classification methods either do not scale or have unsatisfactory performance. In this work, we propose MetaLabeler to automatically determine the relevant set of labels for each instance without intensive human involvement or expensive cross-validation. Extensive experiments conducted on benchmark data show that the MetaLabeler tends to outperform existing methods. Moreover, MetaLabeler scales to millions of multi-labeled instances and can be deployed easily. This enables us to apply the MetaLabeler to a large scale query categorization problem in Yahoo!, yielding a significant improvement in performance. Categories and Subject Descriptors H.2.8 [Database Management]: Database applica...
Lei Tang, Suju Rajan, Vijay K. Narayanan
Added 21 Nov 2009
Updated 21 Nov 2009
Type Conference
Year 2009
Where WWW
Authors Lei Tang, Suju Rajan, Vijay K. Narayanan
Comments (0)