Feature selection for ranking using boosted trees

15 years 6 months ago

Download fengpan.net

Modern search engines have to be fast to satisfy users, so there are hard back-end latency requirements. The set of features useful for search ranking functions, though, continues to grow, making feature computation a latency bottleneck. As a result, not all available features can be used for ranking, and in fact, much of the time, only a small percentage of these features can be used. Thus, it is crucial to have a feature selection mechanism that can find a subset of features that both meets latency requirements and achieves high relevance. To this end, we explore different feature selection methods using boosted regression trees, including both greedy approaches (selecting the features with highest relative importance as computed by boosted trees; discounting importance by feature similarity and a randomized approach. We evaluate and compare these approaches using data from a commercial search engine. The experimental results show that the proposed randomized feature selection with ...

Feng Pan, Tim Converse, David Ahn, Franco Salvetti

Real-time Traffic

CIKM 2009 | Database | Feature Selection | Latency Requirements | Search Engine |

claim paper

Related Content

» Boosting for Document Routing

» Ranking Categorical Features Using Generalization Properties

» Coupling feature selection and machine learning methods for navigational query identificat...

» Online feature selection and classification

» Boosting recombined weak classifiers

» Retrieval and ranking of biomedical images using boosted haar features

» Gabor Feature Selection for Face Recognition Using Improved AdaBoost Learning

» Learning to associate HybridBoosted multitarget tracker for crowded scene

» Feature selection and nearest centroid classification for protein mass spectrometry

Post Info
More Details (n/a)

Added	26 May 2010
Updated	26 May 2010
Type	Conference
Year	2009
Where	CIKM
Authors	Feng Pan, Tim Converse, David Ahn, Franco Salvetti, Gianluca Donato

Comments (0)

Sciweavers

Feature selection for ranking using boosted trees

CIKM 2009 | Database | Feature Selection | Latency Requirements | Search Engine |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers