Background: Feature selection plays an undeniably important role in classification problems involving high dimensional datasets such as microarray datasets. For filter-based featu...
When data resides on tertiary storage, clustering is the key to achieving high retrieval performance. However, a straightforward approach to clustering massive amounts of data on ...
The core task of sponsored search is to retrieve relevant ads for the user’s query. Ads can be retrieved either by exact match, when their bid term is identical to the query, or...
Michael Bendersky, Evgeniy Gabrilovich, Vanja Josi...
Modern scientific applications consume massive volumes of data produced by computer simulations. Such applications require new data management capabilities in order to scale to te...
ABSTRACT In this paper, we report our experiments using a realworld image dataset to examine the effectiveness of Isomap, LLE and KPCA. The 1,897-image dataset we used consists of ...
Mei-Chen Yeh, I-Hsiang Lee, Gang Wu, Yi Wu, Edward...