Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

28

SEBD
2008

favoriteEmaildiscussreport

177views Database» more SEBD 2008»

Using PageRank in Feature Selection

13 years 10 months ago

Using PageRank in Feature Selection

Download www.di.unito.it

Abstract. Feature selection is an important task in data mining because it allows to reduce the data dimensionality and eliminates the noisy variables. Traditionally, feature selection has been applied in supervised scenarios rather than in unsupervised ones. Nowadays, the amount of unsupervised data available on the web is huge, thus motivating an increasing interest in feature selection for unsupervised data. In this paper we present some results in the domain of document categorization. We use the well-known PageRank algorithm to perform a random-walk through the feature space of the documents. This allows to rank and subsequently choose those features that better represent the data set. When compared with previous work based on information gain, our method allows classifiers to obtain good accuracy especially when few features are retained.

Dino Ienco, Rosa Meo, Marco Botta

Real-time Traffic

Database | Feature Selection | SEBD 2008 | Unsupervised Data | Unsupervised Ones |

claim paper

Related Content

» Using Hyperlink Features to Personalize Web Search

» Robust PageRank and locally computable spam detection features

» Resource Discovery Using PageRank Technique in Grid Environment

» Markov Logic Sets Towards Lifted Information Retrieval Using PageRank and Label Propagatio...

» Beyond PageRank machine learning for static ranking

» PageRank and the random surfer model

» Index Design for Dynamic Personalized PageRank

» An InnerOuter Stationary Iteration for Computing PageRank

» An InnerOuter Iteration for Computing PageRank

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2008
Where	SEBD
Authors	Dino Ienco, Rosa Meo, Marco Botta

Comments (0)