Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

14

AIPRF
2007

favoriteEmaildiscussreport

116views Artificial Intelligence» more AIPRF 2007»

Evaluation of Different Approaches to Training a Genre Classifier

13 years 5 months ago

Evaluation of Different Approaches to Training a Genre Classifier

Download dis.ijs.si

This paper presents experiments on classifying web pages by genre. Firstly, a corpus of 1539 manually labeled web pages was prepared. Secondly, 502 genre features were selected based on the literature and the observation of the corpus. Thirdly, these features were extracted from the corpus to obtain a data set. Finally, three machine learning algorithms, one for induction of decision trees (J48) and two ensemble algorithms (bagging and boosting), were trained and tested on the data set. Additionally, impact of feature selection on ensemble algorithms was tested. The best performed genre classifiers in terms of precision were selected to obtain the best of set of classifiers. On average the best of set achieved 9% better precision, but slightly worse recall. Accuracy and F-measure did not vary significantly. The results indicate that classification by genre could be a useful addition to search engines.

Vedrana Vidulin, Mitja Lustrek, Matjaz Gams

Real-time Traffic

AIPRF 2007 | Artificial Intelligence | Data Set | Ensemble Algorithms | Web Pages |

claim paper

Related Content

» Learning to classify documents according to genre

» A Comparison of Stylometric and Lexical Features for Web Genre Classification and Emotion ...

» Selection of Training Instances for Music Genre Classification

» Musical genre classification of audio signals

» Partofspeech histograms for genre classification of text

» When Specialists and Generalists Work Together Overcoming Domain Dependence in Sentiment T...

» Automatic Musical Genre Classification of Audio Signals

» Classification of musical genre a machine learning approach

» NonNegative Tensor Factorization Applied to Music Genre Classification

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	AIPRF
Authors	Vedrana Vidulin, Mitja Lustrek, Matjaz Gams

Comments (0)