Sciweavers

EACL
2003
ACL Anthology

A Comparison of Event Models for Naive Bayes Anti-Spam E-Mail Filtering

13 years 5 months ago
A Comparison of Event Models for Naive Bayes Anti-Spam E-Mail Filtering
We describe experiments with a Naive Bayes text classifier in the context of anti-spam E-mail filtering, using two different statistical event models: a multi-variate Bernoulli model and a multinomial model. We introduce a family of feature ranking functions for feature selection in the multinomial event model that take account of the word frequency information. We present evaluation results on two publicly available corpora of legitimate and spam E-mails. We find that the multinomial model is less biased towards one class and achieves slightly higher accuracy than the multi-variate Bernoulli model.
Karl-Michael Schneider
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2003
Where EACL
Authors Karl-Michael Schneider
Comments (0)