Sciweavers

PAKDD
2005
ACM

Improved Bayesian Spam Filtering Based on Co-weighted Multi-area Information

13 years 9 months ago
Improved Bayesian Spam Filtering Based on Co-weighted Multi-area Information
Abstract. Bayesian spam filters, in general, compute probability estimations for tokens either without considering the email areas of occurrences except the body or treating the same token occurred in different areas as different tokens. However, in reality the same token occurring in different areas are inter-related and the relation too could play role in the classification. In this paper we incorporated this novel idea, co-relating multi-area information by co-weighting them and obtaining more effective combined integrated probability estimations for tokens. The new approach is compared with individual area-wise estimations and traditional separate estimations in all areas, and the experimental results with three public corpora showed significant improvement, stability, robustness and consistency in the spam filtering with the proposed estimation.
Raju Shrestha, Yaping Lin
Added 28 Jun 2010
Updated 28 Jun 2010
Type Conference
Year 2005
Where PAKDD
Authors Raju Shrestha, Yaping Lin
Comments (0)