Improved Phishing Detection using Model-Based Features

11 years 10 months ago
Improved Phishing Detection using Model-Based Features
Phishing emails are a real threat to internet communication and web economy. Criminals are trying to convince unsuspecting online users to reveal passwords, account numbers, social security numbers or other personal information. Filtering approaches using blacklists are not completely effective as about every minute a new phishing scam is created. We investigate the statistical filtering of phishing emails, where a classifier is trained on characteristic features of existing emails and subsequently is able to identify new phishing emails with different contents. We propose advanced email features generated by adaptively trained Dynamic Markov Chains and by novel latent Class-Topic Models. On a publicly available test corpus classifiers using these features are able to reduce the number of misclassified emails by two thirds compared to previous work. Using a recently proposed more expressive evaluation method we show that these results are statistically significant. In addition we succ...
André Bergholz, Jeong Ho Chang, Gerhard Paa
Added 12 Oct 2010
Updated 12 Oct 2010
Type Conference
Year 2008
Where CEAS
Authors André Bergholz, Jeong Ho Chang, Gerhard Paass, Frank Reichartz, Siehyun Strobel
Comments (0)