Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

102

CEAS
2005
Springer

favoriteEmaildiscussreport

129views Internet Technology» more CEAS 2005»

Spam Deobfuscation using a Hidden Markov Model

15 years 6 months ago

Spam Deobfuscation using a Hidden Markov Model

Download www.stanford.edu

To circumvent spam ﬁlters, many spammers attempt to obfuscate their emails by deliberately misspelling words or introducing other errors into the text. For example viagra may be written vigra, or mortgage written m0rt gage. Even though humans have little diﬃculty reading obfuscated emails, most content-based ﬁlters are unable to recognize these obfuscated spam words. In this paper, we present a hidden Markov model for deobfuscating spam emails. We empirically demonstrate that our model is robust to many types of obfuscation including misspellings, incorrect segmentations (adding/removing spaces), and substitutions/insertions of non-alphabetic characters.

Honglak Lee, Andrew Y. Ng

Real-time Traffic

CEAS 2005 | Mortgage Written M0rt | Spam Emails | Spam ﬁlters |

claim paper

Related Content

» Dynamically Weighted Hidden Markov Model for Spam Deobfuscation

» An HMM for detecting spam mail

» Denoising and recognition using hidden Markov models with observation distributions modele...

» Markov Models for Automated ECG Interval Analysis

» Using Spam Farm to Boost PageRank

» Audio Imputation Using the Nonnegative Hidden Markov Model

» Robust visual tracking using autoregressive hidden Markov Model

» LexiconBased Word Recognition Using Support Vector Machine and Hidden Markov Model

» Modeling interleaved hidden processes

Post Info
More Details (n/a)

Added	26 Jun 2010
Updated	26 Jun 2010
Type	Conference
Year	2005
Where	CEAS
Authors	Honglak Lee, Andrew Y. Ng

Comments (0)