Sciweavers

CEAS
2006
Springer
13 years 8 months ago
Email Thread Reassembly Using Similarity Matching
Email thread reassembly is the task of linking messages by parentchild relationships. In this paper, we present two approaches to address this problem. One exploits previously und...
Jen-Yuan Yeh
CEAS
2006
Springer
13 years 8 months ago
Spam Filtering with Naive Bayes - Which Naive Bayes?
Naive Bayes is very popular in commercial and open-source anti-spam e-mail filters. There are, however, several forms of Naive Bayes, something the anti-spam literature does not a...
Vangelis Metsis, Ion Androutsopoulos, Georgios Pal...
CEAS
2006
Springer
13 years 8 months ago
Fast Uncertainty Sampling for Labeling Large E-mail Corpora
One of the biggest challenges in building effective anti-spam solutions is designing systems to defend against the everevolving bag of tricks spammers use to defeat them. Because ...
Richard Segal, Ted Markowitz, William Arnold
CEAS
2006
Springer
13 years 8 months ago
Dynamic Port 25 Blocking to Control SPAM Zombies
This paper presents the results of a case study in which outbound SPAM, here referring to excessive amounts of bulk-generated email, is suppressed using dynamic Port 25 blocking. ...
Jonathan Schmidt
CEAS
2006
Springer
13 years 8 months ago
Online Discriminative Spam Filter Training
We describe a very simple technique for discriminatively training a spam filter. Our results on the TREC Enron spam corpus would have been the best for the Ham at .1% measure, and...
Joshua Goodman, Wen-tau Yih
CEAS
2006
Springer
13 years 8 months ago
Annotating Subsets of the Enron Email Corpus
We present an annotation project for two subsets of the Enron email corpus. The first is a subset of the UC Berkeley Enron Email Analysis Project and the second consists of a port...
Jade Goldstein, Andres Kwasinksi, Paul Kingsbury, ...
CEAS
2006
Springer
13 years 8 months ago
Breaking Anti-Spam Systems with Parasitic Spam
The existance of networks of `bots' raises the possibility of a new type of spam that breaks the current paradigm of spam defense, in which the defence acts purely as a filte...
Morton Swimmer, Ian Whalley, Barry Leiba, Nathanie...
CEAS
2006
Springer
13 years 8 months ago
Algorithmically Determining Store-and-Forward MTA Relays Using DomainKeys
Store-and-forward MTA relaying servers have frequently presented problems to various antispam techniques, such as IPbased reputation or email authentication. Algorithms that find ...
Miles Libbey, Peter Ludemann
CEAS
2006
Springer
13 years 8 months ago
Learning at Low False Positive Rates
Most spam filters are configured for use at a very low falsepositive rate. Typically, the filters are trained with techniques that optimize accuracy or entropy, rather than perfor...
Wen-tau Yih, Joshua Goodman, Geoff Hulten