Detecting Word Substitutions in Text

9 years 10 months ago
Detecting Word Substitutions in Text
Searching for words on a watchlist is one way in which large-scale surveillance of communication can be done, for example, in intelligence and counterterrorism settings. One obvious defense is to replace words that might attract attention to a message with other more innocuous words. For example, the sentence "the attack will be tomorrow" might be altered to "the complex will be tomorrow," since "complex" is a word whose frequency is close to that of "attack." Such substitutions are readily detectable by humans since they do not make sense. We address the problem of detecting such substitutions automatically by looking for discrepancies between words and their contexts and using only syntactic information. We define a set of measures, each of which is quite weak, but which together produce per-sentence detection rates around 90 percent with false positive rates around 10 percent. Rules for combining per-sentence detection into per-message detecti...
SzeWang Fong, Dmitri Roussinov, David B. Skillicor
Added 15 Dec 2010
Updated 15 Dec 2010
Type Journal
Year 2008
Where TKDE
Authors SzeWang Fong, Dmitri Roussinov, David B. Skillicorn
Comments (0)