Sciweavers

CORIA
2006

Unnatural language detection

13 years 6 months ago
Unnatural language detection
In the context of web search engines, the escalation between ranking techniques and spamdexing techniques has led to the appearance of faked contents in web pages. If random sequences of keywords are easily detectable, web pages produced by dedicated content generators are a lot more difficult to detect. Motivated by search engines applications, we will focus on the problem of automatic unnatural language detection. We will study both syntactical and semantical aspects of this problem, and for both of them we will present probabilistic and symbolic approaches. R
Thomas Lavergne
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2006
Where CORIA
Authors Thomas Lavergne
Comments (0)