Sciweavers

1527 search results - page 12 / 306
» Hidden word statistics
ICDAR
2003
IEEE
15 years 2 months ago
A Text Watermarking Algorithm based on Word Classification and Inter-word Space Statistics
Text documents can be watermarked by patterning the inter-word spaces. This paper proposes a text watermarking algorithm that exploits the novel concepts of word classification an...
Young-Won Kim, Kyung-Ae Moon, Il-Seok Oh
ACIIDS
2010
IEEE
15 years 2 months ago
An Unsupervised Learning and Statistical Approach for Vietnamese Word Recognition and Segmentation
There are two main topics in this paper: (i) Vietnamese words are recognized and sentences are segmented into words by using probabilistic models; (ii) the optimum probabilistic mo...
Hieu Le Trung, Vu Le Anh, Kien Le Trung
CEAS
2005
Springer
15 years 3 months ago
Good Word Attacks on Statistical Spam Filters
Unsolicited commercial email is a significant problem for users and providers of email services. While statistical spam filters have proven useful, senders of spam are learning ...
Daniel Lowd, Christopher Meek
FINTAL
2006
15 years 1 month ago
Statistical Machine Translation of German Compound Words
German compound words pose special problems to statistical machine translation systems: the occurrence of each of the components in the training data is not sufficient for...
Maja Popovic, Daniel Stein, Hermann Ney
NAACL
2010
14 years 7 months ago
Statistical Machine Translation of Texts with Misspelled Words
This paper investigates the impact of misspelled words in statistical machine translation and proposes an extension of the translation engine for handling misspellings. The enhanc...
Nicola Bertoldi, Mauro Cettolo, Marcello Federico