Sciweavers

150 search results - page 3 / 30
» Attack-resistant frequency counting
Sort
View
CORR
2008
Springer
127views Education» more  CORR 2008»
14 years 9 months ago
A Very Efficient Scheme for Estimating Entropy of Data Streams Using Compressed Counting
Compressed Counting (CC) was recently proposed for approximating the th frequency moments of data streams, for 0 < 2. Under the relaxed strict-Turnstile model, CC dramaticall...
Ping Li
IJCNLP
2005
Springer
15 years 3 months ago
Detecting Article Errors Based on the Mass Count Distinction
Abstract. This paper proposes a method for detecting errors concerning article usage and singular/plural usage based on the mass count distinction. Although the mass count distinct...
Ryo Nagata, Takahiro Wakana, Fumito Masui, Atsuo K...
ACL
2008
14 years 11 months ago
Smoothing a Tera-word Language Model
Frequency counts from very large corpora, such as the Web 1T dataset, have recently become available for language modeling. Omission of low frequency n-gram counts is a practical ...
Deniz Yuret
COLING
2008
14 years 11 months ago
Source Language Markers in EUROPARL Translations
This paper shows that it is very often possible to identify the source language of medium-length speeches in the EUROPARL corpus on the basis of frequency counts of word n-grams (...
Hans van Halteren
LREC
2008
98views Education» more  LREC 2008»
14 years 11 months ago
An Inverted Index for Storing and Retrieving Grammatical Dependencies
Web count statistics gathered from search engines have been widely used as a resource in a variety of NLP tasks. For some tasks, however, the information they exploit is not fine-...
Michaela Atterer, Hinrich Schütze