Sciweavers

150 search results - page 3 / 30
» Attack-resistant frequency counting
Sort
View
CORR
2008
Springer
127views Education» more  CORR 2008»
13 years 5 months ago
A Very Efficient Scheme for Estimating Entropy of Data Streams Using Compressed Counting
Compressed Counting (CC) was recently proposed for approximating the th frequency moments of data streams, for 0 < 2. Under the relaxed strict-Turnstile model, CC dramaticall...
Ping Li
IJCNLP
2005
Springer
13 years 11 months ago
Detecting Article Errors Based on the Mass Count Distinction
Abstract. This paper proposes a method for detecting errors concerning article usage and singular/plural usage based on the mass count distinction. Although the mass count distinct...
Ryo Nagata, Takahiro Wakana, Fumito Masui, Atsuo K...
ACL
2008
13 years 7 months ago
Smoothing a Tera-word Language Model
Frequency counts from very large corpora, such as the Web 1T dataset, have recently become available for language modeling. Omission of low frequency n-gram counts is a practical ...
Deniz Yuret
COLING
2008
13 years 7 months ago
Source Language Markers in EUROPARL Translations
This paper shows that it is very often possible to identify the source language of medium-length speeches in the EUROPARL corpus on the basis of frequency counts of word n-grams (...
Hans van Halteren
LREC
2008
98views Education» more  LREC 2008»
13 years 7 months ago
An Inverted Index for Storing and Retrieving Grammatical Dependencies
Web count statistics gathered from search engines have been widely used as a resource in a variety of NLP tasks. For some tasks, however, the information they exploit is not fine-...
Michaela Atterer, Hinrich Schütze