Sciweavers

EACL
2006
ACL Anthology
13 years 6 months ago
A Figure of Merit for the Evaluation of Web-Corpus Randomness
In this paper, we present an automated, quantitative, knowledge-poor method to evaluate the randomness of a collection of documents (corpus), with respect to a number of biased pa...
Massimiliano Ciaramita, Marco Baroni