Sciweavers

ACL
2009
13 years 2 months ago
Knowing the Unseen: Estimating Vocabulary Size over Unseen Samples
Empirical studies on corpora involve making measurements of several quantities for the purpose of comparing corpora, creating language models or to make generalizations about spec...
Suma Bhat, Richard Sproat