Sciweavers

LREC
2008
111views Education» more  LREC 2008»
14 years 10 months ago
Low-Density Language Bootstrapping: the Case of Tajiki Persian
Low-density languages raise difficulties for standard approaches to natural language processing that depend on large online corpora. Using Persian as a case study, we propose a no...
Karine Megerdoomian, Dan Parvaz
66
Voted
LREC
2010
155views Education» more  LREC 2010»
14 years 10 months ago
How Specialized are Specialized Corpora? Behavioral Evaluation of Corpus Representativeness for Maltese
In this paper we bring to light a novel intersection between corpus linguistics and behavioral data that can be employed as an evaluation metric for resources for low-density lang...
Jerid Francom, Amy LaCross, Adam Ussishkin