Sciweavers

ACL
2006

Unsupervised Induction of Modern Standard Arabic Verb Classes Using Syntactic Frames and LSA

13 years 5 months ago
Unsupervised Induction of Modern Standard Arabic Verb Classes Using Syntactic Frames and LSA
We exploit the resources in the Arabic Treebank (ATB) and Arabic Gigaword (AG) to determine the best features for the novel task of automatically creating lexical semantic verb classes for Modern Standard Arabic (MSA). The verbs are classified into groups that share semantic elements of meaning as they exhibit similar syntactic behavior. The results of the clustering experiments are compared with a gold standard set of classes, which is approximated by using the noisy English translations provided in the ATB to create Levin-like classes for MSA. The quality of the clusters is found to be sensitive to the inclusion of syntactic frames, LSA vectors, morphological pattern, and subject animacy. The best set of parameters yields an F=1 score of 0.456, compared to a random baseline of an F=1 score of 0.205.
Neal Snider, Mona T. Diab
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2006
Where ACL
Authors Neal Snider, Mona T. Diab
Comments (0)