Sciweavers

ACL
2012
12 years 11 months ago
A Broad-Coverage Normalization System for Social Media Language
Social media language contains huge amount and wide variety of nonstandard tokens, created both intentionally and unintentionally by the users. It is of crucial importance to norm...
Fei Liu, Fuliang Weng, Xiao Jiang
74
Voted
NAACL
2001
14 years 10 months ago
Identifying Cognates by Phonetic and Semantic Similarity
I present a method of identifying cognates in the vocabularies of related languages. I show that a measure of phonetic similarity based on multivalued features performs better tha...
Grzegorz Kondrak