Sciweavers

ACL
2012
13 years 17 days ago
A Broad-Coverage Normalization System for Social Media Language
Social media language contains a huge number and wide variety of nonstandard tokens, created both intentionally and unintentionally by users. It is of crucial importance to norm...
Fei Liu, Fuliang Weng, Xiao Jiang
NAACL
2001
14 years 11 months ago
Identifying Cognates by Phonetic and Semantic Similarity
I present a method of identifying cognates in the vocabularies of related languages. I show that a measure of phonetic similarity based on multivalued features performs better tha...
Grzegorz Kondrak