Sciweavers

3 search results - page 1 / 1
» Annotating 200 Million Words: The Bank Of English Project
Sort
View
LREC
2010
182views Education» more  LREC 2010»
13 years 6 months ago
Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus
This article presents a new freely available trilingual corpus (Catalan, Spanish, English) that contains large portions of the Wikipedia and has been automatically enriched with l...
Samuel Reese, Gemma Boleda, Montse Cuadros, Llu&ia...
KDD
2009
ACM
211views Data Mining» more  KDD 2009»
14 years 5 months ago
Address standardization with latent semantic association
Address standardization is a very challenging task in data cleansing. To provide better customer relationship management and business intelligence for customer-oriented cooperates...
Honglei Guo, Huijia Zhu, Zhili Guo, Xiaoxun Zhang,...