Sciweavers

34 search results - page 1 / 7
» Mining the Web to Create Minority Language Corpora
Sort
View
CIKM
2001
Springer
13 years 12 months ago
Mining the Web to Create Minority Language Corpora
The Web is a valuable source of language speci c resources but the process of collecting, organizing and utilizing these resources is di cult. We describe CorpusBuilder, an approa...
Rayid Ghani, Rosie Jones, Dunja Mladenic
ACL
2004
13 years 8 months ago
Creating Multilingual Translation Lexicons with Regional Variations Using Web Corpora
The purpose of this paper is to automatically create multilingual translation lexicons with regional variations. We propose a transitive translation approach to determine translat...
Pu-Jen Cheng, Wen-Hsiang Lu, Jei-Wen Teng, Lee-Fen...
WWW
2004
ACM
14 years 8 months ago
Liveclassifier: creating hierarchical text classifiers through web corpora
Many Web information services utilize techniques of information extraction (IE) to collect important facts from the Web. To create more advanced services, one possible method is t...
Chien-Chung Huang, Shui-Lung Chuang, Lee-Feng Chie...
IR
2008
13 years 7 months ago
Focused web crawling in the acquisition of comparable corpora
CLIR resources, such as dictionaries and parallel corpora, are scarce for special domains. Obtaining comparable corpora automatically for such domains could be an answer to this p...
Tuomas Talvensaari, Ari Pirkola, Kalervo Järv...
KDD
2005
ACM
185views Data Mining» more  KDD 2005»
14 years 7 months ago
Mining comparable bilingual text corpora for cross-language information integration
Integrating information in multiple natural languages is a challenging task that often requires manually created linguistic resources such as a bilingual dictionary or examples of...
Tao Tao, ChengXiang Zhai