Search Sciweavers | Sciweavers

34 search results - page 1 / 7

» Mining the Web to Create Minority Language Corpora

CIKM
2001
Springer

82 views, Information Technology

Mining the Web to Create Minority Language Corpora

15 years 11 months ago

Download www.accenture.com

The Web is a valuable source of language specific resources but the process of collecting, organizing and utilizing these resources is difficult. We describe CorpusBuilder, an approa...

Rayid Ghani, Rosie Jones, Dunja Mladenic

ACL
2004

88 views, Computational Linguistics

Creating Multilingual Translation Lexicons with Regional Variations Using Web Corpora

15 years 8 months ago

Download www.mt-archive.info

The purpose of this paper is to automatically create multilingual translation lexicons with regional variations. We propose a transitive translation approach to determine translat...

Pu-Jen Cheng, Wen-Hsiang Lu, Jei-Wen Teng, Lee-Fen...

WWW
2004
ACM

117 views, Internet Technology

Liveclassifier: creating hierarchical text classifiers through web corpora

16 years 7 months ago

Download www.iw3c2.org

Many Web information services utilize techniques of information extraction (IE) to collect important facts from the Web. To create more advanced services, one possible method is t...

Chien-Chung Huang, Shui-Lung Chuang, Lee-Feng Chie...

IR
2008

189 views, Natural Language Processing

Focused web crawling in the acquisition of comparable corpora

15 years 6 months ago

Download www.info.uta.fi

CLIR resources, such as dictionaries and parallel corpora, are scarce for special domains. Obtaining comparable corpora automatically for such domains could be an answer to this p...

Tuomas Talvensaari, Ari Pirkola, Kalervo Järv...

KDD
2005
ACM

185 views, Data Mining

Mining comparable bilingual text corpora for cross-language information integration

16 years 7 months ago

Download sifaka.cs.uiuc.edu

Integrating information in multiple natural languages is a challenging task that often requires manually created linguistic resources such as a bilingual dictionary or examples of...

Tao Tao, ChengXiang Zhai
