Search Sciweavers | Sciweavers

23 search results - page 2 / 5

» Focused web crawling in the acquisition of comparable corpor...

ICDM
2008
IEEE

186 views, Data Mining

xCrawl: A High-Recall Crawling Method for Web Mining

13 years 11 months ago

Download ls13-www.cs.uni-dortmund.de

Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...

Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...

ADMA
2009
Springer

142 views, Data Mining

Crawling Deep Web Using a New Set Covering Algorithm

13 years 11 months ago

Download cs.uwindsor.ca

Abstract. Crawling the deep web often requires the selection of an appropriate set of queries so that they can cover most of the documents in the data source with low cost. This ca...

Yan Wang, Jianguo Lu, Jessica Chen

ERCIMDL
2003
Springer

106 views, Education

Topical Crawling for Business Intelligence

13 years 10 months ago

Download dollar.biz.uiowa.edu

Abstract. The Web provides us with a vast resource for business intelligence. However, the large size of the Web and its dynamic nature make the task of foraging appropriate inform...

Gautam Pant, Filippo Menczer

KCAP
2005
ACM

119 views, Information Technology

Collecting paraphrase corpora from volunteer contributors

13 years 10 months ago

Download www.openmind.org

Extensive and deep paraphrase corpora are important for a variety of natural language processing and user interaction tasks. In this paper, we present an approach which i) collect...

Timothy Chklovski

COLING
2010

156 views, Computational Linguistics

Automatic Acquisition of Lexical Formality

12 years 11 months ago

Download ftp.cs.toronto.edu

There has been relatively little work focused on determining the formality level of individual lexical items. This study applies information from large mixed-genre corpora, demonst...

Julian Brooke, Tong Wang, Graeme Hirst
