Search Sciweavers | Sciweavers

101 search results - page 7 / 21

» First-order focused crawling

IR
2008

189 views, Natural Language Processing

Focused web crawling in the acquisition of comparable corpora

15 years 5 months ago

Download www.info.uta.fi

CLIR resources, such as dictionaries and parallel corpora, are scarce for special domains. Obtaining comparable corpora automatically for such domains could be an answer to this p...

Tuomas Talvensaari, Ari Pirkola, Kalervo Järv...

SAC
2003
ACM

133 views, Applied Computing

Ontology-Focused Crawling of Web Documents

15 years 10 months ago

Download dspc11.cs.ccu.edu.tw

The Web, the largest unstructured database of the world, has greatly improved access to documents. However, documents on the Web are largely disorganized. Due to the distributed n...

Marc Ehrig, Alexander Maedche

ICML
2003
IEEE

180 views, Machine Learning

Evolving Strategies for Focused Web Crawling

16 years 5 months ago

Download clgiles.ist.psu.edu

Judy Johnson, Kostas Tsioutsiouliklis, C. Lee Gile...

WWW
2006
ACM

95 views, Internet Technology

Focused crawling: experiences in a real world project

16 years 5 months ago

Download www2006.org

Antonio Badia, Tulay Muezzinoglu, Olfa Nasraoui

ICDM
2008
IEEE

186 views, Data Mining

xCrawl: A High-Recall Crawling Method for Web Mining

15 years 11 months ago

Download ls13-www.cs.uni-dortmund.de

Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...

Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...
