Sciweavers

101 search results - page 7 / 21
» First-order focused crawling
Sort
View
143
Voted
IR
2008
15 years 13 days ago
Focused web crawling in the acquisition of comparable corpora
CLIR resources, such as dictionaries and parallel corpora, are scarce for special domains. Obtaining comparable corpora automatically for such domains could be an answer to this p...
Tuomas Talvensaari, Ari Pirkola, Kalervo Järv...
101
Voted
SAC
2003
ACM
15 years 5 months ago
Ontology-Focused Crawling of Web Documents
The Web, the largest unstructured database of the world, has greatly improved access to documents. However, documents on the Web are largely disorganized. Due to the distributed n...
Marc Ehrig, Alexander Maedche
80
Voted
ICML
2003
IEEE
16 years 1 months ago
Evolving Strategies for Focused Web Crawling
Judy Johnson, Kostas Tsioutsiouliklis, C. Lee Gile...
84
Voted
WWW
2006
ACM
16 years 1 months ago
Focused crawling: experiences in a real world project
Antonio Badia, Tulay Muezzinoglu, Olfa Nasraoui
ICDM
2008
IEEE
186views Data Mining» more  ICDM 2008»
15 years 6 months ago
xCrawl: A High-Recall Crawling Method for Web Mining
Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...
Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...