Sciweavers

23 search results - page 1 / 5
» Query Selection Techniques for Efficient Crawling of Structu...
Sort
View
ICDE
2006
IEEE
146views Database» more  ICDE 2006»
14 years 5 months ago
Query Selection Techniques for Efficient Crawling of Structured Web Sources
The high quality, structured data from Web structured sources is invaluable for many applications. Hidden Web databases are not directly crawlable by Web search engines and are on...
Ping Wu, Ji-Rong Wen, Huan Liu, Wei-Ying Ma
ICDM
2008
IEEE
186views Data Mining» more  ICDM 2008»
13 years 10 months ago
xCrawl: A High-Recall Crawling Method for Web Mining
Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...
Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...
WEBI
2009
Springer
13 years 11 months ago
Learning Deep Web Crawling with Diverse Features
—The key to Deep Web crawling is to submit promising keywords to query form and retrieve Deep Web content efficiently. To select keywords, existing methods make a decision based ...
Lu Jiang, Zhaohui Wu, Qinghua Zheng, Jun Liu
WWW
2005
ACM
14 years 5 months ago
Three-level caching for efficient query processing in large Web search engines
Large web search engines have to answer thousands of queries per second with interactive response times. Due to the sizes of the data sets involved, often in the range of multiple...
Xiaohui Long, Torsten Suel
SIGMOD
2006
ACM
232views Database» more  SIGMOD 2006»
14 years 4 months ago
To search or to crawl?: towards a query optimizer for text-centric tasks
Text is ubiquitous and, not surprisingly, many important applications rely on textual data for a variety of tasks. As a notable example, information extraction applications derive...
Panagiotis G. Ipeirotis, Eugene Agichtein, Pranay ...