Sciweavers

642 search results - page 53 / 129
» Automatic Wrapper Generation for Web Search Engines
GCC
2005
Springer
15 years 3 months ago
Parallel Web Spiders for Cooperative Information Gathering
Web spider is a widely used approach to obtain information for search engines. As the size of the Web grows, it becomes a natural choice to parallelize the spider’s crawling proc...
Jiewen Luo, Zhongzhi Shi, Maoguang Wang, Wei Wang
SIGIR
2004
ACM
15 years 3 months ago
Query-related data extraction of hidden web documents
The larger amount of information on the Web is stored in document databases and is not indexed by general-purpose search engines (i.e., Google and Yahoo). Such information is dyna...
Yih-Ling Hedley, Muhammad Younas, Anne E. James, M...
WSDM
2010
ACM
15 years 7 months ago
SBotMiner: Large Scale Search Bot Detection
In this paper, we study search bot traffic from search engine query logs at a large scale. Although bots that generate search traffic aggressively can be easily detected, a large ...
Fang Yu, Yinglian Xie, Qifa Ke
ESEC
1999
Springer
15 years 2 months ago
Components and Generative Programming
This paper is about a paradigm shift from the current practice of manually searching for and adapting components and their manual assembly to Generative Programming, which is the a...
Krzysztof Czarnecki, Ulrich W. Eisenecker
IUI
2009
ACM
15 years 6 months ago
Interactive multimodal transcription of text images using a web-based demo system
This document introduces a web based demo of an interactive framework for transcription of handwritten text, where the user feedback is provided by means of pen strokes on a touch...
Verónica Romero, Luis A. Leiva, Alejandro H...