Sciweavers

52 search results - page 3 / 11
» Automatic Hidden Web Database Classification
AIRWEB
2006
Springer
13 years 9 months ago
Tracking Web Spam with Hidden Style Similarity
Automatically generated content is ubiquitous in the web: dynamic sites built using the three-tier paradigm are good examples (e.g. commercial sites, blogs and other sites powered...
Tanguy Urvoy, Thomas Lavergne, Pascal Filoche
SAC
2005
ACM
13 years 10 months ago
Pollock: automatic generation of virtual web services from web sites
As the usage of Web Services proliferates dramatically, new tools to help quickly generate web services are needed. In this paper, we propose a methodology that helps to automatic...
Yi-Hsuan Lu, Yoojin Hong, Jinesh Varia, Dongwon Le...
ICDM
2005
IEEE
13 years 11 months ago
Improving Automatic Query Classification via Semi-Supervised Learning
Accurate topical classification of user queries allows for increased effectiveness and efficiency in general-purpose web search systems. Such classification becomes critical if th...
Steven M. Beitzel, Eric C. Jensen, Ophir Frieder, ...
WWW
2001
ACM
14 years 6 months ago
Crawling the Hidden Web
Current-day crawlers retrieve content only from the publicly indexable Web, i.e., the set of Web pages reachable purely by following hypertext links, ignoring search forms and pag...
Sriram Raghavan, Hector Garcia-Molina
SIGMOD
2004
ACM
14 years 5 months ago
Understanding Web Query Interfaces: Best-Effort Parsing with Hidden Syntax
Recently, the Web has been rapidly "deepened" by many searchable databases online, where data are hidden behind query forms. For modelling and integrating Web databases,...
Zhen Zhang, Bin He, Kevin Chen-Chuan Chang