Sciweavers

STACS
2009
Springer
13 years 11 months ago
A Comparison of Techniques for Sampling Web Pages
As the World Wide Web is growing rapidly, it is getting increasingly challenging to gather representative information about it. Instead of crawling the web exhaustively one has to...
Eda Baykan, Monika Rauch Henzinger, Stefan F. Kell...
DEXA
2011
Springer
12 years 4 months ago
Sampling the National Deep Web
A huge portion of today’s Web consists of web pages filled with information from myriads of online databases. This part of the Web, known as the deep Web, is to date relatively ...
Denis Shestakov
SAC
2010
ACM
13 years 2 months ago
Host-IP clustering technique for deep web characterization
A huge portion of today’s Web consists of web pages filled with information from myriads of online databases. This part of the Web, known as the deep Web, is to date relatively ...
Denis Shestakov, Tapio Salakoski
SYNASC
2006
IEEE
13 years 10 months ago
HTML Pattern Generator--Automatic Data Extraction from Web Pages
Existing methods of information extraction from HTML documents include the manual approach, supervised learning, and automatic techniques. The manual method has high precision and reca...
Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...
WEBDB
2005
Springer
13 years 10 months ago
An Evaluation and Comparison of Current Peer-to-Peer Full-Text Keyword Search Techniques
Current peer-to-peer (p2p) full-text keyword search techniques fall into the following categories: document-based partitioning, keyword-based partitioning, hybrid indexing, and se...
Ming Zhong, Justin Moore, Kai Shen, Amy L. Murphy