Search Sciweavers | Sciweavers

43 search results - page 8 / 9

» Crawling the Content Hidden Behind Web Forms

KDD
2008
ACM

183 views, Data Mining

De-duping URLs via rewrite rules

14 years 6 months ago

Download research.yahoo.com

A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...

Anirban Dasgupta, Ravi Kumar, Amit Sasturkar

CIKM
2009
Springer

165 views, Information Technology

An empirical study on using hidden Markov model for search interface segmentation

13 years 10 months ago

Download www.pages.drexel.edu

This paper describes a hidden Markov model (HMM) based approach to perform search interface segmentation. Automatic processing of an interface is a must to access the invisible co...

Ritu Khare, Yuan An

SSDBM
2008
IEEE

149 views, Database

Query Planning for Searching Inter-dependent Deep-Web Databases

14 years 4 days ago

Download www.cse.ohio-state.edu

Increasingly, many data sources appear as online databases, hidden behind query forms, thus forming what is referred to as the deep web. It is desirable to have systems that can pr...

Fan Wang, Gagan Agrawal, Ruoming Jin

IPM
2007

156 views

p2pDating: Real life inspired semantic overlay networks for Web search

13 years 5 months ago

Download hdir2005.isti.cnr.it

We consider a network of autonomous peers forming a logically global but physically distributed search engine, where every peer has its own local collection generated by independe...

Josiane Xavier Parreira, Sebastian Michel, Gerhard...

TOIS
2008

145 views

Classification-aware hidden-web text database selection

13 years 5 months ago

Download archive.nyu.edu

Many valuable text databases on the web have non-crawlable contents that are "hidden" behind search interfaces. Metasearchers are helpful tools for searching over multip...

Panagiotis G. Ipeirotis, Luis Gravano
