Search Sciweavers | Sciweavers

563 search results - page 32 / 113

» Crawling the web for structured documents

160

click to vote

TREC
2004

127views Information Technology» more TREC 2004»

Language Models for Searching in Web Corpora

15 years 7 months ago

Download trec.nist.gov

: We describe our participation in the TREC 2004 Web and Terabyte tracks. For the web track, we employ mixture language models based on document full-text, incoming anchortext, and...

Jaap Kamps, Gilad Mishne, Maarten de Rijke

claim paper

Read More »

169

click to vote

KDD
2007
ACM

231views Data Mining» more KDD 2007»

Xproj: a framework for projected structural clustering of xml documents

16 years 6 months ago

Download www.cs.rpi.edu

XML has become a popular method of data representation both on the web and in databases in recent years. One of the reasons for the popularity of XML has been its ability to encod...

Charu C. Aggarwal, Na Ta, Jianyong Wang, Jianhua F...

claim paper

Read More »

158

click to vote

CIKM
2007
Springer

134views Information Technology» more CIKM 2007»

Effective top-k computation in retrieving structured documents with term-proximity support

16 years 7 days ago

Download sewm.pku.edu.cn

Modern web search engines are expected to return top-k results efficiently given a query. Although many dynamic index pruning strategies have been proposed for efficient top-k com...

Mingjie Zhu, Shuming Shi, Mingjing Li, Ji-Rong Wen

claim paper

Read More »

150

click to vote

VRML
1995
ACM

129views Internet Technology» more VRML 1995»

Visualizing the Structure of the World Wide Web in 3D Hyperbolic Space

15 years 9 months ago

Download graphics.stanford.edu

We visualize the structure of sections of the World Wide Web by constructing graphical representations in 3D hyperbolic space. The felicitous property that hyperbolic space has �...

Tamara Munzner, Paul Burchard

claim paper

Read More »

264

click to vote

PREMI
2011
Springer

216views Pattern Recognition» more PREMI 2011»

Finding Potential Seeds through Rank Aggregation of Web Searches

14 years 9 months ago

Download www.idi.ntnu.no

This paper presents a potential seed selection algorithm for web crawlers using a gain - share scoring approach. Initially we consider a set of arbitrarily chosen tourism queries. ...

Rajendra Prasath, Pinar Öztürk

claim paper

Read More »

« Prev « First page 32 / 113 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers