Search Sciweavers | Sciweavers

563 search results - page 15 / 113

» Crawling the web for structured documents

160

Voted

TKDE
2002

111views more TKDE 2002»

Query Relaxation by Structure and Semantics for Retrieval of Logical Web Documents

15 years 5 months ago

Download www.public.asu.edu

Since WWW encourages hypertext and hypermedia document authoring (e.g. HTML or XML), Web authors tend to create documents that are composed of multiple pages connected with hyperl...

Wen-Syan Li, K. Selçuk Candan, Quoc Vu, Div...

claim paper

Read More »

176

click to vote

CIT
2005
Springer

226views Information Technology» more CIT 2005»

Simple Classification into Large Topic Ontology of Web Documents

15 years 5 months ago

Download eprints.pascal-network.org

The paper presents an approach to classifying Web documents into large topic ontology. The main emphasis is on having a simple approach appropriate for handling a large ontology an...

Marko Grobelnik, Dunja Mladenic

claim paper

Read More »

158

click to vote

ECIR
2006
Springer

134views Information Technology» more ECIR 2006»

Automatic Document Organization in a P2P Environment

15 years 7 months ago

Download ir.shef.ac.uk

Abstract. This paper describes an efficient method to construct reliable machine learning applications in peer-to-peer (P2P) networks by building ensemble based meta methods. We co...

Stefan Siersdorfer, Sergej Sizov

claim paper

Read More »

136

click to vote

LAWEB
2003
IEEE

96views Internet Technology» more LAWEB 2003»

On the Evolution of Clusters of Near-Duplicate Web Pages

15 years 11 months ago

Download research.microsoft.com

This paper expands on a 1997 study of the amount and distribution of near-duplicate pages on the World Wide Web. We downloaded a set of 150 million web pages on a weekly basis ove...

Dennis Fetterly, Mark Manasse, Marc Najork

claim paper

Read More »

159

click to vote

WIDM
2006
ACM

148views Internet Technology» more WIDM 2006»

Coarse-grained classification of web sites by their structural properties

15 years 12 months ago

Download rvs.informatik.uni-leipzig.de

In this paper, we identify and analyze structural properties which reflect the functionality of a Web site. These structural properties consider the size, the organization, the co...

Christoph Lindemann, Lars Littig

claim paper

Read More »

« Prev « First page 15 / 113 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers