Sciweavers

23 search results - page 4 / 5
» laweb 2003
LAWEB
2003
IEEE
15 years 2 months ago
Storing RDF as a Graph
RDF is the first W3C standard for enriching information resources of the Web with detailed metadata. The semantics of RDF data is defined using an RDF schema. The most expressiv...
Valerie Bönström, Annika Hinze, Heinz Sc...
LAWEB
2003
IEEE
15 years 2 months ago
Cooperative Crawling
Web crawler design presents many different challenges: architecture, strategies, performance and more. One of the most important research topics concerns improving the selection o...
Marina Buzzi
LAWEB
2003
IEEE
15 years 2 months ago
On the Evolution of Clusters of Near-Duplicate Web Pages
This paper expands on a 1997 study of the amount and distribution of near-duplicate pages on the World Wide Web. We downloaded a set of 150 million web pages on a weekly basis ove...
Dennis Fetterly, Mark Manasse, Marc Najork
LAWEB
2003
IEEE
15 years 2 months ago
Finding Buying Guides with a Web Carnivore
Research on buying behavior indicates that buying guides perform an important role in the overall buying process. However, while many buying guides can be found on the Web, findin...
Reiner Kraft, Raymie Stata
LAWEB
2003
IEEE
15 years 2 months ago
Syntactic Similarity of Web Documents
This paper presents and compares two methods for evaluating the syntactic similarity between documents. The first method uses the Patricia tree, constructed from the original doc...
Álvaro R. Pereira Jr., Nivio Ziviani