Sciweavers

LAWEB
2003
IEEE
13 years 9 months ago
On the Evolution of Clusters of Near-Duplicate Web Pages
This paper expands on a 1997 study of the amount and distribution of near-duplicate pages on the World Wide Web. We downloaded a set of 150 million web pages on a weekly basis ove...
Dennis Fetterly, Mark Manasse, Marc Najork
LAWEB
2003
IEEE
13 years 9 months ago
The Best Trail Algorithm for Assisted Navigation of Web Sites
We present an algorithm called the Best Trail Algorithm, which helps solve the hypertext navigation problem by automating the construction of memex-like trails through the corpus....
Richard Wheeldon, Mark Levene
LAWEB
2003
IEEE
13 years 9 months ago
Finding Related Hubs and Authorities
Paul-Alexandru Chirita, Daniel Olmedilla, Wolfgang...
LAWEB
2003
IEEE
13 years 9 months ago
Quantitative Analysis of Strategies for Streaming Media Distribution
Streaming media applications, such as live news broadcasts over the web, music, shows, and films, have become more popular in recent years. Traditional client/server...
Marisa A. Vasconcelos, Leonardo C. da Rocha, Julia...
LAWEB
2003
IEEE
13 years 9 months ago
Cooperation Schemes between a Web Server and a Web Search Engine
Search engines provide search results based on a large repository of pages downloaded by a web crawler from several servers. To provide the best results, this repository must be kept ...
Carlos Castillo
LAWEB
2003
IEEE
13 years 9 months ago
Clustering the Chilean Web
We perform a clustering of the Chilean Web Graph using a local fitness measure, optimized by simulated annealing, and compare the obtained cluster distribution to that of two mod...
Satu Virtanen
LAWEB
2003
IEEE
13 years 9 months ago
Cooperative Crawling
Web crawler design presents many different challenges: architecture, strategies, performance and more. One of the most important research topics concerns improving the selection o...
Marina Buzzi
LAWEB
2003
IEEE
13 years 9 months ago
A Semantic Matching Method for Clustering Traders in B2B Systems
Ricardo Ferraz Tomaz, Sofiane Labidi, Bernardo Wan...
LAWEB
2003
IEEE
13 years 9 months ago
Storing RDF as a Graph
RDF is the first W3C standard for enriching information resources of the Web with detailed metadata. The semantics of RDF data is defined using an RDF schema. The most expressiv...
Valerie Bönström, Annika Hinze, Heinz Sc...
LAWEB
2003
IEEE
13 years 9 months ago
Collaborative Learning and Creative Writing
CSCL software tools must provide support for group work and should be based on a collaborative learning technique. The PBL-based CCCuento tool is introduced here. It is intended t...
Luis A. Guerrero, Boris Mejías, Césa...