Sciweavers

236 search results - page 25 / 48
» From user-centric web traffic data to usage data
Sort
View
WWW
2007
ACM
15 years 10 months ago
A large-scale study of robots.txt
Search engines largely rely on Web robots to collect information from the Web. Due to the unregulated open-access nature of the Web, robot activities are extremely diverse. Such c...
Yang Sun, Ziming Zhuang, C. Lee Giles
PVLDB
2010
105views more  PVLDB 2010»
14 years 4 months ago
A Probabilistic Approach for Automatically Filling Form-Based Web Interfaces
In this paper we present a proposal for the implementation and evaluation of a novel method for automatically using data-rich text for filling form-based input interfaces. Our sol...
Guilherme A. Toda, Eli Cortez, Altigran Soares da ...
WSC
1998
14 years 11 months ago
Communicating Structures for Modeling Large-scale Systems
ating Structures is a system abstraction that helps to model large-scale distributed systems, whose performance mostly depends on how well the data and messages traffic is organiz...
Vadim E. Kotov
IJCAI
2003
14 years 11 months ago
Information Extraction from Tree Documents by Learning Subtree Delimiters
Information extraction from HTML pages has been conventionally treated as plain text documents extended with HTML tags. However, the growing maturity and correct usage of HTML/XHT...
Boris Chidlovskii
KDD
2005
ACM
153views Data Mining» more  KDD 2005»
15 years 10 months ago
Using retrieval measures to assess similarity in mining dynamic web clickstreams
While scalable data mining methods are expected to cope with massive Web data, coping with evolving trends in noisy data in a continuous fashion, and without any unnecessary stopp...
Olfa Nasraoui, Cesar Cardona, Carlos Rojas