Sciweavers

1014 search results - page 18 / 203
» Using Keyword Extraction for Web Site Clustering
Sort
View
62
Voted
WWW
2006
ACM
15 years 10 months ago
GoGetIt!: a tool for generating structure-driven web crawlers
We present GoGetIt!, a tool for generating structure-driven crawlers that requires a minimum effort from the users. The tool takes as input a sample page and an entry point to a W...
Altigran Soares da Silva, Edleno Silva de Moura, J...
75
Voted
VLDB
2004
ACM
121views Database» more  VLDB 2004»
15 years 3 months ago
An Automatic Data Grabber for Large Web Sites
We demonstrate a system to automatically grab data from data intensive web sites. The system first infers a model that describes at the intensional level the web site as a collec...
Valter Crescenzi, Giansalvatore Mecca, Paolo Meria...
77
Voted
WIDM
2003
ACM
15 years 2 months ago
Schema-guided wrapper maintenance for web-data extraction
Extracting data from Web pages using wrappers is a fundamental problem arising in a large variety of applications of vast practical interests. There are two main issues relevant t...
Xiaofeng Meng, Dongdong Hu, Chen Li
TMM
2002
140views more  TMM 2002»
14 years 9 months ago
Narrowing the semantic gap - improved text-based web document retrieval using visual features
In this paper, we present the results of our work that seek to negotiate the gap between low-level features and high-level concepts in the domain of web document retrieval. This wo...
Rong Zhao, William I. Grosky
DMKD
2000
ACM
165views Data Mining» more  DMKD 2000»
15 years 2 months ago
On Mining Web Access Logs
The proliferation of information on the world wide web has made the personalization of this information space a necessity. One possible approach to web personalization is to mine ...
Anupam Joshi, Raghu Krishnapuram