Sciweavers

DEBU
2000
XJoin: A Reactively-Scheduled Pipelined Join Operator
Wide-area distribution raises significant performance problems for traditional query processing techniques as data access becomes less predictable due to link congestion, load imb...
Tolga Urhan, Michael J. Franklin
DEBU
2000
Database Design for Real-World E-Commerce Systems
This paper discusses the structure and components of databases for real-world e-commerce systems. We first present an integrated 8-process value chain needed by the e-commerce sys...
Il-Yeol Song, Kyu-Young Whang
DEBU
2000
Data Cleaning: Problems and Current Approaches
We classify data quality problems that are addressed by data cleaning and provide an overview of the main solution approaches. Data cleaning is especially required when integratin...
Erhard Rahm, Hong Hai Do
DEBU
2000
Matching Algorithms within a Duplicate Detection System
Detecting database records that are approximate duplicates, but not exact duplicates, is an important task. Databases may contain duplicate records concerning the same real-world ...
Alvaro E. Monge
DEBU
2000
What do the Neighbours Think? Computing Web Page Reputations
The textual content of the Web enriched with the hyperlink structure surrounding it can be a useful source of information for querying and searching. This paper presents a search ...
Alberto O. Mendelzon, Davood Rafiei
DEBU
2000
Context in Web Search
Steve Lawrence
DEBU
2000
95views more  DEBU 2000»
13 years 4 months ago
Accurately and Reliably Extracting Data from the Web: A Machine Learning Approach
A critical problem in developing information agents for the Web is accessing data that is formatted for human use. We have developed a set of tools for extracting data from web si...
Craig A. Knoblock, Kristina Lerman, Steven Minton,...
DEBU
2000
Next Generation Web Search: Setting Our Sites
The current state of web search is most successful at directing users to appropriate web sites. Once at the site, the user has a choice of following hyperlinks or using site searc...
Marti A. Hearst
DEBU
2000
Adaptive Query Processing for Internet Applications
As the area of data management for the Internet has gained in popularity, recent work has focused on effectively dealing with unpredictable, dynamic data volumes and transfer rate...
Zachary G. Ives, Alon Y. Levy, Daniel S. Weld, Dan...
DEBU
2000
Link Analysis in Web Information Retrieval
The analysis of the hyperlink structure of the web has led to significant improvements in web information retrieval. This survey describes two successful link analysis algorithms ...
Monika Rauch Henzinger