Sciweavers

4468 search results - page 516 / 894
» Processing Forecasting Queries
Sort
View
ICDE
2010
IEEE
204views Database» more  ICDE 2010»
15 years 11 months ago
ProbClean: A probabilistic duplicate detection system
— One of the most prominent data quality problems is the existence of duplicate records. Current data cleaning systems usually produce one clean instance (repair) of the input da...
George Beskales, Mohamed A. Soliman, Ihab F. Ilyas...
ICWE
2009
Springer
15 years 11 months ago
On Using Distributed Extended XQuery for Web Data Sources as Services
DeXIN (Distributed extended XQuery for data INtegration) integrates multiple, heterogeneous, highly distributed and rapidly changing web data sources in different formats, e.g. XML...
Muhammad Intizar Ali, Reinhard Pichler, Hong Linh ...
RECSYS
2009
ACM
15 years 11 months ago
A partial-order based active cache for recommender systems
Recommender systems aim to substantially reduce information overload by suggesting lists of similar items that users may find interesting. Caching has been a useful technique for...
Umar Qasim, Vincent Oria, Yi-fang Brook Wu, Michae...
DASFAA
2008
IEEE
118views Database» more  DASFAA 2008»
15 years 11 months ago
RAIN: Always on Data Warehousing
Abstract. The Redundant Arrays of Inexpensive DWS Nodes (RAIN) technique is a node-level data replication approach that introduces failover capabilities to DWS (Data Warehouse Stri...
Jorge Vieira, Marco Vieira, Marco Costa, Henrique ...
SISAP
2008
IEEE
147views Data Mining» more  SISAP 2008»
15 years 11 months ago
An Empirical Evaluation of a Distributed Clustering-Based Index for Metric Space Databases
Similarity search has been proved suitable for searching in very large collections of unstructured data objects. We are interested in efficient parallel query processing under si...
Veronica Gil Costa, Mauricio Marín, Nora Re...