Sciweavers

FAST
2011
12 years 8 months ago
A Study of Practical Deduplication
We collected file system content data from 857 desktop computers at Microsoft over a span of 4 weeks. We analyzed the data to determine the relative efficacy of data deduplication...
Dutch T. Meyer, William J. Bolosky
KDD
2008
ACM
14 years 4 months ago
Febrl: an open source data cleaning, deduplication and record linkage system with a graphical user interface
Matching records that refer to the same entity across databases is becoming an increasingly important part of many data mining projects, as often data from multiple sources needs ...
Peter Christen
SIGMOD
2009
ACM
14 years 4 months ago
A grammar-based entity representation framework for data cleaning
Fundamental to data cleaning is the need to account for multiple data representations. We propose a formal framework that can be used to reason about and manipulate data represent...
Arvind Arasu, Raghav Kaushik
IEEECIT
2010
IEEE
13 years 2 months ago
Study on Evaluation Method of Practical Teaching: A Practical Teaching Test Analysis
Practical teaching is key to training students in practical ability and innovative consciousness. However, evaluation standards for practical teaching remain unclear. By analyzin...
Haiwei Jin
HICSS
2006
IEEE
13 years 10 months ago
A Case Study of a Longstanding Online Community of Practice Involving Critical Care and Advanced Practice Nurses
The aims of this study are: (1) to examine to what extent critical care and advanced practice nurses’ participation in an online listserv constituted a community of practice, an...
Noriko Hara, Khe Foon Hew