Sciweavers

11 search results - page 1 / 3
» Entity resolution with iterative blocking
Sort
View
SIGMOD
2009
ACM
137views Database» more  SIGMOD 2009»
14 years 4 months ago
Entity resolution with iterative blocking
Entity Resolution (ER) is the problem of identifying which records in a database refer to the same real-world entity. An exhaustive ER process involves computing the similarities b...
Steven Euijong Whang, David Menestrina, Georgia Ko...
CORR
2010
Springer
143views Education» more  CORR 2010»
13 years 4 months ago
Parallel Sorted Neighborhood Blocking with MapReduce
: Cloud infrastructures enable the efficient parallel execution of data-intensive tasks such as entity resolution on large datasets. We investigate challenges and possible solution...
Lars Kolb, Andreas Thor, Erhard Rahm
WSDM
2012
ACM
329views Data Mining» more  WSDM 2012»
12 years 2 days ago
Beyond 100 million entities: large-scale blocking-based resolution for heterogeneous data
A prerequisite for leveraging the vast amount of data available on the Web is Entity Resolution, i.e., the process of identifying and linking data that describe the same real-worl...
George Papadakis, Ekaterini Ioannou, Claudia Niede...
CIKM
2011
Springer
12 years 4 months ago
Block-based load balancing for entity resolution with MapReduce
The effectiveness and scalability of MapReduce-based implementations of complex data-intensive tasks depend on an even redistribution of data between map and reduce tasks. In the...
Lars Kolb, Andreas Thor, Erhard Rahm
GD
2005
Springer
13 years 10 months ago
D-Dupe: An Interactive Tool for Entity Resolution in Social Networks
Visualizing and analyzing social networks is a challenging problem that has been receiving growing attention. An important first step, before analysis can begin, is ensuring that...
Mustafa Bilgic, Louis Licamele, Lise Getoor, Ben S...