Search Sciweavers | Sciweavers

128 search results - page 1 / 26

» Scaling up duplicate detection in graph data

CIKM
2008
Springer

146 views, Information Technology

Scaling up duplicate detection in graph data

13 years 6 months ago

Download www.hpi.uni-potsdam.de

Duplicate detection determines different representations of real-world objects in a database. Recent research has considered the use of relationships among object representations t...

Melanie Herschel, Felix Naumann

KDD
2012
ACM

271 views, Data Mining

GigaTensor: scaling tensor analysis up by 100 times - algorithms and discoveries

11 years 7 months ago

Download www.cs.cmu.edu

Many data are modeled as tensors, or multi-dimensional arrays. Examples include the predicates (subject, verb, object) in knowledge bases, hyperlinks and anchor texts in the Web g...

U. Kang, Evangelos E. Papalexakis, Abhay Harpale, ...

ICDE
2003
IEEE

159 views, Database

Scaling up the ALIAS Duplicate Elimination System

14 years 6 months ago

Download www.it.iitb.ac.in

Duplicate elimination is an important stage in integrating data from multiple sources. The challenges involved are finding a robust deduplication function that can identify when t...

Sunita Sarawagi, Alok Kirpal

OOPSLA
2005
Springer

203 views, Security Privacy

SDD: high performance code clone detection system for large scale source code

13 years 10 months ago

Download www.cs.toronto.edu

Code clones in software increase maintenance cost and lower software quality. We have devised a new algorithm to detect duplicated parts of source code in large software. Our algo...

Seunghak Lee, Iryoung Jeong

WWW
2008
ACM

214 views, Internet Technology

Efficient similarity joins for near duplicate detection

14 years 5 months ago

Download www2008.org

With the increasing amount of data and the need to integrate data from multiple data sources, a challenging issue is to find near duplicate records efficiently. In this paper, we ...

Chuan Xiao, Wei Wang 0011, Xuemin Lin, Jeffrey Xu ...
