Search Sciweavers | Sciweavers

» Industry-scale duplicate detection

WWW
2004
ACM

Internet Technology

Web data integration using approximate string join

Download www.iw3c2.org

Web data integration is an important preprocessing step for web mining. It is highly likely that several records on the web whose textual representations differ may represent the ...

Yingping Huang, Gregory R. Madey

ICC
2008
IEEE

Communications

A New Replay Attack Against Anonymous Communication Networks

Download www.cs.uml.edu

Tor is a real-world, circuit-based low-latency anonymous communication network, supporting TCP applications on the Internet. In this paper, we present a new class of at...

Ryan Pries, Wei Yu, Xinwen Fu, Wei Zhao

ITC
2003
IEEE

Hardware

Cost-Effective Approach for Reducing Soft Error Failure Rate in Logic Circuits

Download www.ece.rice.edu

In this paper, a new paradigm for designing logic circuits with concurrent error detection (CED) is described. The key idea is to exploit the asymmetric soft error susceptibility ...

Kartik Mohanram, Nur A. Touba

FAST
2010

Operating System

HydraFS: A High-Throughput File System for the HYDRAstor Content-Addressable Storage System

Download www.usenix.org

A content-addressable storage (CAS) system is a valuable tool for building storage solutions, providing efficiency by automatically detecting and eliminating duplicate blocks; it ...

Cristian Ungureanu, Benjamin Atkin, Akshat Aranya,...

IJCAI
2003

Artificial Intelligence

Employing Trainable String Similarity Metrics for Information Integration

Download www.isi.edu

The problem of identifying approximately duplicate objects in databases is an essential step for the information integration process. Most existing approaches have relied on gener...

Mikhail Bilenko, Raymond J. Mooney
