Sciweavers

DEXA
2011
Springer
234views Database» more  DEXA 2011»
12 years 4 months ago
Learning Top-k Transformation Rules
Record linkage identifies multiple records referring to the same entity even if they are not bit-wise identical. It is thus an essential technology for data integration and data c...
Sunanda Patro, Wei Wang
AEPIA
2002
159views more  AEPIA 2002»
13 years 4 months ago
Analysing Rough Sets weighting methods for Case-Based Reasoning Systems
Case-Based Reasoning systems retrieve cases using a similarity function based on the K-NN or some derivatives. These functions are sensitive to irrelevant, interacting or noisy fe...
Maria Salamó, Elisabet Golobardes
JOI
2007
99views more  JOI 2007»
13 years 4 months ago
Measuring quality of similarity functions in approximate data matching
This paper presents a method for assessing the quality of similarity functions. The scenario taken into account is that of approximate data matching, in which it is necessary to d...
Roberto da Silva, Raquel Kolitski Stasiu, Viviane ...
ICDE
2010
IEEE
200views Database» more  ICDE 2010»
13 years 4 months ago
Towards better entity resolution techniques for Web document collections
— As person names are non-unique, the same name on different Web pages might or might not refer to the same real-world person. This entity identification problem is one of the m...
Surender Reddy Yerva, Zoltán Miklós,...
GECCO
2008
Springer
116views Optimization» more  GECCO 2008»
13 years 5 months ago
Evolving similarity functions for code plagiarism detection
Students are often asked to submit electronic copies of their program code as part of assessment in computer science courses. To counter code plagiarism, educational institutions ...
Victor Ciesielski, Nelson Wu, Seyed M. M. Tahaghog...
NIPS
2007
13 years 5 months ago
The Distribution Family of Similarity Distances
Assessing similarity between features is a key step in object recognition and scene categorization tasks. We argue that knowledge on the distribution of distances generated by sim...
Gertjan J. Burghouts, Arnold W. M. Smeulders, Jan-...
ESWS
2007
Springer
13 years 8 months ago
Imprecise SPARQL: Towards a Unified Framework for Similarity-Based Semantic Web Tasks
This proposal explores a unified framework to solve Semantic Web tasks that often require similarity measures, such as RDF retrieval, ontology alignment, and semantic service match...
Christoph Kiefer
DOCENG
2007
ACM
13 years 8 months ago
XML version detection
The problem of version detection is critical in many important application scenarios, including software clone identification, Web page ranking, plagiarism detection, and peer-to-...
Deise de Brum Saccol, Nina Edelweiss, Renata de Ma...
ICPR
2000
IEEE
13 years 8 months ago
A Unifying View of Image Similarity
We study solutions to the problem of evaluating image similarity in the context of content-based image retrieval (CBIR). Retrieval is formulated as a classification problem, wher...
Nuno Vasconcelos, Andrew Lippman
SSDBM
2010
IEEE
188views Database» more  SSDBM 2010»
13 years 9 months ago
Similarity Estimation Using Bayes Ensembles
Similarity search and data mining often rely on distance or similarity functions in order to provide meaningful results and semantically meaningful patterns. However, standard dist...
Tobias Emrich, Franz Graf, Hans-Peter Kriegel, Mat...