Sciweavers

298 search results - page 11 / 60
» An information-theoretic measure for document similarity
Sort
View
IDEAL
2000
Springer
15 years 5 months ago
Clustering by Similarity in an Auxiliary Space
Abstract. We present a clustering method for continuous data. It defines local clusters into the (primary) data space but derives its similarity measure from the posterior distribu...
Janne Sinkkonen, Samuel Kaski
RIVF
2007
15 years 3 months ago
Disambiguation of People in Web Search Using a Knowledge Base
— Results of queries by personal names often contain documents related to several people because of the namesake problem. In order to differentiate documents related to different...
Quang Minh Vu, Tomonari Masada, Atsuhiro Takasu, J...
102
Voted
HICSS
2006
IEEE
118views Biometrics» more  HICSS 2006»
15 years 8 months ago
Quantitative Measures for Evaluating Knowledge Network Node Clusters: Preliminary Results
One viewpoint of a knowledge network is a knowledge map that clusters similar knowledge sources into knowledge domains. What is needed is an automatic mapping tool that 1) takes t...
Mark Pendergast, Richard Orwig
SIGIR
2010
ACM
15 years 5 months ago
Adaptive near-duplicate detection via similarity learning
In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...
Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz
92
Voted
MM
2005
ACM
134views Multimedia» more  MM 2005»
15 years 7 months ago
Formulating context-dependent similarity functions
Tasks of information retrieval depend on a good distance function for measuring similarity between data instances. The most effective distance function must be formulated in a con...
Gang Wu, Edward Y. Chang, Navneet Panda