Sciweavers

298 search results - page 22 / 60
» An information-theoretic measure for document similarity
Sort
View
CIKM
2000
Springer
15 years 4 months ago
A Semi-Supervised Document Clustering Technique for Information Organization
This paper discusses a new type of semi-supervised document clustering that uses partial supervision to partition a large set of documents. Most clustering methods organizes docum...
Han-joon Kim, Sang-goo Lee
CLEF
2004
Springer
15 years 5 months ago
IR-n r2: Using Normalized Passages
This paper describes the fourth participation of IR-n system (Alicante University) at CLEF conferences. At present conference, we have modified the similarity measure and the que...
Fernando Llopis, Rafael Muñoz, Rafael M. Te...
SIGIR
2006
ACM
15 years 5 months ago
Contextual search and name disambiguation in email using graphs
Similarity measures for text have historically been an important tool for solving information retrieval problems. In many interesting settings, however, documents are often closel...
Einat Minkov, William W. Cohen, Andrew Y. Ng
EDBT
2009
ACM
277views Database» more  EDBT 2009»
15 years 4 months ago
G-hash: towards fast kernel-based similarity search in large graph databases
Structured data including sets, sequences, trees and graphs, pose significant challenges to fundamental aspects of data management such as efficient storage, indexing, and simila...
Xiaohong Wang, Aaron M. Smalter, Jun Huan, Gerald ...
ALMOB
2008
131views more  ALMOB 2008»
14 years 12 months ago
Fast algorithms for computing sequence distances by exhaustive substring composition
The increasing throughput of sequencing raises growing needs for methods of sequence analysis and comparison on a genomic scale, notably, in connection with phylogenetic tree reco...
Alberto Apostolico, Olgert Denas