Sciweavers

63 search results - page 1 / 13
» Phrase-based Document Similarity Based on an Index Graph Mod...
Sort
View
TKDE
2008
175views more  TKDE 2008»
13 years 4 months ago
Efficient Phrase-Based Document Similarity for Clustering
Phrase has been considered as a more informative feature term for improving the effectiveness of document clustering. In this paper, we propose a phrase-based document similarity t...
Hung Chim, Xiaotie Deng
ICDM
2002
IEEE
162views Data Mining» more  ICDM 2002»
13 years 9 months ago
Phrase-based Document Similarity Based on an Index Graph Model
Document clustering techniques mostly rely on single term analysis of the document data set, such as the Vector Space Model. To better capture the structure of documents, the unde...
Khaled M. Hammouda, Mohamed S. Kamel
DEXAW
2008
IEEE
120views Database» more  DEXAW 2008»
13 years 11 months ago
Self-Similarity Metric for Index Pruning in Conceptual Vector Space Models
— One of the critical issues in search engines is the size of search indexes: as the number of documents handled by an engine increases, the search must preserve its efficiency,...
Dario Bonino, Fulvio Corno
SAC
2011
ACM
12 years 7 months ago
Biomedical concept extraction based on combining the content-based and word order similarities
It is well known that the main objective of conceptual retrieval models is to go beyond simple term matching by relaxing term independence assumption through concept recognition. ...
Duy Dinh, Lynda Tamine
EDBT
2009
ACM
277views Database» more  EDBT 2009»
13 years 9 months ago
G-hash: towards fast kernel-based similarity search in large graph databases
Structured data including sets, sequences, trees and graphs, pose significant challenges to fundamental aspects of data management such as efficient storage, indexing, and simila...
Xiaohong Wang, Aaron M. Smalter, Jun Huan, Gerald ...