Sciweavers

298 search results - page 17 / 60
» An information-theoretic measure for document similarity
Sort
View
WSDM
2010
ACM
261views Data Mining» more  WSDM 2010»
15 years 9 months ago
Learning Similarity Metrics for Event Identification in Social Media
Social media sites (e.g., Flickr, YouTube, and Facebook) are a popular distribution outlet for users looking to share their experiences and interests on the Web. These sites host ...
Hila Becker, Mor Naaman, Luis Gravano
ICPR
2008
IEEE
16 years 1 months ago
Clustering of short commercial documents for the web
Document clustering techniques have been applied in several areas, with the web as one of the most recent and influent. Both general-purpose and text-oriented techniques exist and...
Elisabetta Binaghi, Ignazio Gallo, Moreno Carullo,...
DEXA
2009
Springer
176views Database» more  DEXA 2009»
14 years 9 months ago
Analyzing Document Retrievability in Patent Retrieval Settings
Most information retrieval settings, such as web search, are typically precision-oriented, i.e. they focus on retrieving a small number of highly relevant documents. However, in sp...
Shariq Bashir, Andreas Rauber
WISE
2005
Springer
15 years 5 months ago
Document Re-ranking by Generality in Bio-medical Information Retrieval
Document ranking is well known to be a crucial process in information retrieval (IR). It presents retrieved documents in an order of their estimated degrees of relevance to query. ...
Xin Yan, Xue Li, Dawei Song
PRICAI
2000
Springer
15 years 3 months ago
Text Retrieval from Document Images based on N-Gram Algorithm
In this paper, we propose a method of text retrieval from document images using a similarity measure based on an N-Gram algorithm. We directly extract image features instead of us...
Chew Lim Tan, Sam Yuan Sung, Zhaohui Yu, Yi Xu