Sciweavers

298 search results - page 23 / 60
» An information-theoretic measure for document similarity
SAC
2010
ACM
15 years 6 months ago
Hypothesis generation and ranking based on event similarities
Accelerated by the technological advances in the domain, the size of the biomedical literature has been growing rapidly. As a result, it is not feasible for individual researchers...
Taiki Miyanishi, Kazuhiro Seki, Kuniaki Uehara
ISAAC
2005
Springer
15 years 5 months ago
On the Complexity of Rocchio's Similarity-Based Relevance Feedback Algorithm
In this paper, we prove for the first time that the learning complexity of Rocchio’s algorithm is O(d + d^2(log d + log n)) over the discretized vector space {0, ..., n − 1}^d,...
Zhixiang Chen, Bin Fu
CICLING
2007
Springer
15 years 6 months ago
Clustering Narrow-Domain Short Texts by Using the Kullback-Leibler Distance
Clustering short length texts is a difficult task itself, but adding the narrow domain characteristic poses an additional challenge for current clustering methods. We addressed thi...
David Pinto, José-Miguel Benedí, Pao...
SODA
2000
ACM
15 years 1 months ago
Communication complexity of document exchange
We address the problem of minimizing the communication involved in the exchange of similar documents. We consider two users, A and B, who hold documents x and y respectively. Neit...
Graham Cormode, Mike Paterson, Süleyman Cenk ...
DOCENG
2007
ACM
15 years 3 months ago
XML version detection
The problem of version detection is critical in many important application scenarios, including software clone identification, Web page ranking, plagiarism detection, and peer-to-...
Deise de Brum Saccol, Nina Edelweiss, Renata de Ma...