Sciweavers

1715 search results - page 54 / 343
» Document Retrieval using a Probabilistic Knowledge Model
Sort
View
CIKM
2011
Springer
13 years 9 months ago
Probabilistic near-duplicate detection using simhash
This paper offers a novel look at using a dimensionalityreduction technique called simhash [8] to detect similar document pairs in large-scale collections. We show that this algo...
Sadhan Sood, Dmitri Loguinov
SIGIR
2005
ACM
15 years 3 months ago
Modeling task-genre relationships for IR in the workplace
Context influences the search process, but to date research has not definitively identified which aspects of context are the most influential for information retrieval, and thus a...
Luanne Freund, Elaine G. Toms, Charles L. A. Clark...
SIGIR
2005
ACM
15 years 3 months ago
Relation between PLSA and NMF and implications
Non-negative Matrix Factorization (NMF, [5]) and Probabilistic Latent Semantic Analysis (PLSA, [4]) have been successfully applied to a number of text analysis tasks such as docum...
Éric Gaussier, Cyril Goutte
KCAP
2005
ACM
15 years 3 months ago
Enhancing knowledge mapping using automatically derived concepts
Knowledge-mapping tools enable users to quickly identify relevant information and expertise. This paper discusses a number of natural-language phenomena that limit the performance...
Anjo Anjewierden, Willem-Olaf Huijsen, Marjan Groo...
SIGIR
2003
ACM
15 years 3 months ago
Text categorization by boosting automatically extracted concepts
Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...
Lijuan Cai, Thomas Hofmann