Sciweavers

699 search results - page 65 / 140
» Hierarchical pitman-yor language model for information retri...
Sort
View
SIGIR
2004
ACM
15 years 3 months ago
The document as an ergodic markov chain
In recent years, statistical language models are being proposed as alternative to the vector space model. Viewing documents as language samples introduces the issue of defining a...
Eduard Hoenkamp, Dawei Song
NLPRS
2001
Springer
15 years 2 months ago
A Hierarchical EM Approach to Word Segmentation
We propose a simple two-level hierarchical probability model for unsupervised word segmentation. By treating words as strings composed of morphemes/phonemes which are themselves c...
Fuchun Peng, Dale Schuurmans
TREC
2001
14 years 11 months ago
Retrieving Web Pages Using Content, Links, URLs and Anchors
For this year's web track, we concentrated on the entry page finding task. For the content-only runs, in both the ad-hoc task and the entry page finding task, we used an infor...
Thijs Westerveld, Wessel Kraaij, Djoerd Hiemstra
ACL
2009
14 years 7 months ago
A Generative Blog Post Retrieval Model that Uses Query Expansion based on External Collections
User generated content is characterized by short, noisy documents, with many spelling errors and unexpected language usage. To bridge the vocabulary gap between the user's in...
Wouter Weerkamp, Krisztian Balog, Maarten de Rijke
CIKM
2010
Springer
14 years 8 months ago
Discriminative factored prior models for personalized content-based recommendation
Most existing content-based filtering approaches including Rocchio, Language Models, SVM, Logistic Regression, Neural Networks, etc. learn user profiles independently without ca...
Lanbo Zhang, Yi Zhang 0001