Sciweavers

13 search results - page 1 / 3
ICML
2000
IEEE
14 years 5 months ago
Learning to Probabilistically Identify Authoritative Documents
We describe a model of document citation that learns to identify hubs and authorities in a set of linked documents, such as pages retrieved from the world wide web, or papers retr...
David Cohn, Huan Chang
WWW
2008
ACM
14 years 5 months ago
Mining the search trails of surfing crowds: identifying relevant websites from user activity
The paper proposes identifying relevant information sources from the history of combined searching and browsing behavior of many Web users. While it has been previously shown that...
Mikhail Bilenko, Ryen W. White
NIPS
2000
13 years 5 months ago
The Missing Link - A Probabilistic Model of Document Content and Hypertext Connectivity
We describe a joint probabilistic model for modeling the contents and inter-connectivity of document collections such as sets of web pages or research paper archives. The model is...
David A. Cohn, Thomas Hofmann
DEXAW
2010
IEEE
13 years 5 months ago
Identifying Sentence-Level Semantic Content Units with Topic Models
Statistical approaches to document content modeling typically focus either on broad topics or on discourse-level subtopics of a text. We present an analysis of the perform...
Leonhard Hennig, Thomas Strecker, Sascha Narr, Ern...
CORR
2011
Springer
12 years 11 months ago
Probability Based Clustering for Document and User Properties
Information Retrieval systems can be improved by exploiting context information such as user and document features. This article presents a model based on overlapping probabilistic...
Thomas Mandl, Christa Womser-Hacker