Sciweavers

486 search results - page 42 / 98
» A Bayesian Framework for Reinforcement Learning
Sort
View
ICML
2001
IEEE
15 years 10 months ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
ICML
2005
IEEE
15 years 10 months ago
Preference learning with Gaussian processes
In this paper, we propose a probabilistic kernel approach to preference learning based on Gaussian processes. A new likelihood function is proposed to capture the preference relat...
Wei Chu, Zoubin Ghahramani
77
Voted
UAI
1997
14 years 11 months ago
Update Rules for Parameter Estimation in Bayesian Networks
This paper re-examines the problem of parameter estimation in Bayesian networks with missing values and hidden variables from the perspective of recent work in on-line learning [1...
Eric Bauer, Daphne Koller, Yoram Singer
ATAL
2004
Springer
15 years 3 months ago
A Bayes Net Approach to Argumentation
Argumentation-based negotiation approaches have been proposed to present realistic negotiation contexts. This paper presents a novel Bayesian network based argumentation and decis...
Sabyasachi Saha, Sandip Sen
IJCAI
2001
14 years 11 months ago
Rational and Convergent Learning in Stochastic Games
This paper investigates the problem of policy learning in multiagent environments using the stochastic game framework, which we briefly overview. We introduce two properties as de...
Michael H. Bowling, Manuela M. Veloso