Sciweavers

24 search results - page 4 / 5
» Reducing reinforcement learning to KWIK online regression
Sort
View
ATAL
2008
Springer
13 years 7 months ago
Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies
Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...
Thomas Gabel, Martin A. Riedmiller
NIPS
2007
13 years 7 months ago
Incremental Natural Actor-Critic Algorithms
We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...
Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...
ECML
2006
Springer
13 years 9 months ago
Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks
Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...
Sébastien Jodogne, Cyril Briquet, Justus H....
SIGIR
2008
ACM
13 years 5 months ago
A bayesian logistic regression model for active relevance feedback
Relevance feedback, which traditionally uses the terms in the relevant documents to enrich the user's initial query, is an effective method for improving retrieval performanc...
Zuobing Xu, Ram Akella
AIS
2006
Springer
13 years 5 months ago
Context enhancement for co-intentionality and co-reference in asynchronous CMC
The regulative and semantic `distance' of electronic conferencing may impede the topical alignment and the unambiguous interpretation of messages, hindering collaborative lear...
J. van der Pol, Wilfried Admiraal, P. Simons