Search Sciweavers | Sciweavers

24 search results - page 4 / 5

» Reducing reinforcement learning to KWIK online regression

click to vote

ATAL
2008
Springer

138views Intelligent Agents» more ATAL 2008»

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

13 years 7 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

13 years 7 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

click to vote

ECML
2006
Springer

141views Machine Learning» more ECML 2006»

Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

13 years 9 months ago

Download www.montefiore.ulg.ac.be

Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...

Sébastien Jodogne, Cyril Briquet, Justus H....

claim paper

Read More »

click to vote

SIGIR
2008
ACM

167views Information Technology» more SIGIR 2008»

A bayesian logistic regression model for active relevance feedback

13 years 5 months ago

Download users.soe.ucsc.edu

Relevance feedback, which traditionally uses the terms in the relevant documents to enrich the user's initial query, is an effective method for improving retrieval performanc...

Zuobing Xu, Ram Akella

claim paper

Read More »

click to vote

AIS
2006
Springer

83views Artificial Intelligence» more AIS 2006»

Context enhancement for co-intentionality and co-reference in asynchronous CMC

13 years 5 months ago

Download www.ilo.uva.nl

The regulative and semantic `distance' of electronic conferencing may impede the topical alignment and the unambiguous interpretation of messages, hindering collaborative lear...

J. van der Pol, Wilfried Admiraal, P. Simons

claim paper

Read More »

« Prev « First page 4 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers