Search Sciweavers | Sciweavers

267 search results - page 28 / 54

» The Dynamics of Multi-Agent Reinforcement Learning

127

click to vote

AAAI
1998

181views Intelligent Agents» more AAAI 1998»

Applying Online Search Techniques to Continuous-State Reinforcement Learning

15 years 2 months ago

Download www.autonlab.org

In this paper, we describe methods for e ciently computing better solutions to control problems in continuous state spaces. We provide algorithms that exploit online search to boo...

Scott Davies, Andrew Y. Ng, Andrew W. Moore

claim paper

Read More »

116

click to vote

PKDD
2010
Springer

129views Data Mining» more PKDD 2010»

Smarter Sampling in Model-Based Bayesian Reinforcement Learning

14 years 11 months ago

Download www.cs.mcgill.ca

Abstract. Bayesian reinforcement learning (RL) is aimed at making more efﬁcient use of data samples, but typically uses signiﬁcantly more computation. For discrete Markov Decis...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

109

click to vote

NIPS
1996

112views Information Technology» more NIPS 1996»

Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning

15 years 2 months ago

Download www.ri.cmu.edu

Model learning combined with dynamic programming has been shown to be e ective for learning control of continuous state dynamic systems. The simplest method assumes the learned mod...

Jeff G. Schneider

claim paper

Read More »

105

click to vote

IROS
2006
IEEE

187views Robotics» more IROS 2006»

Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic

15 years 7 months ago

Download hawaii.aist-nara.ac.jp

— Recently, many researchers on humanoid robotics are interested in Quasi-Passive-Dynamic Walking (Quasi-PDW) which is similar to human walking. It is desirable that control para...

Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To...

claim paper

Read More »

141

click to vote

ECML
2007
Springer

170views Machine Learning» more ECML 2007»

Sequence Labeling with Reinforcement Learning and Ranking Algorithms

15 years 3 months ago

Download nieme.lip6.fr

Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatic involve the generic task of sequence labeling. In many cases, the aim is to assi...

Francis Maes, Ludovic Denoyer, Patrick Gallinari

claim paper

Read More »

« Prev « First page 28 / 54 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers