Search Sciweavers | Sciweavers

813 search results - page 32 / 163

» Ensemble Algorithms in Reinforcement Learning

139

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 5 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

218

click to vote

Publication

334views

Rollout Sampling Approximate Policy Iteration

16 years 27 days ago

Download www.springerlink.com

Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schem...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

143

Voted

SIGGRAPH
2010
ACM

295views Computer Graphics» more SIGGRAPH 2010»

Learning behavior styles with inverse reinforcement learning

15 years 8 months ago

Download grail.cs.washington.edu

We present a method for inferring the behavior styles of character controllers from a small set of examples. We show that a rich set of behavior variations can be captured by dete...

Seong Jae Lee, Zoran Popovic

claim paper

Read More »

142

Voted

ICANN
2003
Springer

161views Neural Networks» more ICANN 2003»

Confidence Estimation Using the Incremental Learning Algorithm, Learn++

15 years 9 months ago

Download users.rowan.edu

Pattern recognition problems span a broad range of applications, where each application has its own tolerance on classification error. The varying levels of risk associated with ma...

Jeffrey Byorick, Robi Polikar

claim paper

Read More »

148

Voted

IJCAI
2007

254views Artificial Intelligence» more IJCAI 2007»

Bayesian Inverse Reinforcement Learning

15 years 5 months ago

Download www.ijcai.org

Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an e...

Deepak Ramachandran, Eyal Amir

claim paper

Read More »

« Prev « First page 32 / 163 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers