Search Sciweavers | Sciweavers

246 search results - page 25 / 50

» Learning behavior styles with inverse reinforcement learning

NIPS
1996

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 1 month ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

AGENTS
2001
Springer

Hierarchical multi-agent reinforcement learning

15 years 4 months ago

Download www-anw.cs.umass.edu

In this paper, we investigate the use of hierarchical reinforcement learning (HRL) to speed up the acquisition of cooperative multi-agent tasks. We introduce a hierarchical multi-a...

Rajbala Makar, Sridhar Mahadevan, Mohammad Ghavamz...

NECO
2007

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

14 years 11 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

ICML
2010
IEEE

Inverse Optimal Control with Linearly-Solvable MDPs

15 years 24 days ago

Download www.cs.washington.edu

We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearly-solvable MDPs (LMDPs). Unlike most prior IRL algorit...

Dvijotham Krishnamurthy, Emanuel Todorov

NIPS
2008

Optimization on a Budget: A Reinforcement Learning Approach

15 years 1 month ago

Download www.cs.arizona.edu

Many popular optimization algorithms, like the Levenberg-Marquardt algorithm (LMA), use heuristic-based "controllers" that modulate the behavior of the optimizer during ...

Paul Ruvolo, Ian R. Fasel, Javier R. Movellan
