Sciweavers

575 search results - page 39 / 115
» Reinforcement Learning State Estimator
Sort
View
ICML
1999
IEEE
15 years 10 months ago
Implicit Imitation in Multiagent Reinforcement Learning
Imitation is actively being studied as an effective means of learning in multi-agent environments. It allows an agent to learn how to act well (perhaps optimally) by passively obs...
Bob Price, Craig Boutilier
EWRL
2008
14 years 11 months ago
Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case
We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...
Kirill Dyagilev, Shie Mannor, Nahum Shimkin
ECAI
2010
Springer
14 years 11 months ago
The Dynamics of Multi-Agent Reinforcement Learning
Abstract. Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...
Luke Dickens, Krysia Broda, Alessandra Russo
ML
2002
ACM
121views Machine Learning» more  ML 2002»
14 years 9 months ago
Near-Optimal Reinforcement Learning in Polynomial Time
We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...
Michael J. Kearns, Satinder P. Singh
ECML
2004
Springer
15 years 3 months ago
Dynamic Asset Allocation Exploiting Predictors in Reinforcement Learning Framework
Given the pattern-based multi-predictors of the stock price, we study a method of dynamic asset allocation to maximize the trading performance. To optimize the proportion of asset ...
Jangmin O, Jae Won Lee, Jongwoo Lee, Byoung-Tak Zh...