Sciweavers

ECML
2007
Springer
13 years 9 months ago
Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs
Abstract. We present a new reinforcement learning approach for deterministic continuous control problems in environments with unknown, arbitrary reward functions. The difficulty of...
Gerhard Neumann, Michael Pfeiffer, Wolfgang Maass