Sciweavers

4255 search results - page 160 / 851
» On Learning Boolean Functions
Sort
View
PKDD
2009
Springer
181views Data Mining» more  PKDD 2009»
15 years 4 months ago
Active Learning for Reward Estimation in Inverse Reinforcement Learning
Abstract. Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...
Manuel Lopes, Francisco S. Melo, Luis Montesano
ICML
1998
IEEE
15 years 11 months ago
The MAXQ Method for Hierarchical Reinforcement Learning
This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...
Thomas G. Dietterich
IJCAI
2007
14 years 11 months ago
Bayesian Inverse Reinforcement Learning
Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an e...
Deepak Ramachandran, Eyal Amir
ICML
2003
IEEE
15 years 3 months ago
The Significance of Temporal-Difference Learning in Self-Play Training TD-Rummy versus EVO-rummy
Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of ...
Clifford Kotnik, Jugal K. Kalita
ECTEL
2006
Springer
15 years 1 months ago
Community Based Software Development - the Case of Movelex
Abstract. The paper provides an overview of the elaboration, testing and improvement of Movelex, a complex virtual learning environment (VLE) supporting the establishment of self-r...
Kornél Varga, Andrea Kárpáti