Sciweavers

3274 search results - page 189 / 655
» Using Learning in a Control Agent
Sort
View
ECML
2005
Springer
16 years 6 days ago
Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes
Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...
Masoumeh T. Izadi, Doina Precup
AIED
2007
Springer
16 years 26 days ago
Does Learner Control Affect Learning?
Many intelligent tutoring systems permit some degree of learner control. A natural question is whether the increased student engagement and motivation such control provides results...
Joseph E. Beck
ATAL
2003
Springer
15 years 12 months ago
How to calm hyperactive agents
System performance in multi-agent resource allocation systems can often improve if individual agents reduce their activity. Agents in such systems need a way to modulate their ind...
H. Van Dyke Parunak, Sven Brueckner, Robert S. Mat...
PKDD
2009
Springer
181views Data Mining» more  PKDD 2009»
16 years 1 months ago
Active Learning for Reward Estimation in Inverse Reinforcement Learning
Abstract. Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...
Manuel Lopes, Francisco S. Melo, Luis Montesano
GECCO
2007
Springer
186views Optimization» more  GECCO 2007»
16 years 26 days ago
Evolving controllers for simulated car racing using object oriented genetic programming
Several different controller representations are compared on a non-trivial problem in simulated car racing, with respect to learning speed and final fitness. The controller rep...
Alexandros Agapitos, Julian Togelius, Simon M. Luc...