Sciweavers

4544 search results - page 56 / 909
» Reinforcement Learning with Time
Sort
View
143
Voted
ICML
2005
IEEE
16 years 4 months ago
Exploration and apprenticeship learning in reinforcement learning
We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...
Pieter Abbeel, Andrew Y. Ng
118
Voted
CEC
2010
IEEE
15 years 4 months ago
Learning to overtake in TORCS using simple reinforcement learning
In modern racing games programming non-player characters with believable and sophisticated behaviors is getting increasingly challenging. Recently, several works in the literature ...
Daniele Loiacono, Alessandro Prete, Pier Luca Lanz...
156
Voted
AAAI
2006
15 years 5 months ago
Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance
As robots become a mass consumer product, they will need to learn new skills by interacting with typical human users. Past approaches have adapted reinforcement learning (RL) to a...
Andrea Lockerd Thomaz, Cynthia Breazeal
ECAI
2008
Springer
15 years 5 months ago
Learning to Select Object Recognition Methods for Autonomous Mobile Robots
Selecting which algorithms should be used by a mobile robot computer vision system is a decision that is usually made a priori by the system developer, based on past experience and...
Reinaldo A. C. Bianchi, Arnau Ramisa, Ramon L&oacu...
176
Voted
ESANN
2008
15 years 5 months ago
Learning to play Tetris applying reinforcement learning methods
In this paper the application of reinforcement learning to Tetris is investigated, particulary the idea of temporal difference learning is applied to estimate the state value funct...
Alexander Groß, Jan Friedland, Friedhelm Sch...