Sciweavers

4544 search results - page 23 / 909
» Reinforcement Learning with Time
Sort
View
JMLR
2010
119views more  JMLR 2010»
14 years 8 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
144
Voted
ICML
1998
IEEE
15 years 6 months ago
Learning to Drive a Bicycle Using Reinforcement Learning and Shaping
We present and solve a real-world problem of learning to drive a bicycle. We solve the problem by online reinforcement learning using the Sarsa(   )-algorithm. Then we solve the ...
Jette Randløv, Preben Alstrøm
COST
2009
Springer
185views Multimedia» more  COST 2009»
14 years 11 months ago
How an Agent Can Detect and Use Synchrony Parameter of Its Own Interaction with a Human?
Synchrony is claimed by psychology as a crucial parameter of any social interaction: to give to human a feeling of natural interaction, a feeling of agency [17], an agent must be a...
Ken Prepin, Philippe Gaussier
ATAL
2009
Springer
14 years 11 months ago
Replicator Dynamics for Multi-agent Learning: An Orthogonal Approach
Today's society is largely connected and many real life applications lend themselves to be modeled as multi-agent systems. Although such systems as well as their models are d...
Michael Kaisers, Karl Tuyls
110
Voted
ICML
2000
IEEE
16 years 2 months ago
Learning to Fly: An Application of Hierarchical Reinforcement Learning
Malcolm R. K. Ryan, Mark D. Reid