Sciweavers

1233 search results - page 73 / 247
» Reinforcement Learning in MirrorBot
Sort
View
BC
2008
56views more  BC 2008»
15 years 1 months ago
An implementation of reinforcement learning based on spike timing dependent plasticity
Patrick D. Roberts, Roberto A. Santiago, Gerardo L...
IJAIT
2008
60views more  IJAIT 2008»
15 years 1 months ago
A Hybrid Multiagent Reinforcement Learning Approach Using Strategies and Fusion
Ioannis Partalas, Ioannis Feneris, Ioannis P. Vlah...
CORR
2007
Springer
73views Education» more  CORR 2007»
15 years 1 months ago
Universal Reinforcement Learning
—We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can influence futu...
Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...