Sciweavers

2108 search results - page 131 / 422
» Tracking in Reinforcement Learning
Sort
View
97
Voted
BC
2008
56views more  BC 2008»
15 years 3 months ago
An implementation of reinforcement learning based on spike timing dependent plasticity
Patrick D. Roberts, Roberto A. Santiago, Gerardo L...
70
Voted
COLING
2008
15 years 3 months ago
Hybrid Reinforcement/Supervised Learning of Dialogue Policies from Fixed Data Sets
James Henderson, Oliver Lemon, Kallirroi Georgila
117
Voted
IJAIT
2008
60views more  IJAIT 2008»
15 years 3 months ago
A Hybrid Multiagent Reinforcement Learning Approach Using Strategies and Fusion
Ioannis Partalas, Ioannis Feneris, Ioannis P. Vlah...
CORR
2007
Springer
73views Education» more  CORR 2007»
15 years 3 months ago
Universal Reinforcement Learning
—We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can influence futu...
Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...