Sciweavers

2108 search results - page 281 / 422
» Tracking in Reinforcement Learning
EWCBR
2008
Springer
15 years 5 months ago
Discovering Feature Weights for Feature-based Indexing of Q-tables
In this paper we propose an approach to address the old problem of identifying the feature conditions under which a gaming strategy can be effective. For doing this, we will build ...
Chad Hogg, Stephen Lee-Urban, Bryan Auslander, H&e...
AIPS
2006
15 years 5 months ago
Reusing and Building a Policy Library
Policy Reuse is a method to improve reinforcement learning with the ability to solve multiple tasks by building upon past problem solving experience, as accumulated in a Policy Li...
Fernando Fernández, Manuela M. Veloso
NN
2006
Springer
15 years 3 months ago
Neural mechanism for stochastic behaviour during a competitive game
Previous studies have shown that non-human primates can generate highly stochastic choice behaviour, especially when this is required during a competitive interaction with another...
Alireza Soltani, Daeyeol Lee, Xiao-Jing Wang
TSMC
2008
15 years 3 months ago
Wholesale Power Price Dynamics Under Transmission Line Limits: A Use of an Agent-Based Intelligent Simulator
This research proposes a use of an agent-based intelligent simulator to numerically examine the influence of a transmission line limit on the dynamics of a wholesale powe...
Toshiyuki Sueyoshi, Gopalakrishna Reddy Tadiparthi
AI
2002
Springer
15 years 3 months ago
Programming backgammon using self-teaching neural nets
TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD...
Gerald Tesauro