Sciweavers

267 search results - page 39 / 54
» The Dynamics of Multi-Agent Reinforcement Learning
Sort
View
88
Voted
AAAI
2006
14 years 11 months ago
Modeling Human Decision Making in Cliff-Edge Environments
In this paper we propose a model for human learning and decision making in environments of repeated Cliff-Edge (CE) interactions. In CE environments, which include common daily in...
Ron Katz, Sarit Kraus
ML
1998
ACM
136views Machine Learning» more  ML 1998»
14 years 9 months ago
Co-Evolution in the Successful Learning of Backgammon Strategy
Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...
Jordan B. Pollack, Alan D. Blair
78
Voted
ICML
2008
IEEE
15 years 10 months ago
Automatic discovery and transfer of MAXQ hierarchies
We present an algorithm, HI-MAT (Hierarchy Induction via Models And Trajectories), that discovers MAXQ task hierarchies by applying dynamic Bayesian network models to a successful...
Neville Mehta, Soumya Ray, Prasad Tadepalli, Thoma...
AIIDE
2007
14 years 12 months ago
Automatic Rule Ordering for Dynamic Scripting
The goal of adaptive game AI is to enhance computercontrolled game-playing agents with (1) the ability to selfcorrect mistakes, and (2) creativity in responding to new situations....
Timor Timuri, Pieter Spronck, H. Jaap van den Heri...
QRE
2010
129views more  QRE 2010»
14 years 8 months ago
Improving quality of prediction in highly dynamic environments using approximate dynamic programming
In many applications, decision making under uncertainty often involves two steps- prediction of a certain quality parameter or indicator of the system under study and the subseque...
Rajesh Ganesan, Poornima Balakrishna, Lance Sherry