Search Sciweavers | Sciweavers

267 search results - page 39 / 54

» The Dynamics of Multi-Agent Reinforcement Learning

AAAI
2006

127 views | Intelligent Agents

Modeling Human Decision Making in Cliff-Edge Environments

15 years 1 month ago

Download www.aaai.org

In this paper we propose a model for human learning and decision making in environments of repeated Cliff-Edge (CE) interactions. In CE environments, which include common daily in...

Ron Katz, Sarit Kraus


ML
1998
ACM

136 views | Machine Learning

Co-Evolution in the Successful Learning of Backgammon Strategy

14 years 11 months ago

Download www.demo.cs.brandeis.edu

Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...

Jordan B. Pollack, Alan D. Blair


ICML
2008
IEEE

162 views | Machine Learning

Automatic discovery and transfer of MAXQ hierarchies

16 years 16 days ago

Download pages.cs.wisc.edu

We present an algorithm, HI-MAT (Hierarchy Induction via Models And Trajectories), that discovers MAXQ task hierarchies by applying dynamic Bayesian network models to a successful...

Neville Mehta, Soumya Ray, Prasad Tadepalli, Thoma...


AIIDE
2007

159 views | Artificial Intelligence

Automatic Rule Ordering for Dynamic Scripting

15 years 2 months ago

Download ticc.uvt.nl

The goal of adaptive game AI is to enhance computer-controlled game-playing agents with (1) the ability to self-correct mistakes, and (2) creativity in responding to new situations....

Timor Timuri, Pieter Spronck, H. Jaap van den Heri...


QRE
2010

129 views

Improving quality of prediction in highly dynamic environments using approximate dynamic programming

14 years 10 months ago

Download mason.gmu.edu

In many applications, decision making under uncertainty often involves two steps: prediction of a certain quality parameter or indicator of the system under study and the subseque...

Rajesh Ganesan, Poornima Balakrishna, Lance Sherry

