Search Sciweavers | Sciweavers

332 search results - page 67 / 67

» Ranking policies in discrete Markov decision processes

click to vote

CACM
1998

103views more CACM 1998»

The Virtual Design Team

13 years 4 months ago

Download www.stanford.edu

The long range goal of the “Virtual Design Team” (VDT) research program is to develop computational tools to analyze decision making and communication behavior and thereby to ...

John C. Kunz, Tore R. Christiansen, Geoff P. Cohen...

claim paper

Read More »

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

14 years 5 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

« Prev « First page 67 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers