Search Sciweavers | Sciweavers

17 search results - page 3 / 4

» Gradient-Based Relational Reinforcement Learning of Temporal...

click to vote

ROBOCUP
2009
Springer

134views Robotics» more ROBOCUP 2009»

Learning Complementary Multiagent Behaviors: A Case Study

14 years 18 days ago

Download teamcore.usc.edu

As the reach of multiagent reinforcement learning extends to more and more complex tasks, it is likely that the diverse challenges posed by some of these tasks can only be address...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

14 years 6 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

click to vote

NIPS
2003

148views Information Technology» more NIPS 2003»

Approximate Planning in POMDPs with Macro-Actions

13 years 7 months ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling

claim paper

Read More »

click to vote

CG
2000
Springer

150views Computer Graphics» more CG 2000»

Chess Neighborhoods, Function Combination, and Reinforcement Learning

13 years 10 months ago

Download users.soe.ucsc.edu

Abstract. Over the years, various research projects have attempted to develop a chess program that learns to play well given little prior knowledge beyond the rules of the game. Ea...

Robert Levinson, Ryan Weber

claim paper

Read More »

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Batch reinforcement learning in a complex domain

14 years 7 days ago

Download userweb.cs.utexas.edu

Temporal diﬀerence reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

« Prev « First page 3 / 4 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers