Search Sciweavers | Sciweavers

252 search results - page 34 / 51

» Learning Partially Observable Action Models: Efficient Algor...

NIPS
2001

206 views, Information Technology

Model-Free Least-Squares Policy Iteration

15 years 4 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

NIPS
1998

164 views, Information Technology

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 4 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

RAS
2000

161 views

Active object recognition by view integration and reinforcement learning

15 years 2 months ago

Download www.emt.tu-graz.ac.at

A mobile agent with the task to classify its sensor pattern has to cope with ambiguous information. Active recognition of three-dimensional objects involves the observer in a sear...

Lucas Paletta, Axel Pinz

ICWL
2005
Springer

137 views, Internet Technology

The Research of Mining Association Rules Between Personality and Behavior of Learner Under Web-Based Learning Environment

15 years 8 months ago

Download www.uu.edu

Discovering the relationship between behavior and personality of learner in the web-based learning environment is a key to guide learners in the learning process. This paper prop...

Jin Du, Qinghua Zheng, Haifei Li, Wenbin Yuan

ATAL
2007
Springer

129 views, Intelligent Agents

Subjective approximate solutions for decentralized POMDPs

15 years 9 months ago

Download www.cs.cmu.edu

A problem of planning for cooperative teams under uncertainty is a crucial one in multiagent systems. Decentralized partially observable Markov decision processes (DECPOMDPs) prov...

Anton Chechetka, Katia P. Sycara
