Search Sciweavers | Sciweavers

252 search results - page 35 / 51

» Learning Partially Observable Action Models: Efficient Algor...

183

click to vote

ATAL
2006
Springer

107views Intelligent Agents» more ATAL 2006»

Winning back the CUP for distributed POMDPs: planning over continuous belief spaces

15 years 10 months ago

Download teamcore.usc.edu

Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are evolving as a popular approach for modeling multiagent systems, and many different algorithms ha...

Pradeep Varakantham, Ranjit Nair, Milind Tambe, Ma...

claim paper

Read More »

175

click to vote

ICML
2006
IEEE

137views Machine Learning» more ICML 2006»

Predictive linear-Gaussian models of controlled stochastic dynamical systems

16 years 7 months ago

Download www.rudary.com

We introduce the controlled predictive linearGaussian model (cPLG), a model that uses predictive state to model discrete-time dynamical systems with real-valued observations and v...

Matthew R. Rudary, Satinder P. Singh

claim paper

Read More »

189

click to vote

ICMAS
1998

78views Intelligent Agents» more ICMAS 1998»

How to Explore your Opponent's Strategy (almost) Optimally

15 years 8 months ago

Download www.cs.technion.ac.il

This work presents a lookahead-based exploration strategy for a model-based learning agent that enables exploration of the opponent's behavior during interaction in a multi-a...

David Carmel, Shaul Markovitch

claim paper

Read More »

185

click to vote

CORR
2006
Springer

101views Education» more CORR 2006»

Metric State Space Reinforcement Learning for a Vision-Capable Mobile Robot

15 years 7 months ago

Download www.idsia.ch

We address the problem of autonomously learning controllers for visioncapable mobile robots. We extend McCallum's (1995) Nearest-Sequence Memory algorithm to allow for genera...

Viktor Zhumatiy, Faustino J. Gomez, Marcus Hutter,...

claim paper

Read More »

198

click to vote

UAI
2008

234views Artificial Intelligence» more UAI 2008»

Improving Gradient Estimation by Incorporating Sensor Data

15 years 8 months ago

Download www.cs.berkeley.edu

An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas m...

Gregory Lawrence, Stuart J. Russell

claim paper

Read More »

« Prev « First page 35 / 51 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers