Search Sciweavers | Sciweavers

40 search results - page 5 / 8

» Learning Partially Observable Action Schemas

112

click to vote

TBILLC
2005
Springer

153views Natural Language Processing» more TBILLC 2005»

Real World Multi-agent Systems: Information Sharing, Coordination and Planning

15 years 5 months ago

Download www.science.uva.nl

Abstract. Applying multi-agent systems in real world scenarios requires several essential research questions to be answered. Agents have to perceive their environment in order to t...

Frans C. A. Groen, Matthijs T. J. Spaan, Jelle R. ...

claim paper

Read More »

click to vote

IROS
2006
IEEE

121views Robotics» more IROS 2006»

Planning and Acting in Uncertain Environments using Probabilistic Inference

15 years 5 months ago

Download www.cs.washington.edu

— An important problem in robotics is planning and selecting actions for goal-directed behavior in noisy uncertain environments. The problem is typically addressed within the fra...

Deepak Verma, Rajesh P. N. Rao

claim paper

Read More »

click to vote

UAI
2000

133views Artificial Intelligence» more UAI 2000»

PEGASUS: A policy search method for large MDPs and POMDPs

15 years 29 days ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

claim paper

Read More »

click to vote

UAI
2001

129views Artificial Intelligence» more UAI 2001»

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

15 years 1 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao

claim paper

Read More »

click to vote

ECML
2005
Springer

101views Machine Learning» more ECML 2005»

Model-Based Online Learning of POMDPs

15 years 5 months ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difﬁcult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

claim paper

Read More »

« Prev « First page 5 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers