Sciweavers

Results related to: Learning Partially Observable Action Models: Efficient Algor...
UAI 2001
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...
Lex Weaver, Nigel Tao
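As a rough illustration of the setting this entry concerns, the sketch below shows a REINFORCE-style Monte Carlo policy-gradient estimate with a scalar reward baseline subtracted from the return; the episode format and function names are assumptions for illustration, not the paper's derivation of the optimal baseline.

```python
import numpy as np

def policy_gradient_estimate(episodes, grad_log_pi, baseline=0.0):
    """Monte Carlo policy-gradient estimate with a scalar reward baseline.

    episodes:     list of (states, actions, episode_return) tuples (hypothetical format)
    grad_log_pi:  function (state, action) -> gradient of log pi(action | state) w.r.t. theta
    baseline:     scalar subtracted from the return; this keeps the estimate
                  unbiased while changing its variance
    """
    grads = []
    for states, actions, ret in episodes:
        # Sum of score-function terms along the episode
        score = sum(grad_log_pi(s, a) for s, a in zip(states, actions))
        grads.append(score * (ret - baseline))
    return np.mean(grads, axis=0)
```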
CVPR 2009 (IEEE)
Learning to Track with Multiple Observers
We propose a novel approach to designing algorithms for object tracking based on fusing multiple observation models. As the space of possible observation models is too large for...
Björn Stenger, Roberto Cipolla, Thomas Woodle...
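For illustration only, here is one simple way to fuse confidence scores from several observation models when scoring candidate locations; the observer interface and the log-linear weighting are assumptions, not the fusion scheme proposed in the paper.

```python
import numpy as np

def fused_score(observers, frame, box, weights=None):
    """Log-linear fusion of confidence scores from several observation models.

    observers: list of callables (frame, box) -> score in (0, 1]
    weights:   optional per-observer mixing weights (uniform if omitted)
    """
    scores = np.array([obs(frame, box) for obs in observers], dtype=float)
    if weights is None:
        weights = np.full(len(observers), 1.0 / len(observers))
    weights = np.asarray(weights, dtype=float)
    # Weighted geometric mean of the individual scores
    return float(np.exp(np.sum(weights * np.log(scores + 1e-12))))

def track_step(observers, frame, candidate_boxes):
    """Choose the candidate box with the highest fused score."""
    return max(candidate_boxes, key=lambda box: fused_score(observers, frame, box))
```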
ICML 2000 (IEEE)
Learning Probabilistic Models for Decision-Theoretic Navigation of Mobile Robots
Decision-theoretic reasoning and planning algorithms are increasingly being used for mobile robot navigation, due to the significant uncertainty accompanying the robots' perce...
Daniel Nikovski, Illah R. Nourbakhsh
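For context, decision-theoretic planners of the kind described here typically solve a Markov decision process; the sketch below is plain value iteration over a finite MDP, with array shapes chosen for illustration rather than taken from the paper.

```python
import numpy as np

def value_iteration(T, R, gamma=0.95, tol=1e-6):
    """Value iteration for a finite MDP.

    T: array of shape (S, A, S') with transition probabilities T[s, a, s']
    R: array of shape (S, A) with expected immediate rewards R[s, a]
    Returns the optimal state values and a greedy policy.
    """
    n_states, n_actions, _ = T.shape
    V = np.zeros(n_states)
    while True:
        # Q[s, a] = R[s, a] + gamma * sum_s' T[s, a, s'] * V[s']
        Q = R + gamma * (T @ V)
        V_new = Q.max(axis=1)
        if np.max(np.abs(V_new - V)) < tol:
            return V_new, Q.argmax(axis=1)
        V = V_new
```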
AAAI 2000
Back to the Future for Consistency-Based Trajectory Tracking
Given a model of a physical process and a sequence of commands and observations received over time, the task of an autonomous controller is to determine the likely states of the p...
James Kurien, P. Pandurang Nayak
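To make the tracking idea concrete, the sketch below maintains a set of candidate state trajectories and discards those whose latest state cannot explain the newest observation; the `step` and `consistent` functions are hypothetical placeholders, not the paper's tracking procedure.

```python
def track_trajectories(candidates, command, observation, step, consistent):
    """Advance a set of candidate state trajectories by one time step.

    candidates:  list of state trajectories (lists of states) considered so far
    step:        function (state, command) -> list of possible successor states
    consistent:  function (state, observation) -> True if the observation
                 could have been produced in that state
    Only trajectories whose latest state remains consistent are kept.
    """
    extended = []
    for traj in candidates:
        for nxt in step(traj[-1], command):
            if consistent(nxt, observation):
                extended.append(traj + [nxt])
    return extended
```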
ICRA 2007 (IEEE)
A formal framework for robot learning and control under model uncertainty
While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...
Robin Jaulmes, Joelle Pineau, Doina Precup
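As background for the POMDP setting this entry refers to, here is the standard discrete Bayesian belief update; the array layout (T[s, a, s'], O[a, s', o]) is an assumption for illustration and says nothing about how the paper handles model uncertainty.

```python
import numpy as np

def belief_update(belief, action, observation, T, O):
    """Bayesian belief update for a discrete POMDP.

    belief: array b[s], current distribution over hidden states
    T:      array T[s, a, s'] of transition probabilities
    O:      array O[a, s', o] of observation probabilities
    Returns the posterior belief b'(s') after executing `action`
    and receiving `observation`.
    """
    predicted = belief @ T[:, action, :]          # sum_s b(s) T(s, a, s')
    unnormalized = O[action, :, observation] * predicted
    total = unnormalized.sum()
    if total == 0.0:
        raise ValueError("observation has zero probability under this model")
    return unnormalized / total
```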