Search Sciweavers | Sciweavers

312 search results - page 15 / 63

» Learning Partially Observable Deterministic Action Models

Voted

ATAL
2010
Springer

171views Intelligent Agents» more ATAL 2010»

Closing the learning-planning loop with predictive state representations

14 years 10 months ago

Download www.cs.cmu.edu

A central problem in artificial intelligence is to choose actions to maximize reward in a partially observable, uncertain environment. To do so, we must learn an accurate model of ...

Byron Boots, Sajid M. Siddiqi, Geoffrey J. Gordon

claim paper

Read More »

click to vote

UAI
2001

129views Artificial Intelligence» more UAI 2001»

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

14 years 11 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao

claim paper

Read More »

click to vote

UAI
2008

224views Artificial Intelligence» more UAI 2008»

Sampling First Order Logical Particles

14 years 11 months ago

Download uai2008.cs.helsinki.fi

Approximate inference in dynamic systems is the problem of estimating the state of the system given a sequence of actions and partial observations. High precision estimation is fu...

Hannaneh Hajishirzi, Eyal Amir

claim paper

Read More »

click to vote

COLT
2007
Springer

104views Machine Learning» more COLT 2007»

Observational Learning in Random Networks

15 years 3 months ago

Download www.as.inf.ethz.ch

In the standard model of observational learning, n agents sequentially decide between two alternatives a or b, one of which is objectively superior. Their choice is based on a stoc...

Julian Lorenz, Martin Marciniszyn, Angelika Steger

claim paper

Read More »

109

click to vote

CVPR
2008
IEEE

304views Computer Vision» more CVPR 2008»

Context and observation driven latent variable model for human pose estimation

15 years 11 months ago

Download www.fxpal.com

Current approaches to pose estimation and tracking can be classified into two categories: generative and discriminative. While generative approaches can accurately determine human...

Abhinav Gupta, Trista Chen, Francine Chen, Don Kim...

claim paper

Read More »

« Prev « First page 15 / 63 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers