Search Sciweavers | Sciweavers

2415 search results - page 151 / 483

» Markov Processes on Curves

Voted

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 5 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

116

Voted

CISS
2007
IEEE

189views Information Technology» more CISS 2007»

Consensus Estimation via Belief Propagation

15 years 2 months ago

Download www4.ncsu.edu

Abstract –In this paper, a new problem, consensus estimation, is formulated, whose setting is complementary to the well-known CEO problem. In particular, a set of nodes are emplo...

Huaiyu Dai, Yanbing Zhang

claim paper

Read More »

Voted

AIPS
2006

161views Artificial Intelligence» more AIPS 2006»

Automated Planning Using Quantum Computation

15 years 2 months ago

Download www.aaai.org

This paper presents an adaptation of the standard quantum search technique to enable application within Dynamic Programming, in order to optimise a Markov Decision Process. This i...

Sanjeev Naguleswaran, Langford B. White, I. Fuss

claim paper

Read More »

click to vote

AIPS
2003

149views Artificial Intelligence» more AIPS 2003»

Synthesis of Hierarchical Finite-State Controllers for POMDPs

15 years 2 months ago

Download www.aaai.org

We develop a hierarchical approach to planning for partially observable Markov decision processes (POMDPs) in which a policy is represented as a hierarchical ﬁnite-state control...

Eric A. Hansen, Rong Zhou

claim paper

Read More »

click to vote

AAAI
2000

176views Intelligent Agents» more AAAI 2000»

Decision-Theoretic, High-Level Agent Programming in the Situation Calculus

15 years 2 months ago

Download www.aaai.org

We propose a frameworkfor robot programming which allows the seamless integration of explicit agent programming with decision-theoretic planning. Specifically, the DTGolog model a...

Craig Boutilier, Raymond Reiter, Mikhail Soutchans...

claim paper

Read More »

« Prev « First page 151 / 483 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers