Search Sciweavers | Sciweavers

515 search results - page 34 / 103

» Approximating Markov Processes by Averaging

117

click to vote

ICRA
2007
IEEE

155views Robotics» more ICRA 2007»

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

15 years 8 months ago

Download sugiyama-www.cs.titech.ac.jp

— The least squares approach works efﬁciently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...

claim paper

Read More »

101

click to vote

ICML
2007
IEEE

204views Machine Learning» more ICML 2007»

Constructing basis functions from directed graphs for value function approximation

16 years 2 months ago

Download www.machinelearning.org

Basis functions derived from an undirected graph connecting nearby samples from a Markov decision process (MDP) have proven useful for approximating value functions. The success o...

Jeffrey Johns, Sridhar Mahadevan

claim paper

Read More »

102

Voted

NIPS
2000

121views Information Technology» more NIPS 2000»

APRICODD: Approximate Policy Construction Using Decision Diagrams

15 years 3 months ago

Download www.cs.ubc.ca

We propose a method of approximate dynamic programming for Markov decision processes (MDPs) using algebraic decision diagrams (ADDs). We produce near-optimal value functions and p...

Robert St-Aubin, Jesse Hoey, Craig Boutilier

claim paper

Read More »

107

click to vote

ALT
2006
Springer

111views Machine Learning» more ALT 2006»

Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence

15 years 11 months ago

Download www.idsia.ch

We address the problem of reinforcement learning in which observations may exhibit an arbitrary form of stochastic dependence on past observations and actions. The task for an age...

Daniil Ryabko, Marcus Hutter

claim paper

Read More »

137

click to vote

NN
2010
Springer

187views Neural Networks» more NN 2010»

Efficient exploration through active learning for value function approximation in reinforcement learning

14 years 8 months ago

Download sugiyama-www.cs.titech.ac.jp

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...

claim paper

Read More »

« Prev « First page 34 / 103 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers