Search Sciweavers | Sciweavers

68 search results - page 3 / 14

Feature-Discovering Approximate Value Iteration Methods

ICA
2010
Springer

208 views, Signal Processing

An Alternating Minimization Method for Sparse Channel Estimation

13 years 6 months ago

Download ee.sharif.ir

The problem of estimating a sparse channel, i.e. a channel with a few non-zero taps, appears in many fields of communication including acoustic underwater or wireless transmissions...

Rad Niazadeh, Massoud Babaie-Zadeh, Christian Jutt...

Publication

222 views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

14 years 3 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

NIPS
2000

121 views, Information Technology

APRICODD: Approximate Policy Construction Using Decision Diagrams

13 years 7 months ago

Download www.cs.ubc.ca

We propose a method of approximate dynamic programming for Markov decision processes (MDPs) using algebraic decision diagrams (ADDs). We produce near-optimal value functions and p...

Robert St-Aubin, Jesse Hoey, Craig Boutilier

CORR
2010
Springer

170 views, Education

Global Optimization for Value Function Approximation

13 years 6 months ago

Download www.cs.umass.edu

Existing value function approximation methods have been successfully used in many applications, but they often lack useful a priori error bounds. We propose a new approximate bili...

Marek Petrik, Shlomo Zilberstein

JMLR
2006

143 views

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

13 years 6 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos
