Search Sciweavers | Sciweavers

138 search results - page 3 / 28

» Dynamic Programming for Structured Continuous Markov Decisio...


IROS
2009
IEEE

206 views | Robotics

Bayesian reinforcement learning in continuous POMDPs with gaussian processes

14 years 9 days ago

Download www.cs.cmu.edu

Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle real-world sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...


ICML
2010
IEEE

223 views | Machine Learning

Feature Selection Using Regularization in Approximate Linear Programs for Markov Decision Processes

13 years 6 months ago

Download anytime.cs.umass.edu

Approximate dynamic programming has been used successfully in a large variety of domains, but it relies on a small set of provided approximation features to calculate solutions re...

Marek Petrik, Gavin Taylor, Ronald Parr, Shlomo Zi...


CORR
2011
Springer

175 views | Education

Adaptive Channel Recommendation for Dynamic Spectrum Access

13 years 19 days ago

Download home.ie.cuhk.edu.hk

We propose a dynamic spectrum access scheme where secondary users recommend "good" channels to each other and access accordingly. We formulate the problem as an average rewa...

Xu Chen, Jianwei Huang, Husheng Li


ICML
1996
IEEE

196 views | Machine Learning

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

13 years 9 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos


CCE
2004

136 views | Software Engineering

Continuous reformulations of discrete-continuous optimization problems

13 years 5 months ago

Download kop.ior.kit.edu

This paper treats the solution of nonlinear optimization problems involving discrete decision variables, also known as generalized disjunctive programming (GDP) or mixed-integer n...

Oliver Stein, Jan Oldenburg, Wolfgang Marquardt

