Search Sciweavers | Sciweavers

37 search results - page 4 / 8

» An analytic solution to discrete Bayesian reinforcement lear...

101

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 1 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

Voted

FUZZIEEE
2007
IEEE

132views Fuzzy Logic» more FUZZIEEE 2007»

Fuzzy Approximation for Convergent Model-Based Reinforcement Learning

15 years 6 months ago

Download www.montefiore.ulg.ac.be

— Reinforcement learning (RL) is a learning control paradigm that provides well-understood algorithms with good convergence and consistency properties. Unfortunately, these algor...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...

claim paper

Read More »

135

click to vote

ABIALS
2008
Springer

255views Artificial Intelligence» more ABIALS 2008»

Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning

15 years 1 months ago

Download axon.cs.byu.edu

Abstract. In order to establish autonomous behavior for technical systems, the well known trade-off between reactive control and deliberative planning has to be considered. Within ...

Matthias Rungger, Hao Ding, Olaf Stursberg

claim paper

Read More »

click to vote

NETCOOP
2007
Springer

130views Computer Networks» more NETCOOP 2007»

Load Shared Sequential Routing in MPLS Networks: System and User Optimal Solutions

15 years 5 months ago

Download www.tsp.ece.mcgill.ca

Recently Gerald Ash has shown through case studies that event dependent routing is attractive in large scale multi-service MPLS networks. In this paper, we consider the application...

Gilles Brunet, Fariba Heidari, Lorne Mason

claim paper

Read More »

112

click to vote

ATAL
2010
Springer

146views Intelligent Agents» more ATAL 2010»

PAC-MDP learning with knowledge-based admissible models

14 years 12 months ago

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko

claim paper

Read More »

« Prev « First page 4 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers