Sciweavers
Explore
Publications
Books
Software
Tutorials
Presentations
Lectures Notes
Datasets
Labs
Conferences
Community
Upcoming
Conferences
Top Ranked Papers
Most Viewed Conferences
Conferences by Acronym
Conferences by Subject
Conferences by Year
Tools
PDF Tools
Image Tools
Text Tools
OCR Tools
Symbol and Emoji Tools
On-screen Keyboard
Latex Math Equation to Image
Smart IPA Phonetic Keyboard
Community
Sciweavers
About
Terms of Use
Privacy Policy
Cookies
1799
search results - page 20 / 360
»
Filtered Reinforcement Learning
Sort
relevance
views
votes
recent
update
View
thumb
title
101
click to vote
ECML
1998
Springer
85
views
Machine Learning
»
more
ECML 1998
»
Theoretical Results on Reinforcement Learning with Temporally Abstract Options
15 years 7 months ago
Download
webdocs.cs.ualberta.ca
Doina Precup, Richard S. Sutton, Satinder P. Singh
claim paper
Read More »
101
click to vote
EWRL
2008
133
views
Machine Learning
»
more
EWRL 2008
»
Exploiting Additive Structure in Factored MDPs for Reinforcement Learning
15 years 4 months ago
Download
ewrl08.futurs.inria.fr
Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...
claim paper
Read More »
90
click to vote
ML
2008
ACM
95
views
Machine Learning
»
more
ML 2008
»
Transfer in variable-reward hierarchical reinforcement learning
15 years 3 months ago
Download
web.engr.oregonstate.edu
Neville Mehta, Sriraam Natarajan, Prasad Tadepalli...
claim paper
Read More »
83
click to vote
ML
2000
ACM
133
views
Machine Learning
»
more
ML 2000
»
Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms
15 years 2 months ago
Download
www.cs.rutgers.edu
Satinder P. Singh, Tommi Jaakkola, Michael L. Litt...
claim paper
Read More »
209
click to vote
ICAART
2010
INSTICC
509
views
Intelligent Agents
»
more
ICAART 2010
»
Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning
16 years 3 days ago
Download
arxiv.org
There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...
Christos Dimitrakakis
posted by
olethros
Read More »
« Prev
« First
page 20 / 360
Last »
Next »