Sciweavers

4345 search results - page 181 / 869
» Relational Reinforcement Learning
Sort
View
160
Voted
ICML
2000
IEEE
16 years 4 months ago
Eligibility Traces for Off-Policy Policy Evaluation
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
136
Voted
ECML
2004
Springer
15 years 9 months ago
Experiments in Value Function Approximation with Sparse Support Vector Regression
Abstract. We present first experiments using Support Vector Regression as function approximator for an on-line, sarsa-like reinforcement learner. To overcome the batch nature of S...
Tobias Jung, Thomas Uthmann
133
Voted
CSREAEEE
2008
199views Business» more  CSREAEEE 2008»
15 years 5 months ago
Progranimate - A Web Enabled Algorithmic Problem Solving Application
- This paper proposes the use of an interactive web based problem solving application that utilises flowchart based programming and code generation to address the issues faced by n...
Andrew Scott, Mike Watkins, Duncan McPhee
137
Voted
ICDM
2008
IEEE
150views Data Mining» more  ICDM 2008»
15 years 10 months ago
Pseudolikelihood EM for Within-network Relational Learning
In this work, we study the problem of within-network relational learning and inference, where models are learned on a partially labeled relational dataset and then are applied to ...
Rongjing Xiang, Jennifer Neville
119
Voted
ECML
2005
Springer
15 years 9 months ago
Model-Based Online Learning of POMDPs
Abstract. Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...
Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony