Search Sciweavers | Sciweavers

1249 search results - page 202 / 250

» State Machine Modeling: From Synch States to Synchronized St...

123

click to vote

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

16 years 18 days ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

133

click to vote

RV
2010
Springer

220views Hardware» more RV 2010»

Runtime Verification with the RV System

14 years 9 months ago

Download fsl.cs.uiuc.edu

The RV system is the first system to merge the benefits of Runtime Monitoring with Predictive Analysis. The Runtime Monitoring portion of RV is based on the successful Monitoring O...

Patrick O'Neil Meredith, Grigore Rosu

claim paper

Read More »

128

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

16 years 18 days ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

100

click to vote

ICML
1994
IEEE

152views Machine Learning» more ICML 1994»

A Modular Q-Learning Architecture for Manipulator Task Decomposition

15 years 3 months ago

Download mi.eng.cam.ac.uk

Compositional Q-Learning (CQ-L) (Singh 1992) is a modular approach to learning to performcomposite tasks made up of several elemental tasks by reinforcement learning. Skills acqui...

Chen K. Tham, Richard W. Prager

claim paper

Read More »

click to vote

ICML
2003
IEEE

104views Machine Learning» more ICML 2003»

The Influence of Reward on the Speed of Reinforcement Learning: An Analysis of Shaping

15 years 5 months ago

Download www.hpl.hp.com

Shaping can be an effective method for improving the learning rate in reinforcement systems. Previously, shaping has been heuristically motivated and implemented. We provide a for...

Adam Laud, Gerald DeJong

claim paper

Read More »

« Prev « First page 202 / 250 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers