Search Sciweavers | Sciweavers

226 search results - page 25 / 46

» Linear Bayesian Reinforcement Learning

ICRA
2007
IEEE

155 views, Robotics

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

16 years 22 days ago

Download sugiyama-www.cs.titech.ac.jp

The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...

ATAL
2008
Springer

123 views, Intelligent Agents

Sigma point policy iteration

15 years 8 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

ICML
2006
IEEE

161 views, Machine Learning

Bayesian learning of measurement and structural models

16 years 7 months ago

Download www.statslab.cam.ac.uk

We present a Bayesian search algorithm for learning the structure of latent variable models of continuous variables. We stress the importance of applying search operators designed...

Ricardo Silva, Richard Scheines

GECCO
2006
Springer

177 views, Optimization

Hyper-ellipsoidal conditions in XCS: rotation, linear approximation, and solution structure

15 years 10 months ago

Download www.eskimo.com

The learning classifier system XCS is an iterative rule-learning system that evolves rule structures based on gradient-based prediction and rule quality estimates. Besides classifi...

Martin V. Butz, Pier Luca Lanzi, Stewart W. Wilson

ML
2010
ACM

151 views, Machine Learning

Inductive transfer for learning Bayesian networks

15 years 4 months ago

Download ccc.inaoep.mx

In several domains it is common to have data from different, but closely related problems. For instance, in manufacturing, many products follow the same industrial process but with...

Roger Luis, Luis Enrique Sucar, Eduardo F. Morales
