Search Sciweavers | Sciweavers

21 search results - page 4 / 5

» Variance Reduction Techniques for Gradient Estimates in Rein...

click to vote

JMLR
2010

148views more JMLR 2010»

A Generalized Path Integral Control Approach to Reinforcement Learning

13 years 16 days ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

click to vote

AAAI
2008

207views Intelligent Agents» more AAAI 2008»

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

13 years 8 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

claim paper

Read More »

click to vote

CORR
2010
Springer

124views Education» more CORR 2010»

Online Learning of Noisy Data with Kernels

13 years 5 months ago

Download homes.dsi.unimi.it

We study online learning when individual instances are corrupted by adversarially chosen random noise. We assume the noise distribution is unknown, and may change over time with n...

Nicolò Cesa-Bianchi, Shai Shalev-Shwartz, O...

claim paper

Read More »

click to vote

NIPS
2008

188views Information Technology» more NIPS 2008»

Bayesian Kernel Shaping for Learning Control

13 years 7 months ago

Download eprints.pascal-network.org

In kernel-based regression learning, optimizing each kernel individually is useful when the data density, curvature of regression surfaces (or decision boundaries) or magnitude of...

Jo-Anne Ting, Mrinal Kalakrishnan, Sethu Vijayakum...

claim paper

Read More »

click to vote

ICML
2000
IEEE

114views Machine Learning» more ICML 2000»

Complete Cross-Validation for Nearest Neighbor Classifiers

14 years 6 months ago

Download www-2.cs.cmu.edu

Cross-validation is an established technique for estimating the accuracy of a classifier and is normally performed either using a number of random test/train partitions of the dat...

Matthew D. Mullin, Rahul Sukthankar

claim paper

Read More »

« Prev « First page 4 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers