Search Sciweavers | Sciweavers

162 search results - page 14 / 33

» Off-Policy Temporal Difference Learning with Function Approx...

178

Voted

ATAL
2008
Springer

133views Intelligent Agents» more ATAL 2008»

Transfer of task representation in reinforcement learning using policy-based proto-value functions

15 years 8 months ago

Download www.aamas-conference.org

Reinforcement Learning research is traditionally devoted to solve single-task problems. Therefore, anytime a new task is faced, learning must be restarted from scratch. Recently, ...

Eliseo Ferrante, Alessandro Lazaric, Marcello Rest...

claim paper

Read More »

162

Voted

COLT
1994
Springer

108views Machine Learning» more COLT 1994»

Lower Bounds on the VC-Dimension of Smoothly Parametrized Function Classes

15 years 10 months ago

Download users.cecs.anu.edu.au

We examine the relationship between the VCdimension and the number of parameters of a smoothly parametrized function class. We show that the VC-dimension of such a function class ...

Wee Sun Lee, Peter L. Bartlett, Robert C. Williams...

claim paper

Read More »

175

Voted

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 7 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

175

click to vote

ICML
2009
IEEE

120views Machine Learning» more ICML 2009»

Learning linear dynamical systems without sequence information

16 years 1 months ago

Download www.cs.mcgill.ca

Virtually all methods of learning dynamic systems from data start from the same basic assumption: that the learning algorithm will be provided with a sequence, or trajectory, of d...

Tzu-Kuo Huang, Jeff Schneider

claim paper

Read More »

182

Voted

ICML
2008
IEEE

156views Machine Learning» more ICML 2008»

Gaussian process product models for nonparametric nonstationarity

16 years 7 months ago

Download eprints.pascal-network.org

Stationarity is often an unrealistic prior assumption for Gaussian process regression. One solution is to predefine an explicit nonstationary covariance function, but such covaria...

Ryan Prescott Adams, Oliver Stegle

claim paper

Read More »

« Prev « First page 14 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers