Search Sciweavers | Sciweavers

3837 search results - page 61 / 768

ICML
2008
IEEE

A worst-case comparison between temporal difference and residual gradient with linear function approximation

16 years 2 months ago

Download www.research.rutgers.edu

Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...

Lihong Li

ICML
2009
IEEE

Constraint relaxation in approximate linear programs

16 years 2 months ago

Download anytime.cs.umass.edu

Approximate Linear Programming (ALP) is a reinforcement learning technique with nice theoretical properties, but it often performs poorly in practice. We identify some reasons for...

Marek Petrik, Shlomo Zilberstein

ICML
2009
IEEE

On sampling-based approximate spectral decomposition

16 years 2 months ago

Download www.cs.nyu.edu

This paper addresses the problem of approximate singular value decomposition of large dense matrices that arises naturally in many machine learning applications. We discuss two re...

Sanjiv Kumar, Mehryar Mohri, Ameet Talwalkar

ICML
2007
IEEE

Constructing basis functions from directed graphs for value function approximation

16 years 2 months ago

Download www.machinelearning.org

Basis functions derived from an undirected graph connecting nearby samples from a Markov decision process (MDP) have proven useful for approximating value functions. The success o...

Jeffrey Johns, Sridhar Mahadevan

IJCNN
2007
IEEE

Range Data Approximation for Mobile Robot by Using CAN2

15 years 8 months ago

Download lab.cntl.kyutech.ac.jp

In this article, we apply the competitive associative net called CAN2 to the processing of the range data of indoor environment acquired by a mobile robot, where the CAN2 is a ...

Takeshi Nishida, Shuichi Kurogi, Yuji Takemura, Hi...
