Search Sciweavers | Sciweavers

55 search results - page 6 / 11

» Approximate Policy Iteration using Large-Margin Classifiers

115

click to vote

CORR
2012
Springer

235views Education» more CORR 2012»

An Incremental Sampling-based Algorithm for Stochastic Optimal Control

13 years 7 months ago

Download www.mit.edu

Abstract— In this paper, we consider a class of continuoustime, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation ...

Vu Anh Huynh, Sertac Karaman, Emilio Frazzoli

claim paper

Read More »

110

click to vote

IROS
2007
IEEE

168views Robotics» more IROS 2007»

Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression

15 years 5 months ago

Download www.cs.cmu.edu

Abstract— We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...

Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...

claim paper

Read More »

Voted

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

14 years 11 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

click to vote

ICMLA
2008

195views Machine Learning» more ICMLA 2008»

Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture

15 years 1 months ago

Download www.grappa.univ-lille3.fr

In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...

Sertan Girgin, Philippe Preux

claim paper

Read More »

click to vote

ICPR
2006
IEEE

140views computer vision» more ICPR 2006»

A Non-Iterative Approach to Reconstruct Face Templates from Match Scores

16 years 20 days ago

Download marathon.csee.usf.edu

Regeneration of biometric templates from match scores has security and privacy implications related to any biometric based authentication system. In this paper, we propose a novel...

Pranab Mohanty, Rangachar Kasturi, Sudeep Sarkar

claim paper

Read More »

« Prev « First page 6 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers