Search Sciweavers | Sciweavers

575 search results - page 64 / 115

» Reinforcement Learning State Estimator

109

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 4 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

111

click to vote

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

15 years 10 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

click to vote

BC
2008

134views more BC 2008»

Interacting with an artificial partner: modeling the role of emotional aspects

14 years 10 months ago

Download homes.dsi.unimi.it

In this paper we introduce a simple model based on probabilistic finite state automata to describe an emotional interaction between a robot and a human user, or between simulated a...

Isabella Cattinelli, Massimiliano Goldwurm, N. Alb...

claim paper

Read More »

click to vote

AGI
2008

136views Artificial Intelligence» more AGI 2008»

An Integrative Methodology for Teaching Embodied Non-Linguistic Agents, Applied to Virtual Animals in Second Life

14 years 11 months ago

Download www.novamente.net

A teaching methodology called Imitative-Reinforcement-Corrective (IRC) learning is described, and proposed as a general approach for teaching embodied non-linguistic AGI systems. I...

Ben Goertzel, Cassio Pennachin, Nil Geisweiller, M...

claim paper

Read More »

click to vote

ICML
2010
IEEE

247views Machine Learning» more ICML 2010»

Inverse Optimal Control with Linearly-Solvable MDPs

14 years 11 months ago

Download www.cs.washington.edu

We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearlysolvable MDPs (LMDPs). Unlike most prior IRL algorit...

Dvijotham Krishnamurthy, Emanuel Todorov

claim paper

Read More »

« Prev « First page 64 / 115 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers