Search Sciweavers | Sciweavers

102 search results - page 8 / 21

» Efficient Asymptotic Approximation in Temporal Difference Le...

ICML
2006
IEEE

256 views, Machine Learning

Automatic basis function construction for approximate dynamic programming and reinforcement learning

15 years 5 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

JMLR
2010

119 views

A Convergent Online Single Time Scale Actor Critic Algorithm

14 years 6 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

BC
2005

71 views

The spatiotemporal learning rule and its efficiency in separating spatiotemporal patterns

14 years 11 months ago

Download ece.ut.ac.ir

The hippocampus plays an important role in the course of establishing long-term memory, i.e., to make short-term memory of spatially and temporally associated input information. In...

M. Tsukada, X. Pan

CPAIOR
2006
Springer

125 views, Operations Research

An Efficient Hybrid Strategy for Temporal Planning

15 years 3 months ago

Download www.cse.wustl.edu

Temporal planning (TP) is notoriously difficult because it requires solving a propositional STRIPS planning problem with temporal constraints. In this paper, we propose an efficie...

Zhao Xing, Yixin Chen, Weixiong Zhang

CORR
2002
Springer

132 views, Education

Robust Feature Selection by Mutual Information Distributions

14 years 11 months ago

Download www.idsia.ch

Mutual information is widely used in artificial intelligence, in a descriptive way, to measure the stochastic dependence of discrete random variables. In order to address question...

Marco Zaffalon, Marcus Hutter
