Search Sciweavers | Sciweavers

65 search results - page 6 / 13

» An Embeddable Virtual Machine for State Space Generation

click to vote

ICML
2004
IEEE

161views Machine Learning» more ICML 2004»

Using relative novelty to identify useful temporal abstractions in reinforcement learning

16 years 12 days ago

Download www.cs.umass.edu

lative Novelty to Identify Useful Temporal Abstractions in Reinforcement Learning ?Ozg?ur S?im?sek ozgur@cs.umass.edu Andrew G. Barto barto@cs.umass.edu Department of Computer Scie...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

click to vote

DAGSTUHL
2007

178views Software Engineering» more DAGSTUHL 2007»

Programming self developing blob machines for spatial computing.

15 years 1 months ago

Download drops.dagstuhl.de

: This is a position paper introducing blob computing: A Blob is a generic primitive used to structure a uniform computing substrate into an easier-to-program parallel virtual mach...

Frédéric Gruau, Christine Eisenbeis

claim paper

Read More »

Voted

ICML
2005
IEEE

145views Machine Learning» more ICML 2005»

Proto-value functions: developmental reinforcement learning

16 years 12 days ago

Download www.cs.umass.edu

This paper presents a novel framework called proto-reinforcement learning (PRL), based on a mathematical model of a proto-value function: these are task-independent basis function...

Sridhar Mahadevan

claim paper

Read More »

click to vote

ICML
2005
IEEE

135views Machine Learning» more ICML 2005»

Finite time bounds for sampling based fitted value iteration

16 years 12 days ago

Download www.machinelearning.org

In this paper we consider sampling based fitted value iteration for discounted, large (possibly infinite) state space, finite action Markovian Decision Problems where only a gener...

Csaba Szepesvári, Rémi Munos

claim paper

Read More »

click to vote

ML
2002
ACM

143views Machine Learning» more ML 2002»

A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes

14 years 11 months ago

Download www.cis.upenn.edu

An issue that is critical for the application of Markov decision processes MDPs to realistic problems is how the complexity of planning scales with the size of the MDP. In stochas...

Michael J. Kearns, Yishay Mansour, Andrew Y. Ng

claim paper

Read More »

« Prev « First page 6 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers