Search Sciweavers | Sciweavers

10 search results - page 1 / 2

ALT
2006
Springer

111 views, Machine Learning

Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence

14 years 1 month ago

Download www.idsia.ch

We address the problem of reinforcement learning in which observations may exhibit an arbitrary form of stochastic dependence on past observations and actions. The task for an age...

Daniil Ryabko, Marcus Hutter

MMAS
2010
Springer

162 views, Intelligent Agents

An Asymptotic Analysis of the Mean First Passage Time for Narrow Escape Problems: Part I: Two-Dimensional Domains

12 years 11 months ago

Download www.math.ubc.ca

The mean first passage time (MFPT) is calculated for a Brownian particle in a bounded two-dimensional domain that contains N small nonoverlapping absorbing windows on its boundary....

S. Pillay, Michael J. Ward, A. Peirce, Theodore Ko...

ATAL
2007
Springer

162 views, Intelligent Agents

Model-based function approximation in reinforcement learning

13 years 10 months ago

Download userweb.cs.utexas.edu

Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains di...

Nicholas K. Jong, Peter Stone

SIGCOMM
1996
ACM

160 views, Communications

On the Relevance of Long-Range Dependence in Network Traffic

13 years 8 months ago

Download www.cs.unc.edu

There is much experimental evidence that network traffic processes exhibit ubiquitous properties of self-similarity and long-range dependence, i.e., of correlations over a wide ran...

Matthias Grossglauser, Jean-Chrysostome Bolot

JAIR
2011

187 views

A Monte-Carlo AIXI Approximation

12 years 11 months ago

Download www.hutter1.net

This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...

Joel Veness, Kee Siong Ng, Marcus Hutter, William ...
