Search Sciweavers | Sciweavers

226 search results - page 1 / 46

JAIR
2011

A Monte-Carlo AIXI Approximation

14 years 9 months ago

Download www.hutter1.net

This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...

Joel Veness, Kee Siong Ng, Marcus Hutter, William ...

AGI
2008

Artificial Intelligence

A computational approximation to the AIXI model

15 years 4 months ago

Download www.agiri.org

Universal induction solves in principle the problem of choosing a prior to achieve optimal inductive inference. The AIXI theory, which combines control theory and universal induct...

Sergey Pankov

AAAI
2010

Intelligent Agents

Reinforcement Learning via AIXI Approximation

15 years 4 months ago

Download jveness.info

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian...

Joel Veness, Kee Siong Ng, Marcus Hutter, David Si...

ICCS
2007
Springer

Applied Computing

Complexity of Monte Carlo Algorithms for a Class of Integral Equations

15 years 9 months ago

Download parallel.bas.bg

In this work we study the computational complexity of a class of grid Monte Carlo algorithms for integral equations. The idea of the algorithms consists in an approximation of the ...

Ivan Dimov, Rayna Georgieva

ML
2007
ACM

Machine Learning

Annealing stochastic approximation Monte Carlo algorithm for neural network training

15 years 2 months ago

Download www.stat.tamu.edu

We propose a general-purpose stochastic optimization algorithm, the so-called annealing stochastic approximation Monte Carlo (ASAMC) algorithm, for neural network training. ASAMC c...

Faming Liang
