Search Sciweavers | Sciweavers

397 search results - page 20 / 80

» Reinforcement Learning with Hierarchies of Machines

click to vote

ICML
2007
IEEE

146views Machine Learning» more ICML 2007»

Mixtures of hierarchical topics with Pachinko allocation

16 years 19 days ago

Download www.machinelearning.org

The four-level pachinko allocation model (PAM) (Li & McCallum, 2006) represents correlations among topics using a DAG structure. It does not, however, represent a nested hiera...

David M. Mimno, Wei Li, Andrew McCallum

claim paper

Read More »

100

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

16 years 19 days ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

click to vote

COLT
2008
Springer

132views Machine Learning» more COLT 2008»

Adaptive Aggregation for Reinforcement Learning with Efficient Exploration: Deterministic Domains

15 years 1 months ago

Download colt2008.cs.helsinki.fi

We propose a model-based learning algorithm, the Adaptive Aggregation Algorithm (AAA), that aims to solve the online, continuous state space reinforcement learning problem in a de...

Andrey Bernstein, Nahum Shimkin

claim paper

Read More »

115

click to vote

ECML
2003
Springer

149views Machine Learning» more ECML 2003»

Could Active Perception Aid Navigation of Partially Observable Grid Worlds?

15 years 5 months ago

Download homepages.inf.ed.ac.uk

Due to the unavoidable fact that a robot’s sensors will be limited in some manner, it is entirely possible that it can ﬁnd itself unable to distinguish between diﬀering state...

Paul A. Crook, Gillian Hayes

claim paper

Read More »

110

click to vote

COR
2008

142views more COR 2008»

Application of reinforcement learning to the game of Othello

14 years 12 months ago

Download www.cs.uu.nl

Operations research and management science are often confronted with sequential decision making problems with large state spaces. Standard methods that are used for solving such c...

Nees Jan van Eck, Michiel C. van Wezel

claim paper

Read More »

« Prev « First page 20 / 80 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers