Search Sciweavers | Sciweavers

55 search results - page 4 / 11

» Approximate Policy Iteration using Large-Margin Classifiers

AAAI
2006

157 views, Intelligent Agents

Improving Approximate Value Iteration Using Memories and Predictive State Representations

13 years 6 months ago

Download www.aaai.org

Planning in partially-observable dynamical systems is a challenging problem, and recent developments in point-based techniques such as Perseus significantly improve performance as...

Michael R. James, Ton Wessling, Nikos A. Vlassis

NIPS
2000

121 views, Information Technology

APRICODD: Approximate Policy Construction Using Decision Diagrams

13 years 6 months ago

Download www.cs.ubc.ca

We propose a method of approximate dynamic programming for Markov decision processes (MDPs) using algebraic decision diagrams (ADDs). We produce near-optimal value functions and p...

Robert St-Aubin, Jesse Hoey, Craig Boutilier

ATAL
2009
Springer

146 views, Intelligent Agents

Online exploration in least-squares policy iteration

13 years 12 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

ICANN
2007
Springer

103 views, Neural Networks

Resilient Approximation of Kernel Classifiers

13 years 9 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

Trained support vector machines (SVMs) have a slow runtime classification speed if the classification problem is noisy and the sample data set is large. Approximating the...

Thorsten Suttorp, Christian Igel

CORR
2010
Springer

119 views, Education

Dynamic Policy Programming

13 years 5 months ago

Download www.snn.ru.nl

In this paper, we consider the problem of planning and learning in the infinite-horizon discounted-reward Markov decision problems. We propose a novel iterative direct policysearc...

Mohammad Gheshlaghi Azar, Hilbert J. Kappen
