Search Sciweavers | Sciweavers

515 search results - page 25 / 103

» Approximating Markov Processes by Averaging

ICML
2005
IEEE

133 views, Machine Learning

A theoretical analysis of Model-Based Interval Estimation

16 years 2 months ago

Several algorithms for learning near-optimal policies in Markov Decision Processes have been analyzed and proven efficient. Empirical results have suggested that Model-based Inter...

Alexander L. Strehl, Michael L. Littman

ICML
2007
IEEE

139 views, Machine Learning

Most likely heteroscedastic Gaussian process regression

16 years 2 months ago

This paper presents a novel Gaussian process (GP) approach to regression with input-dependent noise rates. We follow Goldberg et al.'s approach and model the noise variance us...

Kristian Kersting, Christian Plagemann, Patrick Pf...

ANSS
1996
IEEE

134 views, Modeling and Simulation

Computation of the Asymptotic Bias and Variance for Simulation of Markov Reward Models

15 years 6 months ago

The asymptotic bias and variance are important determinants of the quality of a simulation run. In particular, the asymptotic bias can be used to approximate the bias introduced b...

Aad P. A. van Moorsel, Latha A. Kant, William H. S...

SARA
2005
Springer

102 views, Artificial Intelligence

Feature-Discovering Approximate Value Iteration Methods

15 years 7 months ago

Sets of features in Markov decision processes can play a critical role in approximately representing value and in abstracting the state space. Selection of features is crucial to the succe...

Jia-Hong Wu, Robert Givan

ICML
2000
IEEE

126 views, Machine Learning

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 2 months ago

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled POMDPs. We introduce GPOMDP, a...

Jonathan Baxter, Peter L. Bartlett
