Search Sciweavers | Sciweavers

52 search results - page 8 / 11

» Error Bounds for Approximate Policy Iteration

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 29 days ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

115

click to vote

CORR
2011
Springer

210views Education» more CORR 2011»

Statistical Compressed Sensing of Gaussian Mixture Models

14 years 6 months ago

Download www.cmap.polytechnique.fr

A novel framework of compressed sensing, namely statistical compressed sensing (SCS), that aims at efﬁciently sampling a collection of signals that follow a statistical distribu...

Guoshen Yu, Guillermo Sapiro

claim paper

Read More »

109

click to vote

PKDD
2009
Springer

184views Data Mining» more PKDD 2009»

Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm

15 years 4 months ago

Download www.lri.fr

Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...

Philippe Rolet, Michèle Sebag, Olivier Teyt...

claim paper

Read More »

106

click to vote

AAAI
2008

123views Intelligent Agents» more AAAI 2008»

Towards Faster Planning with Continuous Resources in Stochastic Domains

15 years 1 months ago

Download www.aaai.org

Agents often have to construct plans that obey resource limits for continuous resources whose consumption can only be characterized by probability distributions. While Markov Deci...

Janusz Marecki, Milind Tambe

claim paper

Read More »

104

Voted

CISS
2008
IEEE

146views Information Technology» more CISS 2008»

Appropriate control of wireless networks with flow level dynamics

15 years 6 months ago

Download www.princeton.edu

Abstract— We consider the network control problem for wireless networks with ﬂow level dynamics under the general k-hop interference model. In particular, we investigate the co...

Long Le, Ravi R. Mazumdar

claim paper

Read More »

« Prev « First page 8 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers