Search Sciweavers | Sciweavers

248 search results - page 27 / 50

» Rate of Convergence for Constrained Stochastic Approximation...

184

click to vote

NIPS
2008

165views Information Technology» more NIPS 2008»

Regularized Policy Iteration

15 years 7 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

190

click to vote

INFOCOM
2007
IEEE

172views Communications» more INFOCOM 2007»

Tradeoff Between Lifetime and Rate Allocation in Wireless Sensor Networks: A Cross Layer Approach

16 years 16 days ago

Download awin.cs.ccu.edu.tw

— This paper studies the tradeoff between energy consumption and application performance in wireless sensor networks by investigating the interaction between network lifetime max...

Junhua Zhu, Shan Chen, Brahim Bensaou, Ka-Lok Hung

claim paper

Read More »

165

click to vote

NIPS
2008

109views Information Technology» more NIPS 2008»

Biasing Approximate Dynamic Programming with a Lower Discount Factor

15 years 7 months ago

Download hal.inria.fr

Most algorithms for solving Markov decision processes rely on a discount factor, which ensures their convergence. It is generally assumed that using an artificially low discount f...

Marek Petrik, Bruno Scherrer

claim paper

Read More »

160

click to vote

IAT
2006
IEEE

118views Intelligent Agents» more IAT 2006»

Using Prior Knowledge to Improve Distributed Hill Climbing

16 years 9 days ago

Download www.personal.utulsa.edu

The Distributed Probabilistic Protocol (DPP) is a new, approximate algorithm for solving Distributed Constraint Satisfaction Problems (DCSPs) that exploits prior knowledge to impr...

Roger Mailler

claim paper

Read More »

201

click to vote

MOC
2010

131views Security Privacy» more MOC 2010»

H(div) preconditioning for a mixed finite element formulation of the diffusion problem with random data

15 years 1 months ago

Download www.cs.umd.edu

We study H(div) preconditioning for the saddle-point systems that arise in a stochastic Galerkin mixed formulation of the steady-state diffusion problem with random data. The key i...

Howard C. Elman, Darran G. Furnival, Catherine E. ...

claim paper

Read More »

« Prev « First page 27 / 50 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers