Search Sciweavers | Sciweavers

178 search results - page 24 / 36

» Efficient Approximation of Optimal Control for Markov Games

146

click to vote

AAAI
2008

207views Intelligent Agents» more AAAI 2008»

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

15 years 8 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

claim paper

Read More »

204

click to vote

TON
2010

168views more TON 2010»

Engineering Wireless Mesh Networks: Joint Scheduling, Routing, Power Control, and Rate Adaptation

15 years 26 days ago

Download www3.ntu.edu.sg

Abstract--We present a number of significant engineering insights on what makes a good configuration for medium- to largesize wireless mesh networks (WMNs) when the objective funct...

Jun Luo, Catherine Rosenberg, André Girard

claim paper

Read More »

184

click to vote

NIPS
2008

165views Information Technology» more NIPS 2008»

Regularized Policy Iteration

15 years 7 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

172

click to vote

ESANN
2001

116views Neural Networks» more ESANN 2001»

Learning fault-tolerance in Radial Basis Function Networks

15 years 7 months ago

Download www.dice.ucl.ac.be

This paper describes a method of supervised learning based on forward selection branching. This method improves fault tolerance by means of combining information related to general...

Xavier Parra, Andreu Català

claim paper

Read More »

155

click to vote

AIPS
2008

111views Artificial Intelligence» more AIPS 2008»

Multiagent Planning Under Uncertainty with Stochastic Communication Delays

15 years 8 months ago

Download www.aaai.org

We consider the problem of cooperative multiagent planning under uncertainty, formalized as a decentralized partially observable Markov decision process (Dec-POMDP). Unfortunately...

Matthijs T. J. Spaan, Frans A. Oliehoek, Nikos A. ...

claim paper

Read More »

« Prev « First page 24 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers