Search Sciweavers | Sciweavers

238 search results - page 36 / 48

» Value-Function Approximations for Partially Observable Marko...

197

click to vote

GLOBECOM
2010
IEEE

244views Communications» more GLOBECOM 2010»

Maximize Secondary User Throughput via Optimal Sensing in Multi-Channel Cognitive Radio Networks

15 years 3 months ago

Download www3.ntu.edu.sg

In a cognitive radio network, the full-spectrum is usually divided into multiple channels. However, due to the hardware and energy constraints, a cognitive user (also called second...

Shimin Gong, Ping Wang, Wei Liu, Wei Yuan

claim paper

Read More »

147

click to vote

ICML
2008
IEEE

122views Machine Learning» more ICML 2008»

Reinforcement learning in the presence of rare events

16 years 6 months ago

Download www.ece.mcgill.ca

We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...

Jordan Frank, Shie Mannor, Doina Precup

claim paper

Read More »

149

click to vote

ATAL
2007
Springer

94views Intelligent Agents» more ATAL 2007»

Graphical models for online solutions to interactive POMDPs

15 years 11 months ago

Download www.cs.uga.edu

We develop a new graphical representation for interactive partially observable Markov decision processes (I-POMDPs) that is significantly more transparent and semantically clear t...

Prashant Doshi, Yifeng Zeng, Qiongyu Chen

claim paper

Read More »

151

click to vote

ECAI
2008
Springer

114views Artificial Intelligence» more ECAI 2008»

A hybrid approach to multi-agent decision-making

15 years 6 months ago

Download www.deetc.isel.ipl.pt

Abstract. In the aftermath of a large-scale disaster, agents’ decisions derive from self-interested (e.g. survival), common-good (e.g. victims’ rescue) and teamwork (e.g. ﬁre...

Paulo Trigo, Helder Coelho

claim paper

Read More »

171

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 6 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

« Prev « First page 36 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers