Search Sciweavers | Sciweavers

89 search results - page 14 / 18

» Sample-Based Planning for Continuous Action Markov Decision ...

Voted

CORR
2011
Springer

175views Education» more CORR 2011»

Adaptive Channel Recommendation for Dynamic Spectrum Access

14 years 6 months ago

Download home.ie.cuhk.edu.hk

—We propose a dynamic spectrum access scheme where secondary users recommend “good” channels to each other and access accordingly. We formulate the problem as an average rewa...

Xu Chen, Jianwei Huang, Husheng Li

claim paper

Read More »

Voted

ATAL
2009
Springer

134views Intelligent Agents» more ATAL 2009»

Improving adjustable autonomy strategies for time-critical domains

15 years 6 months ago

Download www.aamas-conference.org

As agents begin to perform complex tasks alongside humans as collaborative teammates, it becomes crucial that the resulting humanmultiagent teams adapt to time-critical domains. I...

Nathan Schurr, Janusz Marecki, Milind Tambe

claim paper

Read More »

103

Voted

ATAL
2009
Springer

103views Intelligent Agents» more ATAL 2009»

Lossless clustering of histories in decentralized POMDPs

15 years 6 months ago

Download www.science.uva.nl

Decentralized partially observable Markov decision processes (Dec-POMDPs) constitute a generic and expressive framework for multiagent planning under uncertainty. However, plannin...

Frans A. Oliehoek, Shimon Whiteson, Matthijs T. J....

claim paper

Read More »

101

Voted

ATAL
2007
Springer

129views Intelligent Agents» more ATAL 2007»

Subjective approximate solutions for decentralized POMDPs

15 years 5 months ago

Download www.cs.cmu.edu

A problem of planning for cooperative teams under uncertainty is a crucial one in multiagent systems. Decentralized partially observable Markov decision processes (DECPOMDPs) prov...

Anton Chechetka, Katia P. Sycara

claim paper

Read More »

Voted

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 1 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

« Prev « First page 14 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers