Search Sciweavers | Sciweavers

10 search results - page 2 / 2

» Learning Without State-Estimation in Partially Observable Ma...


CDC
2008
IEEE

197 views, Control Systems

Dynamic spectrum access policies for cognitive radio

13 years 11 months ago

Download www.ifp.illinois.edu

We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooper...

Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli


MOBICOM
2009
ACM

174 views, Communications

Interference management via rate splitting and HARQ over time-varying fading channels

13 years 11 months ago

Download web.njit.edu

The coexistence of two unlicensed links is considered, where one link interferes with the transmission of the other, over a time-varying, block-fading channel. In the absence of fa...

Marco Levorato, Osvaldo Simeone, Urbashi Mitra


ICASSP
2008
IEEE

215 views, Signal Processing

Bayesian update of dialogue state for robust dialogue systems

13 years 11 months ago

Download mi.eng.cam.ac.uk

This paper presents a new framework for accumulating beliefs in spoken dialogue systems. The technique is based on updating a Bayesian Network that represents the underlying state...

Blaise Thomson, Jost Schatzmann, Steve Young


GECCO
2009
Springer

162 views, Optimization

Uncertainty handling CMA-ES for reinforcement learning

13 years 2 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMA-ES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel


ECML
2007
Springer

192 views, Machine Learning

Policy Gradient Critics

13 years 11 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber
