Search Sciweavers | Sciweavers

162 search results - page 9 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...

167

click to vote

AAAI
2006

146views Intelligent Agents» more AAAI 2006»

Incremental Least Squares Policy Iteration for POMDPs

15 years 7 months ago

Download www.aaai.org

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin

claim paper

Read More »

180

click to vote

JAIR
2006

160views more JAIR 2006»

Anytime Point-Based Approximations for Large POMDPs

15 years 6 months ago

Download www.jair.org

The Partially Observable Markov Decision Process has long been recognized as a rich framework for real-world planning and control problems, especially in robotics. However exact s...

Joelle Pineau, Geoffrey J. Gordon, Sebastian Thrun

claim paper

Read More »

149

Voted

ATAL
2010
Springer

141views Intelligent Agents» more ATAL 2010»

Risk-sensitive planning in partially observable environments

15 years 7 months ago

Download www.aamas-conference.org

Partially Observable Markov Decision Process (POMDP) is a popular framework for planning under uncertainty in partially observable domains. Yet, the POMDP model is riskneutral in ...

Janusz Marecki, Pradeep Varakantham

claim paper

Read More »

165

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

16 years 18 days ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

185

click to vote

TCOM
2011

114views more TCOM 2011»

Iterative Receivers Based on Subblock Processing for Phase Noise Compensation in OFDM Systems

15 years 29 days ago

Download csc.postech.ac.kr

—An iterative algorithm employing decision feedback provided by either an equalizer or a channel decoder is proposed in order to compensate for the phase noise resulting from imp...

Myung-Kyu Lee, Kyeongcheol Yang, Kyungwhoon Cheun

claim paper

Read More »

« Prev « First page 9 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers