Sciweavers

656 search results - page 104 / 132
» Complexity of finite-horizon Markov decision process problem...
ICC
2008
IEEE
15 years 8 months ago
An MDP-Based Approach for Multipath Data Transmission over Wireless Networks
Maintaining performance and reliability in wireless networks is a challenging task due to the nature of wireless channels. Multipath data transmission has been used in wired sce...
Vinh Bui, Weiping Zhu, Alessio Botta, Antonio Pesc...
ECML
2007
Springer
15 years 8 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
ICDM
2003
IEEE
15 years 7 months ago
Mining Plans for Customer-Class Transformation
We consider the problem of mining high-utility plans from historical plan databases that can be used to transform customers from one class to other, more desirable classes. Tradit...
Qiang Yang, Hong Cheng
ICRA
2003
IEEE
15 years 7 months ago
Local exploration: online algorithms and a probabilistic framework
Mapping an environment with an imaging sensor becomes very challenging if the environment to be mapped is unknown and has to be explored. Exploration involves the planning of v...
Volkan Isler, Sampath Kannan, Kostas Daniilidis
ATAL
2006
Springer
15 years 5 months ago
Solving POMDPs using quadratically constrained linear programs
Developing scalable algorithms for solving partially observable Markov decision processes (POMDPs) is an important challenge. One promising approach is based on representing POMDP...
Christopher Amato, Daniel S. Bernstein, Shlomo Zil...