Sciweavers

682 search results - page 121 / 137
» One-Counter Markov Decision Processes
Sort
View
89
Voted
ACMACE
2008
ACM
15 years 2 months ago
AIRSF: a new entertainment adaptive framework for stress free air travels
In this paper, we present a new entertainment adaptive framework AIRSF for stress free air travels. Based on the passenger's current and target comfort states, user entertain...
Hao Liu, Jun Hu, Matthias Rauterberg
ATAL
2008
Springer
15 years 2 months ago
MB-AIM-FSI: a model based framework for exploiting gradient ascent multiagent learners in strategic interactions
Future agent applications will increasingly represent human users autonomously or semi-autonomously in strategic interactions with similar entities. Hence, there is a growing need...
Doran Chakraborty, Sandip Sen
89
Voted
ATAL
2008
Springer
15 years 2 months ago
Expediting RL by using graphical structures
The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...
Peng Dai, Alexander L. Strehl, Judy Goldsmith
82
Voted
AAAI
2010
15 years 2 months ago
Structured Parameter Elicitation
The behavior of a complex system often depends on parameters whose values are unknown in advance. To operate effectively, an autonomous agent must actively gather information on t...
Li Ling Ko, David Hsu, Wee Sun Lee, Sylvie C. W. O...
AAAI
2010
15 years 2 months ago
Symbolic Dynamic Programming for First-order POMDPs
Partially-observable Markov decision processes (POMDPs) provide a powerful model for sequential decision-making problems with partially-observed state and are known to have (appro...
Scott Sanner, Kristian Kersting