Sciweavers

4446 search results - page 68 / 890
» Learning Observer Agents
Sort
View
AAAI
1994
14 years 11 months ago
Acting Optimally in Partially Observable Stochastic Domains
In this paper, we describe the partially observable Markov decision process pomdp approach to nding optimal or near-optimal control strategies for partially observable stochastic ...
Anthony R. Cassandra, Leslie Pack Kaelbling, Micha...
AAAI
2004
14 years 11 months ago
Repeated Observation Models
Repetition is an important phenomenon in a variety of domains, such as music, computer programs and architectural drawings. A generative model for these domains should account for...
Avi Pfeffer
AAAI
2011
13 years 9 months ago
Risk-Averse Strategies for Security Games with Execution and Observational Uncertainty
Attacker-defender Stackelberg games have become a popular game-theoretic approach for security with deployments for LAX Police, the FAMS and the TSA. Unfortunately, most of the ex...
Zhengyu Yin, Manish Jain, Milind Tambe, Fernando O...
BIOADIT
2004
Springer
15 years 3 months ago
Autonomous Acquisition of the Meaning of Sensory States Through Sensory-Invariance Driven Action
Abstract. How can artificial or natural agents autonomously gain understanding of its own internal (sensory) state? This is an important question not just for physically embodied ...
Yoonsuck Choe, S. Kumar Bhamidipati
ATAL
2008
Springer
14 years 11 months ago
On the usefulness of opponent modeling: the Kuhn Poker case study
The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...
Alessandro Lazaric, Mario Quaresimale, Marcello Re...