Sciweavers

40 search results - page 6 / 8
» Learning Partially Observable Action Schemas
Sort
View
ALT
2005
Springer
14 years 2 months ago
Defensive Universal Learning with Experts
This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedba...
Jan Poland, Marcus Hutter
ML
2006
ACM
113views Machine Learning» more  ML 2006»
13 years 5 months ago
Learning to bid in bridge
Bridge bidding is considered to be one of the most difficult problems for game-playing programs. It involves four agents rather than two, including a cooperative agent. In additio...
Asaf Amit, Shaul Markovitch
TSMC
2008
132views more  TSMC 2008»
13 years 5 months ago
Ensemble Algorithms in Reinforcement Learning
This paper describes several ensemble methods that combine multiple different reinforcement learning (RL) algorithms in a single agent. The aim is to enhance learning speed and fin...
Marco A. Wiering, Hado van Hasselt
AIIDE
2009
13 years 3 months ago
IMPLANT: An Integrated MDP and POMDP Learning AgeNT for Adaptive Games
This paper proposes an Integrated MDP and POMDP Learning AgeNT (IMPLANT) architecture for adaptation in modern games. The modern game world basically involves a human player actin...
Chek Tien Tan, Ho-Lun Cheng
ATAL
2010
Springer
13 years 6 months ago
Closing the learning-planning loop with predictive state representations
A central problem in artificial intelligence is to choose actions to maximize reward in a partially observable, uncertain environment. To do so, we must learn an accurate model of ...
Byron Boots, Sajid M. Siddiqi, Geoffrey J. Gordon