Sciweavers

» Learning Partially Observable Action Schemas
COLT
2003
Springer
15 years 6 months ago
On-Line Learning with Imperfect Monitoring
We study on-line play of repeated matrix games in which the observations of past actions of the other player and the obtained reward are partial and stochastic. We define the Part...
Shie Mannor, Nahum Shimkin
AI
2007
Springer
15 years 1 months ago
Learning action models from plan examples using weighted MAX-SAT
AI planning requires the definition of action models using a formal action and plan description language, such as the standard Planning Domain Definition Language (PDDL), as inp...
Qiang Yang, Kangheng Wu, Yunfei Jiang
ICML
2008
IEEE
16 years 1 months ago
Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs
Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...
Finale Doshi, Joelle Pineau, Nicholas Roy
AIPS
2004
15 years 2 months ago
Statistical Goal Parameter Recognition
We present components of a system which uses statistical, corpus-based machine learning techniques to perform instantiated goal recognition -- recognition of both a goal schema an...
Nate Blaylock, James F. Allen
ECML
2007
Springer
15 years 7 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber