Search Sciweavers | Sciweavers

40 search results - page 3 / 8

» Learning Partially Observable Action Schemas

COLT
2003
Springer

141 views | Machine Learning

On-Line Learning with Imperfect Monitoring

13 years 10 months ago

Download www.ece.mcgill.ca

We study on-line play of repeated matrix games in which the observations of past actions of the other player and the obtained reward are partial and stochastic. We define the Part...

Shie Mannor, Nahum Shimkin

AI
2007
Springer

181 views | Artificial Intelligence

Learning action models from plan examples using weighted MAX-SAT

13 years 5 months ago

Download www.cs.ust.hk

AI planning requires the definition of action models using a formal action and plan description language, such as the standard Planning Domain Definition Language (PDDL), as inp...

Qiang Yang, Kangheng Wu, Yunfei Jiang

ICML
2008
IEEE

135 views | Machine Learning

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

14 years 6 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

AIPS
2004

74 views | Artificial Intelligence

Statistical Goal Parameter Recognition

13 years 6 months ago

Download www.aaai.org

We present components of a system which uses statistical, corpus-based machine learning techniques to perform instantiated goal recognition -- recognition of both a goal schema an...

Nate Blaylock, James F. Allen

ECML
2007
Springer

192 views | Machine Learning

Policy Gradient Critics

13 years 11 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber
