Sciweavers

75 search results - page 4 / 15
» A Predictive Model for Imitation Learning in Partially Obser...
Sort
View
ECML
2005
Springer
13 years 11 months ago
Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes
Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...
Masoumeh T. Izadi, Doina Precup
IJCNN
2006
IEEE
14 years 18 hour ago
Learning to Segment Any Random Vector
— We propose a method that takes observations of a random vector as input, and learns to segment each observation into two disjoint parts. We show how to use the internal coheren...
Aapo Hyvärinen, Jukka Perkiö
ICML
2004
IEEE
14 years 6 months ago
Utile distinction hidden Markov models
This paper addresses the problem of constructing good action selection policies for agents acting in partially observable environments, a class of problems generally known as Part...
Daan Wierstra, Marco Wiering
ICML
2009
IEEE
14 years 6 months ago
Proto-predictive representation of states with simple recurrent temporal-difference networks
We propose a new neural network architecture, called Simple Recurrent Temporal-Difference Networks (SR-TDNs), that learns to predict future observations in partially observable en...
Takaki Makino
ICML
2004
IEEE
14 years 6 months ago
Learning low dimensional predictive representations
Predictive state representations (PSRs) have recently been proposed as an alternative to partially observable Markov decision processes (POMDPs) for representing the state of a dy...
Matthew Rosencrantz, Geoffrey J. Gordon, Sebastian...