Search Sciweavers | Sciweavers

75 search results - page 4 / 15

» A Predictive Model for Imitation Learning in Partially Obser...

click to vote

ECML
2005
Springer

120views Machine Learning» more ECML 2005»

Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

13 years 11 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...

Masoumeh T. Izadi, Doina Precup

claim paper

Read More »

click to vote

IJCNN
2006
IEEE

85views Neural Networks» more IJCNN 2006»

Learning to Segment Any Random Vector

14 years 18 hour ago

Download www.cs.helsinki.fi

— We propose a method that takes observations of a random vector as input, and learns to segment each observation into two disjoint parts. We show how to use the internal coheren...

Aapo Hyvärinen, Jukka Perkiö

claim paper

Read More »

click to vote

ICML
2004
IEEE

120views Machine Learning» more ICML 2004»

Utile distinction hidden Markov models

14 years 6 months ago

Download www.idsia.ch

This paper addresses the problem of constructing good action selection policies for agents acting in partially observable environments, a class of problems generally known as Part...

Daan Wierstra, Marco Wiering

claim paper

Read More »

click to vote

ICML
2009
IEEE

143views Machine Learning» more ICML 2009»

Proto-predictive representation of states with simple recurrent temporal-difference networks

14 years 6 months ago

Download www.snowelm.com

We propose a new neural network architecture, called Simple Recurrent Temporal-Difference Networks (SR-TDNs), that learns to predict future observations in partially observable en...

Takaki Makino

claim paper

Read More »

click to vote

ICML
2004
IEEE

123views Machine Learning» more ICML 2004»

Learning low dimensional predictive representations

14 years 6 months ago

Download www.cs.cmu.edu

Predictive state representations (PSRs) have recently been proposed as an alternative to partially observable Markov decision processes (POMDPs) for representing the state of a dy...

Matthew Rosencrantz, Geoffrey J. Gordon, Sebastian...

claim paper

Read More »

« Prev « First page 4 / 15 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers