Search Sciweavers | Sciweavers

» One-Counter Markov Decision Processes

ICPR
2004
IEEE

Complex Human Activity Recognition for Monitoring Wide Outdoor Environments

16 years 1 months ago

Download www.issia.cnr.it

The problem of automatic recognition of human activities is among the most important and challenging open areas of research in Computer Vision. This paper presents a new approach ...

Arcangelo Distante, I. Gnoni, Marco Leo, Paolo Spa...

ICML
2009
IEEE

Predictive representations for policy gradient in POMDPs

16 years 1 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

ICML
2007
IEEE

Learning state-action basis functions for hierarchical MDPs

16 years 1 months ago

Download www.machinelearning.org

This paper introduces a new approach to action-value function approximation by learning basis functions from a spectral decomposition of the state-action manifold. This paper exten...

Sarah Osentoski, Sridhar Mahadevan

ICML
2007
IEEE

Conditional random fields for multi-agent reinforcement learning

16 years 1 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models of the probability of labels given the observations. They have traditionally been trained using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...

ICML
2007
IEEE

Multi-task reinforcement learning: a hierarchical Bayesian approach

16 years 1 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...
