Search Sciweavers | Sciweavers

656 search results - page 29 / 132

» Complexity of finite-horizon Markov decision process problem...

143

click to vote

ATMOS
2010

183views Optimization» more ATMOS 2010»

The Complexity of Integrating Routing Decisions in Public Transportation Models

15 years 25 days ago

Download drops.dagstuhl.de

To model and solve optimization problems arising in public transportation, data about the passengers is necessary and has to be included in the models in any phase of the planning...

Marie Schmidt, Anita Schöbel

claim paper

Read More »

116

click to vote

ICML
2004
IEEE

120views Machine Learning» more ICML 2004»

Utile distinction hidden Markov models

16 years 2 months ago

Download www.idsia.ch

This paper addresses the problem of constructing good action selection policies for agents acting in partially observable environments, a class of problems generally known as Part...

Daan Wierstra, Marco Wiering

claim paper

Read More »

110

click to vote

ICRA
2007
IEEE

134views Robotics» more ICRA 2007»

Grasping POMDPs

15 years 8 months ago

Download people.csail.mit.edu

Abstract— We provide a method for planning under uncertainty for robotic manipulation by partitioning the conﬁguration space into a set of regions that are closed under complia...

Kaijen Hsiao, Leslie Pack Kaelbling, Tomás ...

claim paper

Read More »

Voted

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 6 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

113

Voted

NIPS
2004

112views Information Technology» more NIPS 2004»

Learning first-order Markov models for control

15 years 3 months ago

Download books.nips.cc

First-order Markov models have been successfully applied to many problems, for example in modeling sequential data using Markov chains, and modeling control problems using the Mar...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

« Prev « First page 29 / 132 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers