action spaces | Sciweavers

AAAI
2012

Planning in Factored Action Spaces with Symbolic Dynamic Programming

11 years 6 months ago

We consider symbolic dynamic programming (SDP) for solving Markov Decision Processes (MDP) with factored state and action spaces, where both states and actions are described by se...

Aswin Raghavan, Saket Joshi, Alan Fern, Prasad Tad...

RAS
2010

Probabilistic Policy Reuse for inter-task transfer learning

13 years 2 months ago

Download scalab.uc3m.es

Policy Reuse is a reinforcement learning technique that efficiently learns a new policy by using past similar learned policies. The Policy Reuse learner improves its exploration b...

Fernando Fernández, Javier García, M...

ORL
2006

A note on two-person zero-sum communicating stochastic games

13 years 4 months ago

Download www.rci.rutgers.edu

For undiscounted two-person zero-sum communicating stochastic games with finite state and action spaces, a solution procedure is proposed that exploits the communication property,...

Zeynep Müge Avsar, Melike Baykal-Gursoy

AUTOMATICA
2007

Simulation-based optimal sensor scheduling with application to observer trajectory planning

13 years 4 months ago

Download www.cs.ubc.ca

The sensor scheduling problem can be formulated as a controlled hidden Markov model and this paper solves the problem when the state, observation and action spaces are continuous....

Sumeetpal S. Singh, Nikolaos Kantas, Ba-Ngu Vo, Ar...

NIPS
2007

Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods

13 years 5 months ago

Download books.nips.cc

Learning in real-world domains often requires dealing with continuous state and action spaces. Although many solutions have been proposed to apply Reinforcement Learning algorithm...

Alessandro Lazaric, Marcello Restelli, Andrea Bona...

NIPS
2008

Fitted Q-iteration by Advantage Weighted Regression

13 years 5 months ago

Download www.kyb.mpg.de

Recently, fitted Q-iteration (FQI) based methods have become more popular due to their increased sample efficiency, a more stable learning process and the higher quality of the re...

Gerhard Neumann, Jan Peters

SIGECOM
2006
ACM

Implementation with a bounded action space

13 years 10 months ago

Download www.cs.huji.ac.il

While traditional mechanism design typically assumes isomorphism between the agents’ type- and action spaces, in many situations the agents face strict restrictions on their act...

Liad Blumrosen, Michal Feldman
