State Space | Sciweavers

35

NIPS
2003

105views Information Technology» more NIPS 2003»

Gaussian Processes in Reinforcement Learning

13 years 10 months ago

We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learning in continuous state spaces and discrete time. We demonstrate how the GP mod...

Carl Edward Rasmussen, Malte Kuss

claim paper

Read More »

26

click to vote

NIPS
2003

158views Information Technology» more NIPS 2003»

Envelope-based Planning in Relational MDPs

13 years 10 months ago

Download books.nips.cc

A mobile robot acting in the world is faced with a large amount of sensory data and uncertainty in its action outcomes. Indeed, almost all interesting sequential decision-making d...

Natalia Hernandez-Gardiol, Leslie Pack Kaelbling

claim paper

Read More »

29

click to vote

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

13 years 10 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

30

click to vote

FLAIRS
2004

140views Artificial Intelligence» more FLAIRS 2004»

State Space Reduction For Hierarchical Reinforcement Learning

13 years 10 months ago

Download ranger.uta.edu

er provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as -reduction, ...

Mehran Asadi, Manfred Huber

claim paper

Read More »

23

click to vote

CASCON
2006

98views Education» more CASCON 2006»

A lightweight approach to state based security testing

13 years 10 months ago

Download post.queensu.ca

State based protocols are protocols in which the handling of one message depends on the contents of previous messages. Testing such protocols, for security or for other purposes u...

Songtao Zhang, Thomas R. Dean, Scott Knight

claim paper

Read More »

33

click to vote

AIIDE
2006

123views Artificial Intelligence» more AIIDE 2006»

The Self Organization of Context for Learning in MultiAgent Games

13 years 10 months ago

Download www.aaai.org

Reinforcement learning is an effective machine learning paradigm in domains represented by compact and discrete state-action spaces. In high-dimensional and continuous domains, ti...

Christopher D. White, Dave Brogan

claim paper

Read More »

33

click to vote

AWPN
2008

273views Algorithms» more AWPN 2008»

An Approach to Tackle Livelock-Freedom in SOA

13 years 10 months ago

Download wwwteo.informatik.uni-rostock.de

We calculate a fixed finite set of state space fragments for a service P, where each fragment carries a part of the whole behavior of P. By composing these fragments according to t...

Christian Stahl, Karsten Wolf

claim paper

Read More »

27

click to vote

COLT
2008
Springer

132views Machine Learning» more COLT 2008»

Adaptive Aggregation for Reinforcement Learning with Efficient Exploration: Deterministic Domains

13 years 11 months ago

Download colt2008.cs.helsinki.fi

We propose a model-based learning algorithm, the Adaptive Aggregation Algorithm (AAA), that aims to solve the online, continuous state space reinforcement learning problem in a de...

Andrey Bernstein, Nahum Shimkin

claim paper

Read More »

30

click to vote

ASMTA
2008
Springer

167views Mathematics» more ASMTA 2008»

Perfect Simulation of Stochastic Automata Networks

13 years 11 months ago

Download lacl.univ-paris12.fr

The solution of continuous and discrete-time Markovian models is still challenging mainly when we model large complex systems, for example, to obtain performance indexes of paralle...

Paulo Fernandes, Jean-Marc Vincent, Thais Webber

claim paper

Read More »

30

click to vote

APN
2008
Springer

127views Artificial Intelligence» more APN 2008»

Symbolic State Space of Stopwatch Petri Nets with Discrete-Time Semantics (Theory Paper)

13 years 11 months ago

Download pagesperso-systeme.lip6.fr

In this paper, we address the class of bounded Petri nets with stopwatches (SwPNs), which is an extension of T-time Petri nets (TPNs) where time is associated with transitions. Con...

Morgan Magnin, Didier Lime, Olivier H. Roux

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers