Search Sciweavers | Sciweavers

46 search results - page 2 / 10

» Discretization of Continuous Action Spaces in Extensive-Form...

click to vote

CDC
2009
IEEE

173views Control Systems» more CDC 2009»

Sequentially updated Probability Collectives

13 years 9 months ago

Download www.maths.bris.ac.uk

— Multi-agent coordination problems can be cast as distributed optimization tasks. Probability Collectives (PCs) are techniques that deal with such problems in discrete and conti...

Michalis Smyrnakis, David S. Leslie

claim paper

Read More »

click to vote

ICML
2009
IEEE

194views Machine Learning» more ICML 2009»

Binary action search for learning continuous-action control policies

14 years 5 months ago

Download www.intelligence.tuc.gr

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis

claim paper

Read More »

click to vote

AAAI
1998

150views Intelligent Agents» more AAAI 1998»

Tree Based Discretization for Continuous State Space Reinforcement Learning

13 years 6 months ago

Download www.cs.cmu.edu

Reinforcement learning is an effective technique for learning action policies in discrete stochastic environments, but its efficiency can decay exponentially with the size of the ...

William T. B. Uther, Manuela M. Veloso

claim paper

Read More »

click to vote

ICFEM
2009
Springer

115views Software Engineering» more ICFEM 2009»

Qualitative Action Systems

13 years 11 months ago

Download www.ist.tugraz.at

An extension to action systems is presented facilitating the modeling of continuous behavior in the discrete domain. The original action system formalism has been developed by Back...

Bernhard K. Aichernig, Harald Brandl, Willibald Kr...

claim paper

Read More »

click to vote

AIPS
2011

233views Artificial Intelligence» more AIPS 2011»

Sample-Based Planning for Continuous Action Markov Decision Processes

12 years 8 months ago

Download www.chrismansley.com

In this paper, we present a new algorithm that integrates recent advances in solving continuous bandit problems with sample-based rollout methods for planning in Markov Decision P...

Christopher R. Mansley, Ari Weinstein, Michael L. ...

claim paper

Read More »

« Prev « First page 2 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers