Search Sciweavers | Sciweavers

187 search results - page 34 / 38

JMLR
2010

148 views

A Generalized Path Integral Control Approach to Reinforcement Learning

14 years 8 months ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

ICTAI
2009
IEEE

86 views, Artificial Intelligence

TiMDPpoly: An Improved Method for Solving Time-Dependent MDPs

14 years 11 months ago

Download www.montefiore.ulg.ac.be

We introduce TiMDPpoly, an algorithm designed to solve planning problems with durative actions, under probabilistic uncertainty, in a non-stationary, continuous-time context. Miss...

Emmanuel Rachelson, Patrick Fabiani, Fréd...

DATAMINE
2010

175 views

Extracting influential nodes on a social network for information diffusion

15 years 1 month ago

Download www.ar.sanken.osaka-u.ac.jp

We address the combinatorial optimization problem of finding the most influential nodes on a large-scale social network for two widely-used fundamental stochastic diffusion models...

Masahiro Kimura, Kazumi Saito, Ryohei Nakano, Hiro...

ATAL
2003
Springer

185 views, Intelligent Agents

Optimizing information exchange in cooperative multi-agent systems

15 years 6 months ago

Download rbr.cs.umass.edu

Decentralized control of a cooperative multi-agent system is the problem faced by multiple decision-makers that share a common set of objectives. The decision-makers may be robots...

Claudia V. Goldman, Shlomo Zilberstein

PKDD
2010
Springer

179 views, Data Mining

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

14 years 11 months ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone
