Search Sciweavers | Sciweavers

371 search results - page 70 / 75

» The Complexity of Decentralized Control of Markov Decision P...

102

click to vote

ATAL
2009
Springer

149views Intelligent Agents» more ATAL 2009»

Boolean combinations of weighted voting games

15 years 4 months ago

Download www.csc.liv.ac.uk

Weighted voting games are a natural and practically important class of simple coalitional games, in which each agent is assigned a numeric weight, and a coalition is deemed to be ...

Piotr Faliszewski, Edith Elkind, Michael Wooldridg...

claim paper

Read More »

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

15 years 10 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

106

Voted

IJRR
2011

218views more IJRR 2011»

Motion planning under uncertainty for robotic tasks with long time horizons

14 years 4 months ago

Download deslab.mit.edu

Abstract Partially observable Markov decision processes (POMDPs) are a principled mathematical framework for planning under uncertainty, a crucial capability for reliable operation...

Hanna Kurniawati, Yanzhu Du, David Hsu, Wee Sun Le...

claim paper

Read More »

click to vote

QEST
2007
IEEE

103views Modeling and Simulation» more QEST 2007»

GRIP: Generic Representatives in PRISM

15 years 3 months ago

Download qav.comlab.ox.ac.uk

We give an overview of GRIP, a symmetry reduction tool for the probabilistic model checker PRISM, together with experimental results for a selection of example speciﬁcations. 1 ...

Alastair F. Donaldson, Alice Miller, David Parker

claim paper

Read More »

click to vote

LION
2007
Springer

192views Optimization» more LION 2007»

Learning While Optimizing an Unknown Fitness Surface

15 years 3 months ago

Download www.science.unitn.it

This paper is about Reinforcement Learning (RL) applied to online parameter tuning in Stochastic Local Search (SLS) methods. In particular a novel application of RL is considered i...

Roberto Battiti, Mauro Brunato, Paolo Campigotto

claim paper

Read More »

« Prev « First page 70 / 75 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers