Search Sciweavers | Sciweavers

1310 search results - page 31 / 262

» Progressive Optimization in Action

147

click to vote

IJCAI
1993

111views Artificial Intelligence» more IJCAI 1993»

Anytime Sensing Planning and Action: A Practical Model for Robot Control

15 years 5 months ago

Download anytime.cs.umass.edu

Anytime algorithms, whose quality of results improves gradually as computation time increases, provide useful performance components for timecritical planning and control of robot...

Shlomo Zilberstein, Stuart J. Russell

claim paper

Read More »

148

click to vote

CORR
2012
Springer

210views Education» more CORR 2012»

Towards minimax policies for online linear optimization with bandit feedback

14 years 6 hour ago

Download www.princeton.edu

We address the online linear optimization problem with bandit feedback. Our contribution is twofold. First, we provide an algorithm (based on exponential weights) with a regret of...

Sébastien Bubeck, Nicolò Cesa-Bianch...

claim paper

Read More »

122

click to vote

AAMAS
2005
Springer

126views Intelligent Agents» more AAMAS 2005»

Learning to Coordinate Using Commitment Sequences in Cooperative Multi-agent Systems

15 years 9 months ago

Download como.vub.ac.be

We report on an investigation of the learning of coordination in cooperative multi-agent systems. Speciﬁcally, we study solutions that are applicable to independent agents i.e. ...

Spiros Kapetanakis, Daniel Kudenko, Malcolm J. A. ...

claim paper

Read More »

149

click to vote

BDA
2007

126views Knowledge Management» more BDA 2007»

Towards Action-Oriented Continuous Queries in Pervasive Systems

15 years 5 months ago

Download liris.cnrs.fr

Pervasive information systems give an overview of what digital environments should look like in the future. From a data-centric point of view, traditional databases have to be use...

Yann Gripay, Frédérique Laforest, Je...

claim paper

Read More »

151

click to vote

ICANN
2005
Springer

151views Neural Networks» more ICANN 2005»

Reinforcement Learning in MirrorBot

15 years 9 months ago

Download fias.uni-frankfurt.de

For this special session of EU projects in the area of NeuroIT, we will review the progress of the MirrorBot project with special emphasis on its relation to reinforcement learning...

Cornelius Weber, David Muse, Mark Elshaw, Stefan W...

claim paper

Read More »

« Prev « First page 31 / 262 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers