Sciweavers

1310 search results - page 31 / 262
» Progressive Optimization in Action
Sort
View
IJCAI
1993
15 years 3 months ago
Anytime Sensing Planning and Action: A Practical Model for Robot Control
Anytime algorithms, whose quality of results improves gradually as computation time increases, provide useful performance components for timecritical planning and control of robot...
Shlomo Zilberstein, Stuart J. Russell
CORR
2012
Springer
210views Education» more  CORR 2012»
13 years 9 months ago
Towards minimax policies for online linear optimization with bandit feedback
We address the online linear optimization problem with bandit feedback. Our contribution is twofold. First, we provide an algorithm (based on exponential weights) with a regret of...
Sébastien Bubeck, Nicolò Cesa-Bianch...
AAMAS
2005
Springer
15 years 7 months ago
Learning to Coordinate Using Commitment Sequences in Cooperative Multi-agent Systems
We report on an investigation of the learning of coordination in cooperative multi-agent systems. Specifically, we study solutions that are applicable to independent agents i.e. ...
Spiros Kapetanakis, Daniel Kudenko, Malcolm J. A. ...
BDA
2007
15 years 3 months ago
Towards Action-Oriented Continuous Queries in Pervasive Systems
Pervasive information systems give an overview of what digital environments should look like in the future. From a data-centric point of view, traditional databases have to be use...
Yann Gripay, Frédérique Laforest, Je...
ICANN
2005
Springer
15 years 7 months ago
Reinforcement Learning in MirrorBot
For this special session of EU projects in the area of NeuroIT, we will review the progress of the MirrorBot project with special emphasis on its relation to reinforcement learning...
Cornelius Weber, David Muse, Mark Elshaw, Stefan W...