Sciweavers

» Cooperative Learning in Simulation
ML
2002
ACM
A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes
An issue that is critical for the application of Markov decision processes (MDPs) to realistic problems is how the complexity of planning scales with the size of the MDP. In stochas...
Michael J. Kearns, Yishay Mansour, Andrew Y. Ng
NN
2002
Springer
Neuromodulation and plasticity in an autonomous robot
In this paper we implement a computational model of a neuromodulatory system in an autonomous robot. The output of the neuromodulatory system acts as a value signal, modulating wi...
Olaf Sporns, William H. Alexander
CORR
2010
Springer
Structure-Aware Stochastic Control for Transmission Scheduling
In this report, we consider the problem of real-time transmission scheduling over time-varying channels. We first formulate the transmission scheduling problem as a Markov decisio...
Fangwen Fu, Mihaela van der Schaar
ICRA
2010
IEEE
Adaptive multi-robot coordination: A game-theoretic perspective
Multi-robot systems researchers have been investigating adaptive coordination methods for improving spatial coordination in teams. Such methods adapt the coordination method to th...
Gal A. Kaminka, Dan Erusalimchik, Sarit Kraus
PKDD
2010
Springer
Exploration in Relational Worlds
One of the key problems in model-based reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large relational domains, in wh...
Tobias Lang, Marc Toussaint, Kristian Kersting