Sciweavers

575 search results - page 85 / 115
» Reinforcement Learning State Estimator
Sort
View
DEXA
2004
Springer
159views Database» more  DEXA 2004»
15 years 1 months ago
Adaptive Double Routing Indices: Combining Effectiveness and Efficiency in P2P Systems
Unstructured peer-to-peer systems rely on strategies and data structures (Routing Indices) for the routing of requests in the network. For those requests corresponding to informati...
Stéphane Bressan, Achmad Nizar Hidayanto, C...
CDC
2008
IEEE
142views Control Systems» more  CDC 2008»
15 years 4 months ago
Convergence of rule-of-thumb learning rules in social networks
— We study the problem of dynamic learning by a social network of agents. Each agent receives a signal about an underlying state and communicates with a subset of agents (his nei...
Daron Acemoglu, Angelia Nedic, Asuman E. Ozdaglar
ATAL
2004
Springer
15 years 3 months ago
Unifying Temporal and Structural Credit Assignment Problems
Single-agent reinforcement learners in time-extended domains and multi-agent systems share a common dilemma known as the credit assignment problem. Multi-agent systems have the st...
Adrian K. Agogino, Kagan Tumer
ICRA
2010
IEEE
162views Robotics» more  ICRA 2010»
14 years 8 months ago
Adaptive multi-robot coordination: A game-theoretic perspective
Multi-robot systems researchers have been investigating adaptive coordination methods for improving spatial coordination in teams. Such methods adapt the coordination method to th...
Gal A. Kaminka, Dan Erusalimchik, Sarit Kraus
RAS
2010
164views more  RAS 2010»
14 years 8 months ago
Bridging the gap between feature- and grid-based SLAM
One important design decision for the development of autonomously navigating mobile robots is the choice of the representation of the environment. This includes the question which...
Kai M. Wurm, Cyrill Stachniss, Giorgio Grisetti