Sciweavers

17582 search results - page 3126 / 3517
» From Distributed Sequential Computing to Distributed Paralle...
Sort
View
IJCAI
2003
15 years 6 months ago
Covariant Policy Search
We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...
J. Andrew Bagnell, Jeff G. Schneider
SODA
2004
ACM
98views Algorithms» more  SODA 2004»
15 years 6 months ago
End-to-end packet-scheduling in wireless ad-hoc networks
Abstract Packet-scheduling is a particular challenge in wireless networks due to interference from nearby transmissions. A distance-2 interference model serves as a useful abstract...
V. S. Anil Kumar, Madhav V. Marathe, Srinivasan Pa...
IJCAI
2003
15 years 6 months ago
Switching Hypothesized Measurements: A Dynamic Model with Applications to Occlusion Adaptive Joint Tracking
This paper proposes a dynamic model supporting multimodal state space probability distributions and presents the application of the model in dealing with visual occlusions when tr...
Yang Wang 0002, Tele Tan, Kia-Fock Loe
NIPS
2003
15 years 6 months ago
Reasoning about Time and Knowledge in Neural Symbolic Learning Systems
We show that temporal logic and combinations of temporal logics and modal logics of knowledge can be effectively represented in artificial neural networks. We present a Translat...
Artur S. d'Avila Garcez, Luís C. Lamb
AAAI
2000
15 years 6 months ago
Coordination for Multi-Robot Exploration and Mapping
This paper addresses the problem of exploration and mapping of an unknown environment by multiple robots. The mapping algorithm is an on-line approach to likelihood maximization t...
Reid G. Simmons, David Apfelbaum, Wolfram Burgard,...
« Prev « First page 3126 / 3517 Last » Next »