Sciweavers

44 search results - page 9 / 9
» Batch reinforcement learning in a complex domain
Sort
View
GECCO
2005
Springer
162views Optimization» more  GECCO 2005»
13 years 10 months ago
An autonomous explore/exploit strategy
In reinforcement learning problems it has been considered that neither exploitation nor exploration can be pursued exclusively without failing at the task. The optimal balance bet...
Alex McMahon, Dan Scott, William N. L. Browne
IJCAI
2003
13 years 6 months ago
Use of Off-line Dynamic Programming for Efficient Image Interpretation
An interpretation system finds the likely mappings from portions of an image to real-world objects. An interpretation policy specifies when to apply which imaging operator, to whi...
Ramana Isukapalli, Russell Greiner
AAAI
2008
13 years 7 months ago
Adaptive Management of Air Traffic Flow: A Multiagent Coordination Approach
This paper summarizes recent advances in the application of multiagent coordination algorithms to air traffic flow management. Indeed, air traffic flow management is one of the fu...
Kagan Tumer, Adrian K. Agogino
DAGSTUHL
2001
13 years 6 months ago
Decision-Theoretic Control of Planetary Rovers
Planetary rovers are small unmanned vehicles equipped with cameras and a variety of sensors used for scientific experiments. They must operate under tight constraints over such res...
Shlomo Zilberstein, Richard Washington, Daniel S. ...