Sciweavers

87 search results - page 17 / 18
» Direct Policy Search Reinforcement Learning for Robot Contro...
Sort
View
TROB
2010
159views more  TROB 2010»
12 years 12 months ago
Task-Specific Generalization of Discrete and Periodic Dynamic Movement Primitives
Abstract--Acquisition of new sensorimotor knowledge by imitation is a promising paradigm for robot learning. To be effective, action learning should not be limited to direct replic...
Ales Ude, Andrej Gams, Tamim Asfour, Jun Morimoto
AAMAS
2005
Springer
13 years 5 months ago
Cooperative Multi-Agent Learning: The State of the Art
Cooperative multi-agent systems are ones in which several agents attempt, through their interaction, to jointly solve tasks or to maximize utility. Due to the interactions among t...
Liviu Panait, Sean Luke
ICML
2003
IEEE
14 years 6 months ago
Planning in the Presence of Cost Functions Controlled by an Adversary
We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a rob...
H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum
CORR
2011
Springer
219views Education» more  CORR 2011»
13 years 7 days ago
Active Markov Information-Theoretic Path Planning for Robotic Environmental Sensing
Recent research in multi-robot exploration and mapping has focused on sampling environmental fields, which are typically modeled using the Gaussian process (GP). Existing informa...
Kian Hsiang Low, John M. Dolan, Pradeep K. Khosla
DAGSTUHL
2001
13 years 6 months ago
Decision-Theoretic Control of Planetary Rovers
Planetary rovers are small unmanned vehicles equipped with cameras and a variety of sensors used for scientific experiments. They must operate under tight constraints over such res...
Shlomo Zilberstein, Richard Washington, Daniel S. ...