Sciweavers

1434 search results - page 253 / 287
» Stochastic computation
Sort
View
ICRA
2010
IEEE
163views Robotics» more  ICRA 2010»
14 years 8 months ago
Exploiting domain knowledge in planning for uncertain robot systems modeled as POMDPs
Abstract— We propose a planning algorithm that allows usersupplied domain knowledge to be exploited in the synthesis of information feedback policies for systems modeled as parti...
Salvatore Candido, James C. Davidson, Seth Hutchin...
ICRA
2010
IEEE
151views Robotics» more  ICRA 2010»
14 years 8 months ago
Estimation of model parameters for steerable needles
Abstract— Flexible needles with bevel tips are being developed as useful tools for minimally invasive surgery and percutaneous therapy. When such a needle is inserted into soft t...
Wooram Park, Kyle Brandon Reed, Allison M. Okamura...
ICRA
2010
IEEE
145views Robotics» more  ICRA 2010»
14 years 8 months ago
Reinforcement learning of motor skills in high dimensions: A path integral approach
— Reinforcement learning (RL) is one of the most general approaches to learning control. Its applicability to complex motor systems, however, has been largely impossible so far d...
Evangelos Theodorou, Jonas Buchli, Stefan Schaal
ICRA
2010
IEEE
146views Robotics» more  ICRA 2010»
14 years 8 months ago
Statistical mobility prediction for planetary surface exploration rovers in uncertain terrain
— Planetary surface exploration rovers must accurately and efficiently predict their mobility on natural, rough terrain. Most approaches to mobility prediction assume precise a p...
Genya Ishigami, Gaurav Kewlani, Karl Iagnemma
COLT
2010
Springer
14 years 7 months ago
An Asymptotically Optimal Bandit Algorithm for Bounded Support Models
Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...
Junya Honda, Akimichi Takemura