Search Sciweavers | Sciweavers

ICRA 2010, IEEE

163 views, Robotics

Exploiting domain knowledge in planning for uncertain robot systems modeled as POMDPs

15 years 4 months ago

We propose a planning algorithm that allows user-supplied domain knowledge to be exploited in the synthesis of information feedback policies for systems modeled as parti...

Salvatore Candido, James C. Davidson, Seth Hutchin...

ICRA 2010, IEEE

151 views, Robotics

Estimation of model parameters for steerable needles

15 years 4 months ago

Download reedlab.eng.usf.edu

Flexible needles with bevel tips are being developed as useful tools for minimally invasive surgery and percutaneous therapy. When such a needle is inserted into soft t...

Wooram Park, Kyle Brandon Reed, Allison M. Okamura...

ICRA 2010, IEEE

145 views, Robotics

Reinforcement learning of motor skills in high dimensions: A path integral approach

15 years 4 months ago

Download www-personal.acfr.usyd.edu.au

Reinforcement learning (RL) is one of the most general approaches to learning control. Its applicability to complex motor systems, however, has been largely impossible so far d...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

ICRA 2010, IEEE

146 views, Robotics

Statistical mobility prediction for planetary surface exploration rovers in uncertain terrain

15 years 4 months ago

Download web.mit.edu

Planetary surface exploration rovers must accurately and efficiently predict their mobility on natural, rough terrain. Most approaches to mobility prediction assume precise a p...

Genya Ishigami, Gaurav Kewlani, Karl Iagnemma

COLT 2010, Springer

207 views, Machine Learning

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 3 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura
