Sciweavers

166 search results - page 30 / 34
» Safe exploration for reinforcement learning
Sort
View
ICRA
2005
IEEE
128views Robotics» more  ICRA 2005»
15 years 7 months ago
Vibration-based Terrain Analysis for Mobile Robots
—Safe, autonomous mobility in rough terrain is an important requirement for planetary exploration rovers. Knowledge of local terrain properties is critical to ensure a rover’s ...
Christopher A. Brooks, Karl Iagnemma, Steven Dubow...
COGSR
2011
71views more  COGSR 2011»
14 years 9 months ago
Psychological models of human and optimal performance in bandit problems
In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a fixed but unknown rate of reward, to maximize their total number of rewards ov...
Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...
124
Voted
COLT
2010
Springer
14 years 12 months ago
An Asymptotically Optimal Bandit Algorithm for Bounded Support Models
Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...
Junya Honda, Akimichi Takemura
JMLR
2010
141views more  JMLR 2010»
14 years 8 months ago
Pinview: Implicit Feedback in Content-Based Image Retrieval
This paper describes Pinview, a content-based image retrieval system that exploits implicit relevance feedback during a search session. Pinview contains several novel methods that...
Peter Auer, Zakria Hussain, Samuel Kaski, Arto Kla...
112
Voted
ISCAS
2002
IEEE
153views Hardware» more  ISCAS 2002»
15 years 6 months ago
Biological learning modeled in an adaptive floating-gate system
We have implemented an aspect of learning and memory in the nervous system using analog electronics. Using a simple synaptic circuit we realize networks with Hebbian type adaptati...
Christal Gordon, Paul E. Hasler