Sciweavers

» Reinforcement Learning with Hierarchies of Machines
COLT
2010
Springer
14 years 7 months ago
An Asymptotically Optimal Bandit Algorithm for Bounded Support Models
The multiarmed bandit problem is a typical example of the dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...
Junya Honda, Akimichi Takemura
ICML
2005
IEEE
15 years 10 months ago
Combining model-based and instance-based learning for first order regression
(Extended abstract) Kurt Driessens (a), Saso Dzeroski (b); (a) Department of Computer Science, University of Waikato, Hamilton, New Zealand (kurtd@waikato.ac.nz); (b) Departm...
Kurt Driessens, Saso Dzeroski
ECCV
2008
Springer
15 years 11 months ago
Training Hierarchical Feed-Forward Visual Recognition Models Using Transfer Learning from Pseudo-Tasks
Abstract. Building visual recognition models that adapt across different domains is a challenging task for computer vision. While feature-learning machines in the form of hierarchi...
Amr Ahmed, Kai Yu, Wei Xu, Yihong Gong, Eric P. Xi...
IROS
2006
IEEE
15 years 3 months ago
A Hybrid Control Architecture for Autonomous Robotic Fish
This paper presents a hybrid control architecture for autonomous robotic fish that are able to swim and navigate in unknown or dynamically changing environments. It has a t...
Jindong Liu, Huosheng Hu, Dongbing Gu
ICRA
1994
IEEE
15 years 1 months ago
Harmonic Functions and Collision Probabilities
There is a close relationship between harmonic functions, which have recently been proposed for path planning, and hitting probabilities for random processes. The hitting probab...
Christopher I. Connolly