Sciweavers

» Reinforcement Learning in MirrorBot
IROS
2006
IEEE
15 years 3 months ago
Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic
Recently, many researchers in humanoid robotics have become interested in Quasi-Passive-Dynamic Walking (Quasi-PDW), which is similar to human walking. It is desirable that control para...
Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To...
PKDD
2009
Springer
15 years 2 months ago
Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm
Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...
Philippe Rolet, Michèle Sebag, Olivier Teyt...
IROS
2008
IEEE
15 years 4 months ago
Mutual development of behavior acquisition and recognition based on value system
Abstract. Both self-learning architecture (embedded structure) and explicit/implicit teaching from other agents (environmental design issue) are necessary not only for one behavior...
Yasutake Takahashi, Yoshihiro Tamura, Minoru Asada
WAPCV
2007
Springer
15 years 3 months ago
Learning to Attend - From Bottom-Up to Top-Down
The control of overt visual attention relies on an interplay of bottom-up and top-down mechanisms. Purely bottom-up models may provide a reasonable account of the looking behaviors...
Hector Jasso, Jochen Triesch
CSE
2008
IEEE
15 years 4 months ago
Adaptation to Dynamic Resource Availability in Ad Hoc Grids through a Learning Mechanism
Ad-hoc Grids are highly heterogeneous and dynamic networks. One of the main challenges of resource allocation in such environments is to find mechanisms which do not rely on the ...
Behnaz Pourebrahimi, Koen Bertels