Sciweavers

» Filtered Reinforcement Learning
IROS
2006
IEEE
15 years 10 months ago
Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic
Recently, many researchers on humanoid robotics are interested in Quasi-Passive-Dynamic Walking (Quasi-PDW) which is similar to human walking. It is desirable that control para...
Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To...
PKDD
2009
Springer
15 years 9 months ago
Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm
Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...
Philippe Rolet, Michèle Sebag, Olivier Teyt...
NIPS
2008
15 years 6 months ago
Hebbian Learning of Bayes Optimal Decisions
Uncertainty is omnipresent when we perceive or interact with our environment, and the Bayesian framework provides computational methods for dealing with it. Mathematical models fo...
Bernhard Nessler, Michael Pfeiffer, Wolfgang Maass
DIS
2006
Springer
15 years 8 months ago
Kalman Filters and Adaptive Windows for Learning in Data Streams
We study the combination of Kalman filter and a recently proposed algorithm for dynamically maintaining a sliding window, for learning from streams of examples. We integrate this i...
Albert Bifet, Ricard Gavaldà
IROS
2008
IEEE
15 years 11 months ago
Mutual development of behavior acquisition and recognition based on value system
Abstract. Both self-learning architecture (embedded structure) and explicit/implicit teaching from other agents (environmental design issue) are necessary not only for one behavior...
Yasutake Takahashi, Yoshihiro Tamura, Minoru Asada