Sciweavers

417 search results - page 31 / 84
» Reinforcement Learning Estimation of Distribution Algorithm
IROS
2006
IEEE
Robotics
15 years 8 months ago
Estimating Probability Distribution with Q-learning for Biped Gait Generation and Optimization
A new biped gait generation and optimization method is proposed in the frame of Estimation of Distribution Algorithms (EDAs) with Q-learning method. By formulating the biped ga...
Lingyun Hu, Changjiu Zhou, Zengqi Sun
ATAL
2004
Springer
15 years 7 months ago
Product Distribution Theory for Control of Multi-Agent Systems
Product Distribution (PD) theory is a new framework for controlling Multi-Agent Systems (MAS’s). First we review one motivation of PD theory, as the information-theoretic extens...
Chiu Fan Lee, David H. Wolpert
EMO
2005
Springer
Optimization
15 years 7 months ago
Multiobjective Water Pinch Analysis of the Cuernavaca City Water Distribution Network
Water systems often allow efficient water uses via water reuse and/or recirculation. Defining the network layout connecting water-using processes is a complex problem which involv...
Carlos E. Mariano-Romero, Víctor Alcocer-Ya...
IROS
2007
IEEE
Robotics
15 years 8 months ago
Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression
We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...
Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...
ICML
2007
IEEE
16 years 3 months ago
The rendezvous algorithm: multiclass semi-supervised learning with Markov random walks
We consider the problem of multiclass classification where both labeled and unlabeled data points are given. We introduce and demonstrate a new approach for estimating a distribut...
Arik Azran