Abstract. Q-learning can be used to learn a control policy that maximises a scalar reward through interaction with the environment. Qlearning is commonly applied to problems with d...
Chris Gaskett, David Wettergreen, Alexander Zelins...
This paper describes Embra, a simulator for the processors, caches, and memory systems of uniprocessors and cache-coherent multiprocessors. When running as part of the SimOS simul...
Abstract— This paper presents a study which extends previous work on collective motion control of a group of vehicles for target tracking. The approach is to drive the group cent...
Anawat Pongpunwattana, Benjamin I. Triplett, Krist...
— A biped walking pattern generator which allows an additional ZMP control (auxiliary ZMP) is presented. An auxiliary ZMP is realized by an inverse system added to a pattern gene...
— We report on the design and stability analysis of a simple quadruped running controller that can autonomously generate steady running with good energy efficiency and suppress ...