Sciweavers

125 search results - page 24 / 25
» Reinforcement Learning in Continuous Time and Space
Sort
View
EDUTAINMENT
2006
Springer
13 years 9 months ago
Trans-disciplinary Avenues in Education: Computing and Art
In this paper we report on an interdisciplinary course "Computing and Art" taught at the Sabanci University, Istanbul for the first time in fall of 2004. We also present...
Selim Balcisoy, Elif E. Ayiter
CEC
2005
IEEE
13 years 11 months ago
Sensorimotor experience and its metrics: informational geometry and the temporal horizon
Abstract- We introduce metrics on sensorimotor experience at various temporal scales based on informationtheory. Sensorimotor variables through which the experience of an agent fl...
Chrystopher L. Nehaniv
UAI
2000
13 years 6 months ago
PEGASUS: A policy search method for large MDPs and POMDPs
We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...
Andrew Y. Ng, Michael I. Jordan
DAGSTUHL
2001
13 years 6 months ago
Decision-Theoretic Control of Planetary Rovers
Planetary rovers are small unmanned vehicles equipped with cameras and a variety of sensors used for scientific experiments. They must operate under tight constraints over such res...
Shlomo Zilberstein, Richard Washington, Daniel S. ...
ICRA
2008
IEEE
191views Robotics» more  ICRA 2008»
13 years 11 months ago
Combining automated on-line segmentation and incremental clustering for whole body motions
Abstract— This paper describes a novel approach for incremental learning of human motion pattern primitives through on-line observation of human motion. The observed motion time ...
Dana Kulic, Wataru Takano, Yoshihiko Nakamura