Sciweavers

109 search results - page 20 / 22
» Policy teaching through reward function learning
AIME
2007
Springer
14 years 7 days ago
Variable Selection for Optimal Decision Making
This paper discusses variable selection for medical decision making; in particular decisions regarding when to provide treatment and which treatment to provide. Current variable se...
Lacey Gunter, Ji Zhu, Susan Murphy
ECML
2006
Springer
13 years 9 months ago
Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions
We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJ...
Sébastien Jodogne, Justus H. Piater
IAT
2008
IEEE
13 years 6 months ago
Scaling Up Multi-agent Reinforcement Learning in Complex Domains
TD-FALCON (Temporal Difference - Fusion Architecture for Learning, COgnition, and Navigation) is a class of self-organizing neural networks that incorporates Temporal Difference (...
Dan Xiao, Ah-Hwee Tan
IADIS
2004
13 years 7 months ago
The Intercollege Student Intranet
In this paper we present the Intercollege Intranet, which has been developed within the strategic aims of the College to offer better services to students and faculty and to enhan...
Philippos Pouyioutas, Maria Poveda, George Soleas,...
IJRR
2008
13 years 6 months ago
Learning to Control in Operational Space
One of the most general frameworks for phrasing control problems for complex, redundant robots is operational space control. However, while this framework is of essential importan...
Jan Peters, Stefan Schaal