Sciweavers

199 search results - page 24 / 40
» Efficient Reinforcement Learning with Relocatable Action Mod...
Sort
View
EWRL
2008
14 years 11 months ago
Markov Decision Processes with Arbitrary Reward Processes
Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...
Jia Yuan Yu, Shie Mannor, Nahum Shimkin
AAAI
2006
14 years 11 months ago
Learning Partially Observable Action Schemas
We present an algorithm that derives actions' effects and preconditions in partially observable, relational domains. Our algorithm has two unique features: an expressive rela...
Dafna Shahaf, Eyal Amir
EPIA
1995
Springer
15 years 1 months ago
Using Stochastic Grammars to Learn Robotic Tasks
Abstract. The paper introduces a reinforcement learning-based methodology for performance improvement of Intelligent Controllers. The translation interfaces of a 3-level Hierarchic...
Pedro U. Lima, George N. Saridis
ICML
2008
IEEE
15 years 10 months ago
Automatic discovery and transfer of MAXQ hierarchies
We present an algorithm, HI-MAT (Hierarchy Induction via Models And Trajectories), that discovers MAXQ task hierarchies by applying dynamic Bayesian network models to a successful...
Neville Mehta, Soumya Ray, Prasad Tadepalli, Thoma...
NIPS
2008
14 years 11 months ago
Structure Learning in Human Sequential Decision-Making
We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...
Daniel Acuña, Paul R. Schrater