Search Sciweavers | Sciweavers

199 search results - page 24 / 40

» Efficient Reinforcement Learning with Relocatable Action Mod...

click to vote

EWRL
2008

129views Machine Learning» more EWRL 2008»

Markov Decision Processes with Arbitrary Reward Processes

14 years 11 months ago

Download www.cim.mcgill.ca

Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...

Jia Yuan Yu, Shie Mannor, Nahum Shimkin

claim paper

Read More »

click to vote

AAAI
2006

136views Intelligent Agents» more AAAI 2006»

Learning Partially Observable Action Schemas

14 years 11 months ago

Download reason.cs.uiuc.edu

We present an algorithm that derives actions' effects and preconditions in partially observable, relational domains. Our algorithm has two unique features: an expressive rela...

Dafna Shahaf, Eyal Amir

claim paper

Read More »

click to vote

EPIA
1995
Springer

110views Artificial Intelligence» more EPIA 1995»

Using Stochastic Grammars to Learn Robotic Tasks

15 years 1 months ago

Download welcome.isr.ist.utl.pt

Abstract. The paper introduces a reinforcement learning-based methodology for performance improvement of Intelligent Controllers. The translation interfaces of a 3-level Hierarchic...

Pedro U. Lima, George N. Saridis

claim paper

Read More »

click to vote

ICML
2008
IEEE

162views Machine Learning» more ICML 2008»

Automatic discovery and transfer of MAXQ hierarchies

15 years 10 months ago

Download pages.cs.wisc.edu

We present an algorithm, HI-MAT (Hierarchy Induction via Models And Trajectories), that discovers MAXQ task hierarchies by applying dynamic Bayesian network models to a successful...

Neville Mehta, Soumya Ray, Prasad Tadepalli, Thoma...

claim paper

Read More »

click to vote

NIPS
2008

129views Information Technology» more NIPS 2008»

Structure Learning in Human Sequential Decision-Making

14 years 11 months ago

Download www-users.cs.umn.edu

We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...

Daniel Acuña, Paul R. Schrater

claim paper

Read More »

« Prev « First page 24 / 40 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers