Search Sciweavers | Sciweavers

199 search results - page 1 / 40

AAAI
2007

72 views | Intelligent Agents

Efficient Reinforcement Learning with Relocatable Action Models

13 years 8 months ago

Download www.aaai.org

Bethany R. Leffler, Michael L. Littman, Timothy Ed...

EWRL
2008

186 views | Machine Learning

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

13 years 7 months ago

Download webee.technion.ac.il

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

SAC
2005
ACM

149 views | Applied Computing

Reinforcement learning agents with primary knowledge designed by analytic hierarchy process

13 years 11 months ago

Download k2x.ice.ous.ac.jp

This paper presents a novel model of reinforcement learning agents. A feature of our learning agent model is to integrate analytic hierarchy process (AHP) into a standard reinforc...

Kengo Katayama, Takahiro Koshiishi, Hiroyuki Narih...

BROADNETS
2004
IEEE

154 views | Computer Networks

Efficient QoS Provisioning for Adaptive Multimedia in Mobile Communication Networks by Reinforcement Learning

13 years 9 months ago

Download www.ece.ubc.ca

The scarcity and large fluctuations of link bandwidth in wireless networks have motivated the development of adaptive multimedia services in mobile communication networks, where i...

Fei Yu, Vincent W. S. Wong, Victor C. M. Leung

ICML
2006
IEEE

131 views | Machine Learning

PAC model-free reinforcement learning

14 years 6 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm, Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...
