Search Sciweavers | Sciweavers

226 search results - page 18 / 46

» Linear Bayesian Reinforcement Learning

155

click to vote

AAAI
2010

134views Intelligent Agents» more AAAI 2010»

Reinforcement Learning Via Practice and Critique Advice

15 years 8 months ago

Download web.engr.oregonstate.edu

We consider the problem of incorporating end-user advice into reinforcement learning (RL). In our setting, the learner alternates between practicing, where learning is based on ac...

Kshitij Judah, Saikat Roy, Alan Fern, Thomas G. Di...

claim paper

Read More »

175

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 7 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

195

Voted

PRICAI
2000
Springer

193views Artificial Intelligence» more PRICAI 2000»

Generating Hierarchical Structure in Reinforcement Learning from State Variables

15 years 10 months ago

Download www.csee.umbc.edu

This paper presents the CQ algorithm which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The ...

Bernhard Hengst

claim paper

Read More »

230

click to vote

AAAI
2012

205views Intelligent Agents» more AAAI 2012»

Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains

13 years 8 months ago

Download www.intelligence.tuc.gr

We present the ﬁrst real-world benchmark for sequentiallyoptimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...

Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...

claim paper

Read More »

145

click to vote

CIVR
2007
Springer

98views Image Analysis» more CIVR 2007»

Semantics reinforcement and fusion learning for multimedia streams

16 years 19 days ago

Download wang.ist.psu.edu

Fusion of multimedia streams for enhanced performance is a critical problem for retrieval. However, fusion performance tends to easily overﬁt the hillclimb set used to learn fus...

Dhiraj Joshi, Milind R. Naphade, Apostol Natsev

claim paper

Read More »

« Prev « First page 18 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers