Search Sciweavers | Sciweavers

» Reinforcement Learning State Estimator

ATAL
2008
Springer

Intelligent Agents

Autonomous transfer for reinforcement learning

Recent work in transfer learning has succeeded in making reinforcement learning algorithms more efficient by incorporating knowledge from previous tasks. However, such methods typ...

Matthew E. Taylor, Gregory Kuhlmann, Peter Stone

AAAI
2007

Intelligent Agents

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends an UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz

DICTA
2007

Applied Computing

Fuzzy Model Based Recognition of Handwritten Hindi Characters

This paper presents the recognition of handwritten Hindi Characters based on the modified exponential membership function fitted to the fuzzy sets derived from features consisting...

Madasu Hanmandlu, O. V. Ramana Murthy, Vamsi Krish...

ATAL
2009
Springer

Intelligent Agents

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh

UAI
2001

Artificial Intelligence

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao
