Search Sciweavers | Sciweavers

387 search results - page 4 / 78

» Improving Reinforcement Learning by Using Case Based Heurist...

Voted

AIPS
2008

95views Artificial Intelligence» more AIPS 2008»

Learning Heuristic Functions through Approximate Linear Programming

15 years 1 months ago

Download anytime.cs.umass.edu

Planning problems are often formulated as heuristic search. The choice of the heuristic function plays a significant role in the performance of planning systems, but a good heuris...

Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

14 years 10 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

Voted

ICCBR
2009
Springer

159views Automated Reasoning» more ICCBR 2009»

Case-Based Reasoning in Transfer Learning

15 years 6 months ago

Download www.knexusresearch.com

Positive transfer learning (TL) occurs when, after gaining experience from learning how to solve a (source) task, the same learner can exploit this experience to improve performanc...

David W. Aha, Matthew Molineaux, Gita Sukthankar

claim paper

Read More »

Voted

NIPS
1994

90views Information Technology» more NIPS 1994»

Reinforcement Learning with Soft State Aggregation

15 years 29 days ago

Download www.eecs.umich.edu

It is widely accepted that the use of more compact representations than lookup tables is crucial to scaling reinforcement learning (RL) algorithms to real-world problems. Unfortun...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

174

click to vote

NIPS
2008

149views Information Technology» more NIPS 2008»

Optimization on a Budget: A Reinforcement Learning Approach

15 years 1 months ago

Download www.cs.arizona.edu

Many popular optimization algorithms, like the Levenberg-Marquardt algorithm (LMA), use heuristic-based "controllers" that modulate the behavior of the optimizer during ...

Paul Ruvolo, Ian R. Fasel, Javier R. Movellan

claim paper

Read More »

« Prev « First page 4 / 78 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers