Search Sciweavers | Sciweavers

85

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 6 days ago

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

104

click to vote

COLT
2010
Springer

183views Machine Learning» more COLT 2010»

Regret Minimization With Concept Drift

14 years 8 months ago

Download www.seas.upenn.edu

In standard online learning, the goal of the learner is to maintain an average loss that is "not too big" compared to the loss of the best-performing function in a fixed...

Koby Crammer, Yishay Mansour, Eyal Even-Dar, Jenni...

claim paper

Read More »

72

click to vote

ICRA
2008
IEEE

119views Robotics» more ICRA 2008»

Towards schema-based, constructivist robot learning: Validating an evolutionary search algorithm for schema chunking

15 years 5 months ago

Download www.cs.utk.edu

— In this paper, we lay the groundwork for extending our previously developed ASyMTRe architecture to enable constructivist learning for multi-robot team tasks. The ASyMTRe archi...

Yifan Tang, Lynne E. Parker

claim paper

Read More »

80

click to vote

IWANN
1999
Springer

115views Neural Networks» more IWANN 1999»

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

15 years 3 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

claim paper

Read More »

126

click to vote

ESANN
2006

262views Neural Networks» more ESANN 2006»

Random Forests Feature Selection with K-PLS: Detecting Ischemia from Magnetocardiograms

15 years 5 days ago

Download www.dice.ucl.ac.be

Random Forests were introduced by Breiman for feature (variable) selection and improved predictions for decision tree models. The resulting model is often superior to AdaBoost and ...

Long Han, Mark J. Embrechts, Boleslaw K. Szymanski...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers