Search Sciweavers | Sciweavers

651 search results - page 90 / 131

» Algorithms for Inverse Reinforcement Learning

click to vote

ECML
2005
Springer

101views Machine Learning» more ECML 2005»

Model-Based Online Learning of POMDPs

15 years 5 months ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difﬁcult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

claim paper

Read More »

click to vote

COLT
1992
Springer

110views Machine Learning» more COLT 1992»

Query by Committee

15 years 4 months ago

Download hebb.mit.edu

We propose an algorithm called query by committee, in which a committee of students is trained on the same data set. The next query is chosen according to the principle of maximal...

H. Sebastian Seung, Manfred Opper, Haim Sompolinsk...

claim paper

Read More »

110

click to vote

ICA
2010
Springer

256views Signal Processing» more ICA 2010»

Dictionary Learning for Sparse Representations: A Pareto Curve Root Finding Approach

15 years 29 days ago

Download www.see.ed.ac.uk

Abstract. A new dictionary learning method for exact sparse representation is presented in this paper. As the dictionary learning methods often iteratively update the sparse coeffi...

Mehrdad Yaghoobi, Mike E. Davies

claim paper

Read More »

113

click to vote

IROS
2006
IEEE

113views Robotics» more IROS 2006»

Policy Gradient Methods for Robotics

15 years 5 months ago

Download www.cs.utah.edu

— The aquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...

Jan Peters, Stefan Schaal

claim paper

Read More »

121

click to vote

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

15 years 5 months ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

« Prev « First page 90 / 131 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers