Search Sciweavers | Sciweavers

3643 search results - page 34 / 729

» Learning Submodular Functions

208

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

15 years 12 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

172

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 7 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

174

click to vote

ICML
2004
IEEE

214views Machine Learning» more ICML 2004»

Apprenticeship learning via inverse reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

300

click to vote

Book

796views

Introduction to Machine Learning

17 years 5 months ago

Download robotics.stanford.edu

This is an introductory book about machine learning. Notice that this is a draft book. It may contain typos, mistakes, etc. The book covers the following topics: Boolean Functio...

Nils J. Nilsson

posted by scimaster

Read More »

181

click to vote

APPROX
2007
Springer

112views Algorithms» more APPROX 2007»

Encouraging Cooperation in Sharing Supermodular Costs

16 years 15 days ago

Download www.gtcenter.org

Abstract Consider a situation where a group of agents wishes to share the costs of their joint actions, and needs to determine how to distribute the costs amongst themselves in a f...

Andreas S. Schulz, Nelson A. Uhan

claim paper

Read More »

« Prev « First page 34 / 729 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers