Search Sciweavers | Sciweavers

1393 search results - page 110 / 279

» Machine Learning by Function Decomposition

ICML
2008
IEEE

Learning all optimal policies with multiple criteria

16 years 7 months ago

Download leon.barrettnexus.com

We describe an algorithm for learning in the presence of multiple criteria. Our technique generalizes previous approaches in that it can learn optimal policies for all linear pref...

Leon Barrett, Srini Narayanan

ICML
2006
IEEE

Relational temporal difference learning

16 years 7 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

ICML
2005
IEEE

Reinforcement learning with Gaussian processes

16 years 7 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

ECML
2007
Springer

Graph-Based Domain Mapping for Transfer Learning in General Games

16 years 13 days ago

Download userweb.cs.utexas.edu

A general game player is an agent capable of taking as input a description of a game’s rules in a formal language and proceeding to play without any subsequent human input. To do...

Gregory Kuhlmann, Peter Stone

COLT
2001
Springer

On Learning Monotone DNF under Product Distributions

15 years 10 months ago

Download www.cs.columbia.edu

We show that the class of monotone $2^{O(\sqrt{\log n})}$-term DNF formulae can be PAC learned in polynomial time under the uniform distribution from random examples only. This is an expo...

Rocco A. Servedio
