Search Sciweavers | Sciweavers

3050 search results - page 194 / 610

» On-line Algorithms in Machine Learning

147

click to vote

ICML
2007
IEEE

138views Machine Learning» more ICML 2007»

On learning with dissimilarity functions

16 years 4 months ago

Download www.machinelearning.org

We study the problem of learning a classification task in which only a dissimilarity function of the objects is accessible. That is, data are not represented by feature vectors bu...

Liwei Wang, Cheng Yang, Jufu Feng

claim paper

Read More »

143

click to vote

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 4 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

129

click to vote

ICML
2001
IEEE

126views Machine Learning» more ICML 2001»

Round Robin Rule Learning

16 years 4 months ago

Download www.eecs.wsu.edu

In this paper, we discuss a technique for handling multi-class problems with binary classifiers, namely to learn one classifier for each pair of classes. Although this idea is kno...

Johannes Fürnkranz

claim paper

Read More »

135

click to vote

ECML
2006
Springer

116views Machine Learning» more ECML 2006»

Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

15 years 7 months ago

Download web.engr.oregonstate.edu

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

105

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

16 years 4 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

« Prev « First page 194 / 610 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers