Search Sciweavers | Sciweavers

449 search results - page 20 / 90

» Finding Structure in Reinforcement Learning

223

Voted

CAMP
2005
IEEE

203views Computer Architecture» more CAMP 2005»

Reinforcement Learning for P2P Searching

16 years 1 months ago

Download sixearch.org

— For a peer-to-peer (P2P) system holding massive amount of data, an efﬁcient and scalable search for resource sharing is a key determinant to its practical usage. Unstructured...

Luca Gatani, Giuseppe Lo Re, Alfonso Urso, Salvato...

claim paper

Read More »

219

click to vote

ICANN
2010
Springer

166views Neural Networks» more ICANN 2010»

Exploring Continuous Action Spaces with Diffusion Trees for Reinforcement Learning

15 years 8 months ago

Download www.tu-ilmenau.de

We propose a new approach for reinforcement learning in problems with continuous actions. Actions are sampled by means of a diffusion tree, which generates samples in the continuou...

Christian Vollmer, Erik Schaffernicht, Horst-Micha...

claim paper

Read More »

180

click to vote

ICML
2005
IEEE

145views Machine Learning» more ICML 2005»

Proto-value functions: developmental reinforcement learning

16 years 8 months ago

Download www.cs.umass.edu

This paper presents a novel framework called proto-reinforcement learning (PRL), based on a mathematical model of a proto-value function: these are task-independent basis function...

Sridhar Mahadevan

claim paper

Read More »

192

click to vote

ATAL
2008
Springer

104views Intelligent Agents» more ATAL 2008»

Expediting RL by using graphical structures

15 years 9 months ago

Download www.cs.washington.edu

The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...

Peng Dai, Alexander L. Strehl, Judy Goldsmith

claim paper

Read More »

178

click to vote

COLT
2008
Springer

135views Machine Learning» more COLT 2008»

Finding Metric Structure in Information Theoretic Clustering

15 years 9 months ago

Download colt2008.cs.helsinki.fi

We study the problem of clustering discrete probability distributions with respect to the Kullback-Leibler (KL) divergence. This problem arises naturally in many applications. Our...

Kamalika Chaudhuri, Andrew McGregor

claim paper

Read More »

« Prev « First page 20 / 90 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers