Search Sciweavers | Sciweavers

2566 search results - page 43 / 514

» Relating reinforcement learning performance to classificatio...

SDM 2010, SIAM | 158 views | Data Mining

On the Use of Combining Rules in Relational Probability Trees

15 years 1 month ago

Download lirias.kuleuven.be

A relational probability tree (RPT) is a type of decision tree that can be used for probabilistic classification of instances with a relational structure. Each leaf of an RPT cont...

Daan Fierens

ECML 2004, Springer | 112 views | Machine Learning

Convergence and Divergence in Standard and Averaging Reinforcement Learning

15 years 5 months ago

Download igitur-archive.library.uu.nl

Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...

Marco Wiering

ATAL 2006, Springer | 135 views | Intelligent Agents

Learning the required number of agents for complex tasks

15 years 3 months ago

Download www.damas.ift.ulaval.ca

Coordinating agents in a complex environment is a hard problem, but it can become even harder when certain characteristics of the tasks, like the required number of agents, are un...

Sébastien Paquet, Brahim Chaib-draa

CAEPIA 2011, Springer | 188 views | Artificial Intelligence

Evaluating a Reinforcement Learning Algorithm with a General Intelligence Test

13 years 11 months ago

Download users.dsic.upv.es

In this paper we apply the recent notion of anytime universal intelligence tests to the evaluation of a popular reinforcement learning algorithm, Q-learning. We show that a general...

Javier Insa-Cabrera, David L. Dowe, José He...

EWRL 2008 | 186 views | Machine Learning

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

15 years 1 month ago

Download webee.technion.ac.il

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin
