Search Sciweavers | Sciweavers

ML
2008
ACM

152 views | Machine Learning

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 6 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, Ré...

ICCBR
2009
Springer

159 views | Automated Reasoning

Case-Based Reasoning in Transfer Learning

16 years 25 days ago

Download www.knexusresearch.com

Positive transfer learning (TL) occurs when, after gaining experience from learning how to solve a (source) task, the same learner can exploit this experience to improve performanc...

David W. Aha, Matthew Molineaux, Gita Sukthankar

AIMSA
2006
Springer

159 views | Artificial Intelligence

Machine Learning for Spoken Dialogue Management: An Experiment with Speech-Based Database Querying

15 years 10 months ago

Download tcts.fpms.ac.be

Although speech and language processing techniques achieved a relative maturity during the last decade, designing a spoken dialogue system is still a tailoring task because of the ...

Olivier Pietquin

ECAI
2008
Springer

124 views | Artificial Intelligence

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 8 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo

DAGM
2007
Springer

148 views | Image Processing

Efficient Learning of Neural Networks with Evolutionary Algorithms

15 years 10 months ago

Download www.ks.informatik.uni-kiel.de

Abstract. In this article we present EANT2, a method that creates neural networks (NNs) by evolutionary reinforcement learning. The structure of NNs is developed using mutation ope...

Nils T. Siebel, Jochen Krause, Gerald Sommer
