Search Sciweavers | Sciweavers

2464 search results - page 254 / 493

» Efficient learning equilibrium

EWRL 2008

129 views, Machine Learning

Markov Decision Processes with Arbitrary Reward Processes

15 years 6 months ago

Download www.cim.mcgill.ca

Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...

Jia Yuan Yu, Shie Mannor, Nahum Shimkin

GEM 2008

142 views, Artificial Intelligence

Evaluating a Parallel Evolutionary Algorithm on the Chess Endgame Problem

15 years 6 months ago

Download www.westmont.edu

Classifying the endgame positions in Chess can be challenging for humans and is known to be a difficult task in machine learning. An evolutionary algorithm would seem to be the ide...

Wayne Iba, Kelsey Marshman, Benjamin Fisk

NIPS 2008

165 views, Information Technology

Regularized Policy Iteration

15 years 6 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

GECCO 2008, Springer

170 views, Optimization

Evolving prediction weights using evolution strategy

15 years 5 months ago

Download www.cs.bham.ac.uk

The evolution strategy is one of the strongest evolutionary algorithms for optimizing real-value vectors. In this paper, we study how to use it for the evolution of prediction wei...

Trung Hau Tran, Cédric Sanza, Yves Duthen

ICML 2010, IEEE

198 views, Machine Learning

Probabilistic Backward and Forward Reasoning in Stochastic Relational Worlds

15 years 5 months ago

Download user.cs.tu-berlin.de

Inference in graphical models has emerged as a promising technique for planning. A recent approach to decision-theoretic planning in relational domains uses forward inference in d...

Tobias Lang, Marc Toussaint
