Search Sciweavers | Sciweavers

1310 search results - page 59 / 262

» Progressive Optimization in Action

134

click to vote

AIPS
2000

165views Artificial Intelligence» more AIPS 2000»

Admissible Heuristics for Optimal Planning

15 years 5 months ago

Download www.informatik.uni-freiburg.de

hsp and hspr are two recent planners that search the state-space using an heuristic function extracted from Strips encodings. hsp does a forward search from the initial state reco...

Patrik Haslum, Hector Geffner

claim paper

Read More »

140

click to vote

TSMC
2011

258views more TSMC 2011»

Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions

14 years 11 months ago

Download www.montefiore.ulg.ac.be

—This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-l...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...

claim paper

Read More »

109

click to vote

GECCO
2006
Springer

123views Optimization» more GECCO 2006»

The parallel Nash Memory for asymmetric games

15 years 8 months ago

Download people.cs.uu.nl

Coevolutionary algorithms search for test cases as part of the search process. The resulting adaptive evaluation function takes away the need to define a fixed evaluation function...

Frans A. Oliehoek, Edwin D. de Jong, Nikos A. Vlas...

claim paper

Read More »

139

click to vote

CDC
2009
IEEE

132views Control Systems» more CDC 2009»

Q-learning and Pontryagin's Minimum Principle

15 years 9 months ago

Download www.stanford.edu

Abstract— Q-learning is a technique used to compute an optimal policy for a controlled Markov chain based on observations of the system controlled using a non-optimal policy. It ...

Prashant G. Mehta, Sean P. Meyn

claim paper

Read More »

144

click to vote

IJCAI
2003

173views Artificial Intelligence» more IJCAI 2003»

A Planning Algorithm for Predictive State Representations

15 years 5 months ago

Download dli.iiit.ac.in

We address the problem of optimally controlling stochastic environments that are partially observable. The standard method for tackling such problems is to define and solve a Part...

Masoumeh T. Izadi, Doina Precup

claim paper

Read More »

« Prev « First page 59 / 262 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers