Search Sciweavers | Sciweavers

377 search results - page 19 / 76

» Convergence of Stochastic Iterative Dynamic Programming Algo...

187

click to vote

NIPS
1996

112views Information Technology» more NIPS 1996»

Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning

15 years 8 months ago

Download www.ri.cmu.edu

Model learning combined with dynamic programming has been shown to be e ective for learning control of continuous state dynamic systems. The simplest method assumes the learned mod...

Jeff G. Schneider

claim paper

Read More »

173

click to vote

ICPR
2010
IEEE

187views Computer Vision» more ICPR 2010»

Fast Training of Object Detection Using Stochastic Gradient Descent

15 years 9 months ago

Download vca.ele.tue.nl

Training datasets for object detection problems are typically very large and Support Vector Machine (SVM) implementations are computationally complex. As opposed to these complex ...

Rob Wijnhoven, Peter H. N. De With

claim paper

Read More »

200

click to vote

ECAI
2010
Springer

238views Artificial Intelligence» more ECAI 2010»

The Dynamics of Multi-Agent Reinforcement Learning

15 years 8 months ago

Download www.doc.ic.ac.uk

Abstract. Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...

Luke Dickens, Krysia Broda, Alessandra Russo

claim paper

Read More »

196

click to vote

WSC
2004

124views Modeling And Simulation» more WSC 2004»

Simulation-Based Optimization Using Simulated Annealing With Confidence Interval

15 years 8 months ago

Download www.informs-sim.org

This paper develops a variant of Simulated Annealing (SA) algorithm for solving discrete stochastic optimization problems where the objective function is stochastic and can be eva...

Talal M. Alkhamis, Mohamed A. Ahmed

claim paper

Read More »

262

click to vote

AI
2002
Springer

171views Artificial Intelligence» more AI 2002»

Multiagent learning using a variable learning rate

15 years 6 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

« Prev « First page 19 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers