Search Sciweavers | Sciweavers

425 search results - page 45 / 85

» Metacognitive Control and Optimal Learning

131

Voted

CIIA
2009

208views Information Technology» more CIIA 2009»

Dynamic Scheduling in Petroleum Process using Reinforcement Learning

15 years 4 months ago

Download sunsite.informatik.rwth-aachen.de

Petroleum industry production systems are highly automatized. In this industry, all functions (e.g., planning, scheduling and maintenance) are automated and in order to remain comp...

Nassima Aissani, Bouziane Beldjilali

claim paper

Read More »

112

click to vote

AI
2001
Springer

118views Artificial Intelligence» more AI 2001»

Imitation and Reinforcement Learning in Agents with Heterogeneous Actions

15 years 7 months ago

Download www.cs.toronto.edu

Reinforcement learning techniques are increasingly being used to solve di cult problems in control and combinatorial optimization with promising results. Implicit imitation can acc...

Bob Price, Craig Boutilier

claim paper

Read More »

111

click to vote

ICML
2010
IEEE

230views Machine Learning» more ICML 2010»

Multi-Task Learning of Gaussian Graphical Models

15 years 4 months ago

Download www.cs.sunysb.edu

We present multi-task structure learning for Gaussian graphical models. We discuss uniqueness and boundedness of the optimal solution of the maximization problem. A block coordina...

Jean Honorio, Dimitris Samaras

claim paper

Read More »

108

click to vote

ICONIP
2009

107views Information Technology» more ICONIP 2009»

Tracking in Reinforcement Learning

15 years 29 days ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

claim paper

Read More »

116

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 4 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

« Prev « First page 45 / 85 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers