Search Sciweavers | Sciweavers

575 search results - page 78 / 115

» Reinforcement Learning State Estimator

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

14 years 11 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

click to vote

ML
2000
ACM

150views Machine Learning» more ML 2000»

Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web

14 years 9 months ago

Download informatics.indiana.edu

This paper discusses a novel distributed adaptive algorithm and representation used to construct populations of adaptive Web agents. These InfoSpiders browse networked information ...

Filippo Menczer, Richard K. Belew

claim paper

Read More »

click to vote

ICML
2006
IEEE

146views Machine Learning» more ICML 2006»

Combining discriminative features to infer complex trajectories

15 years 10 months ago

Download www.cs.toronto.edu

We propose a new model for the probabilistic estimation of continuous state variables from a sequence of observations, such as tracking the position of an object in video. This ma...

David A. Ross, Simon Osindero, Richard S. Zemel

claim paper

Read More »

click to vote

ATAL
2009
Springer

172views Intelligent Agents» more ATAL 2009»

Integrating organizational control into multi-agent learning

15 years 4 months ago

Download www.aamas-conference.org

Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in largescale systems. In this work, we develop an organization-b...

Chongjie Zhang, Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

104

click to vote

DIS
2009
Springer

121views Theoretical Computer Science» more DIS 2009»

OMFP: An Approach for Online Mass Flow Prediction in CFB Boilers

15 years 4 months ago

Download www.win.tue.nl

Abstract. Fuel feeding and inhomogeneity of fuel typically cause process ﬂuctuations in the circulating ﬂuidized bed (CFB) boilers. If control systems fail to compensate the �...

Indre Zliobaite, Jorn Bakker, Mykola Pechenizkiy

claim paper

Read More »

« Prev « First page 78 / 115 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers