Search Sciweavers | Sciweavers

679 search results - page 92 / 136

» Approximate Temporal Aggregation

134

click to vote

IAT
2005
IEEE

132views Intelligent Agents» more IAT 2005»

Decomposing Large-Scale POMDP Via Belief State Analysis

15 years 8 months ago

Download www.comp.hkbu.edu.hk

Partially observable Markov decision process (POMDP) is commonly used to model a stochastic environment with unobservable states for supporting optimal decision making. Computing ...

Xin Li, William K. Cheung, Jiming Liu

claim paper

Read More »

117

click to vote

ATAL
2004
Springer

132views Intelligent Agents» more ATAL 2004»

Decentralized Markov Decision Processes with Event-Driven Interactions

15 years 8 months ago

Download anytime.cs.umass.edu

Decentralized MDPs provide a powerful formal framework for planning in multi-agent systems, but the complexity of the model limits its usefulness. We study in this paper a class o...

Raphen Becker, Shlomo Zilberstein, Victor R. Lesse...

claim paper

Read More »

click to vote

ECML
2004
Springer

77views Machine Learning» more ECML 2004»

Filtered Reinforcement Learning

15 years 8 months ago

Download eprints.pascal-network.org

Reinforcement learning (RL) algorithms attempt to assign the credit for rewards to the actions that contributed to the reward. Thus far, credit assignment has been done in one of t...

Douglas Aberdeen

claim paper

Read More »

113

click to vote

ICPP
1997
IEEE

110views Distributed And Parallel Com...» more ICPP 1997»

Communication in Parallel Applications: Characterization and Sensitivity Analysis

15 years 7 months ago

Download www.cse.psu.edu

Communication characterization of parallel applications is essential to understand the interplay between architectures and applications in determining the maximum achievable perfo...

Dale Seed, Anand Sivasubramaniam, Chita R. Das

claim paper

Read More »

130

click to vote

ECAI
2006
Springer

245views Artificial Intelligence» more ECAI 2006»

Least Squares SVM for Least Squares TD Learning

15 years 6 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

claim paper

Read More »

« Prev « First page 92 / 136 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers