Sciweavers

771 search results - page 44 / 155
» Markov Decision Processes with Arbitrary Reward Processes
Sort
View
124
Voted
NECO
2008
108views more  NECO 2008»
15 years 1 months ago
Optimization of Decision Making in Multilayer Networks: The Role of Locus Coeruleus
Previous theoretical work has shown that a single layer neural network can implement the optimal decision process for simple, two alternative forced choice (2AFC) tasks. However, ...
Eric Shea-Brown, Mark S. Gilzenrat, Jonathan D. Co...
NECO
2007
150views more  NECO 2007»
15 years 1 months ago
Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule
Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...
Dorit Baras, Ron Meir
ATAL
2008
Springer
15 years 3 months ago
Controlling deliberation in a Markov decision process-based agent
Meta-level control manages the allocation of limited resources to deliberative actions. This paper discusses efforts in adding meta-level control capabilities to a Markov Decision...
George Alexander, Anita Raja, David J. Musliner
112
Voted
WISE
2002
Springer
15 years 6 months ago
An MDP-based Peer-to-Peer Search Server Network
A distributed search system consists of a large number of autonomous search servers logically connected in a peerto-peer network. Each search server maintains a local index of a c...
Yipeng Shen, Dik Lun Lee
120
Voted
IJCAI
2003
15 years 3 months ago
A Planning Algorithm for Predictive State Representations
We address the problem of optimally controlling stochastic environments that are partially observable. The standard method for tackling such problems is to define and solve a Part...
Masoumeh T. Izadi, Doina Precup