Sciweavers

1340 search results - page 216 / 268
» Kalman Temporal Differences
Sort
View
E4MAS
2006
Springer
15 years 3 months ago
Spatially Distributed Normative Infrastructure
Abstract. In previous works we have presented a model to describe and simulate environment for situated multi-agent systems, that we called ELMS. Here, we present an extensions to ...
Fabio Y. Okuyama, Rafael H. Bordini, Antônio...
EUROPAR
2006
Springer
15 years 3 months ago
Specification of Inefficiency Patterns for MPI-2 One-Sided Communication
Abstract. Automatic performance analysis of parallel programs can be accomplished by scanning event traces of program execution for patterns representing inefficient behavior. The ...
Andrej Kühnal, Marc-André Hermanns, Be...
GECCO
2006
Springer
133views Optimization» more  GECCO 2006»
15 years 3 months ago
On-line evolutionary computation for reinforcement learning in stochastic domains
In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...
Shimon Whiteson, Peter Stone
AAAI
2007
15 years 2 months ago
Discovering Multivariate Motifs using Subsequence Density Estimation and Greedy Mixture Learning
The problem of locating motifs in real-valued, multivariate time series data involves the discovery of sets of recurring patterns embedded in the time series. Each set is composed...
David Minnen, Charles Lee Isbell Jr., Irfan A. Ess...
ATAL
2008
Springer
15 years 1 months ago
Sigma point policy iteration
In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...
Michael H. Bowling, Alborz Geramifard, David Winga...