Search Sciweavers | Sciweavers

1340 search results - page 216 / 268

» Kalman Temporal Differences

164

click to vote

E4MAS
2006
Springer

112views Intelligent Agents» more E4MAS 2006»

Spatially Distributed Normative Infrastructure

15 years 9 months ago

Download www.inf.ufrgs.br

Abstract. In previous works we have presented a model to describe and simulate environment for situated multi-agent systems, that we called ELMS. Here, we present an extensions to ...

Fabio Y. Okuyama, Rafael H. Bordini, Antônio...

claim paper

Read More »

156

Voted

EUROPAR
2006
Springer

103views Distributed And Parallel Com...» more EUROPAR 2006»

Specification of Inefficiency Patterns for MPI-2 One-Sided Communication

15 years 9 months ago

Download www.fz-juelich.de

Abstract. Automatic performance analysis of parallel programs can be accomplished by scanning event traces of program execution for patterns representing inefficient behavior. The ...

Andrej Kühnal, Marc-André Hermanns, Be...

claim paper

Read More »

161

click to vote

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

15 years 9 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

160

click to vote

AAAI
2007

140views Intelligent Agents» more AAAI 2007»

Discovering Multivariate Motifs using Subsequence Density Estimation and Greedy Mixture Learning

15 years 8 months ago

Download www.cc.gatech.edu

The problem of locating motifs in real-valued, multivariate time series data involves the discovery of sets of recurring patterns embedded in the time series. Each set is composed...

David Minnen, Charles Lee Isbell Jr., Irfan A. Ess...

claim paper

Read More »

155

click to vote

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

15 years 7 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

« Prev « First page 216 / 268 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers