Sciweavers

453 search results - page 40 / 91
» Learning from actions not taken: a multiagent learning algor...
Sort
View
ICML
2005
IEEE
15 years 10 months ago
A causal approach to hierarchical decomposition of factored MDPs
We present Variable Influence Structure Analysis, an algorithm that dynamically performs hierarchical decomposition of factored Markov decision processes. Our algorithm determines...
Anders Jonsson, Andrew G. Barto
JCP
2008
139views more  JCP 2008»
14 years 9 months ago
Agent Learning in Relational Domains based on Logical MDPs with Negation
In this paper, we propose a model named Logical Markov Decision Processes with Negation for Relational Reinforcement Learning for applying Reinforcement Learning algorithms on the ...
Song Zhiwei, Chen Xiaoping, Cong Shuang
ICMAS
1998
14 years 11 months ago
How to Explore your Opponent's Strategy (almost) Optimally
This work presents a lookahead-based exploration strategy for a model-based learning agent that enables exploration of the opponent's behavior during interaction in a multi-a...
David Carmel, Shaul Markovitch
ICES
1998
Springer
131views Hardware» more  ICES 1998»
15 years 1 months ago
Aspects of Digital Evolution: Geometry and Learning
In this paper we present a new chromosome representation for evolving digital circuits. The representation is based very closely on the chip architecture of the Xilinx 6216 FPGA. W...
Julian F. Miller, Peter Thomson
BMVC
2010
14 years 7 months ago
Histogram of Body Poses and Spectral Regression Discriminant Analysis for Human Action Categorization
This paper explores a recently proposed and rarely reported subspace learning method, Spectral Regression Discriminant Analysis (SRDA) [1, 2], on silhouette based human action rec...
Ling Shao, Xiuli Chen