Sciweavers

1340 search results - page 124 / 268
» Kalman Temporal Differences
Sort
View
CG
2002
Springer
15 years 3 months ago
Learning a Game Strategy Using Pattern-Weights and Self-play
Abstract. This paper demonstrates the use of pattern-weights in order to develop a strategy for an automated player of a non-cooperative version of the game of Diplomacy. Diplomacy...
Ari Shapiro, Gil Fuchs, Robert Levinson
136
Voted
IJON
2002
79views more  IJON 2002»
15 years 3 months ago
Capacity of perirhinal cortex network for recognising frequently repeating stimuli
Much evidence indicates that discrimination of the familiarity of visual stimuli is dependent on the perirhinal cortex of the temporal lobe. A stimulus can become familiar to anim...
Rafal Bogacz, Malcolm W. Brown
JMLR
2002
100views more  JMLR 2002»
15 years 3 months ago
On the Convergence of Optimistic Policy Iteration
We consider a finite-state Markov decision problem and establish the convergence of a special case of optimistic policy iteration that involves Monte Carlo estimation of Q-values,...
John N. Tsitsiklis
ICRA
2010
IEEE
190views Robotics» more  ICRA 2010»
15 years 2 months ago
Active 3D scene segmentation and detection of unknown objects
Abstract— We present an active vision system for segmentation of visual scenes based on integration of several cues. The system serves as a visual front end for generation of obj...
Mårten Björkman, Danica Kragic
ACL
2009
15 years 1 months ago
The Chinese Aspect Generation Based on Aspect Selection Functions
This paper describes our system for generating Chinese aspect expressions. In the system, the semantics of different aspects is characterized by specific temporal and conceptual f...
Guowen Yang, John A. Bateman