Sciweavers

1454 search results - page 40 / 291
» Learning and Extending Sublanguages
Sort
View
IDEAL
2004
Springer
15 years 7 months ago
Policy Gradient Method for Team Markov Games
The main aim of this paper is to extend the single-agent policy gradient method for multiagent domains where all agents share the same utility function. We formulate these team pro...
Ville Könönen
ECTEL
2008
Springer
15 years 3 months ago
A Flexible and Tailorable Architecture for Scripts in F2F Collaboration
In this paper we introduce the architecture of the script engine of a collaborative co-located discussion support system, named CoFFEE, and, in particular, we describe its extendib...
Furio Belgiorno, Rosario De Chiara, Ilaria Manno, ...
GECCO
2007
Springer
186views Optimization» more  GECCO 2007»
15 years 8 months ago
Knowledge reuse in genetic programming applied to visual learning
We propose a method of knowledge reuse for an ensemble of genetic programming-based learners solving a visual learning task. First, we introduce a visual learning method that uses...
Wojciech Jaskowski, Krzysztof Krawiec, Bartosz Wie...
BC
2005
71views more  BC 2005»
15 years 1 months ago
The spatiotemporal learning rule and its efficiency in separating spatiotemporal patterns
The hippocampus plays an important role in the course of establishing long-term memory, i.e., to make short-term memory of spatially and temporally associated input information. In...
M. Tsukada, X. Pan
ICML
2005
IEEE
16 years 2 months ago
Reinforcement learning with Gaussian processes
Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...
Yaakov Engel, Shie Mannor, Ron Meir