Sciweavers

771 search results - page 106 / 155
» Markov Decision Processes with Arbitrary Reward Processes
Sort
View
ICML
2008
IEEE
16 years 2 months ago
Manifold alignment using Procrustes analysis
In this paper we introduce a novel approach to manifold alignment, based on Procrustes analysis. Our approach differs from "semisupervised alignment" in that it results ...
Chang Wang, Sridhar Mahadevan
ICML
2003
IEEE
16 years 2 months ago
Exploration in Metric State Spaces
We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...
Sham Kakade, Michael J. Kearns, John Langford
EVOW
2009
Springer
15 years 8 months ago
Grid Coevolution for Adaptive Simulations: Application to the Building of Opening Books in the Game of Go
This paper presents a successful application of parallel (grid) coevolution applied to the building of an opening book (OB) in 9x9 Go. Known sayings around the game of Go are refou...
Pierre Audouard, Guillaume Chaslot, Jean-Baptiste ...
ALDT
2009
Springer
140views Algorithms» more  ALDT 2009»
15 years 8 months ago
Directional Decomposition of Multiattribute Utility Functions
Abstract. Several schemes have been proposed for compactly representing multiattribute utility functions, yet none seems to achieve the level of success achieved by Bayesian and Ma...
Ronen I. Brafman, Yagil Engel
QEST
2008
IEEE
15 years 8 months ago
CaVi -- Simulation and Model Checking for Wireless Sensor Networks
CaVi provides a uniform interface to state-of-the-art simulation methods and formal verification methods for wireless sensor network. Simulation is suitable to examine the behavi...
Athanassios Boulis, Ansgar Fehnker, Matthias Fruth...