Sciweavers

2005 search results - page 312 / 401
» Decisive Markov Chains
Sort
View
ICML
2008
IEEE
16 years 21 days ago
Manifold alignment using Procrustes analysis
In this paper we introduce a novel approach to manifold alignment, based on Procrustes analysis. Our approach differs from "semisupervised alignment" in that it results ...
Chang Wang, Sridhar Mahadevan
ICML
2003
IEEE
16 years 21 days ago
Exploration in Metric State Spaces
We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...
Sham Kakade, Michael J. Kearns, John Langford
ICML
2001
IEEE
16 years 21 days ago
Continuous-Time Hierarchical Reinforcement Learning
Hierarchical reinforcement learning (RL) is a general framework which studies how to exploit the structure of actions and tasks to accelerate policy learning in large domains. Pri...
Mohammad Ghavamzadeh, Sridhar Mahadevan
EVOW
2009
Springer
15 years 6 months ago
Grid Coevolution for Adaptive Simulations: Application to the Building of Opening Books in the Game of Go
This paper presents a successful application of parallel (grid) coevolution applied to the building of an opening book (OB) in 9x9 Go. Known sayings around the game of Go are refou...
Pierre Audouard, Guillaume Chaslot, Jean-Baptiste ...
IFIP
2009
Springer
15 years 6 months ago
HMM-Based Trust Model
Probabilistic trust has been adopted as an approach to taking security sensitive decisions in modern global computing environments. Existing probabilistic trust frameworks either a...
Ehab ElSalamouny, Vladimiro Sassone, Mogens Nielse...