Sciweavers

16 search results - page 2 / 4
» A formula of equations of states in singular learning machin...
Sort
View
ICML
1996
IEEE
13 years 9 months ago
A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning
This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...
Rémi Munos
ICML
2005
IEEE
14 years 5 months ago
Beyond the point cloud: from transductive to semi-supervised learning
Due to its occurrence in engineering domains and implications for natural learning, the problem of utilizing unlabeled data is attracting increasing attention in machine learning....
Vikas Sindhwani, Partha Niyogi, Mikhail Belkin
ICML
1996
IEEE
14 years 5 months ago
Learning Evaluation Functions for Large Acyclic Domains
Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...
Justin A. Boyan, Andrew W. Moore
ICML
2003
IEEE
14 years 5 months ago
Marginalized Kernels Between Labeled Graphs
A new kernel function between two labeled graphs is presented. Feature vectors are defined as the counts of label paths produced by random walks on graphs. The kernel computation ...
Hisashi Kashima, Koji Tsuda, Akihiro Inokuchi
ICML
2006
IEEE
14 years 5 months ago
Kernel Predictive Linear Gaussian models for nonlinear stochastic dynamical systems
The recent Predictive Linear Gaussian model (or PLG) improves upon traditional linear dynamical system models by using a predictive representation of state, which makes consistent...
David Wingate, Satinder P. Singh