Search Sciweavers | Sciweavers

945 search results - page 38 / 189

» Dialog Convergence and Learning

NIPS
2000

161 views, Information Technology

From Margin to Sparsity

14 years 11 months ago

Download users.cecs.anu.edu.au

We present an improvement of Novikoff's perceptron convergence theorem. Reinterpreting this mistake bound as a margin-dependent sparsity guarantee allows us to give a PAC-style ...

Thore Graepel, Ralf Herbrich, Robert C. Williamson

COLT
2004
Springer

99 views, Machine Learning

Reinforcement Learning for Average Reward Zero-Sum Games

15 years 3 months ago

Download www.ece.mcgill.ca

We consider Reinforcement Learning for average reward zero-sum stochastic games. We present and analyze two algorithms. The first is based on relative Q-learning and the ...

Shie Mannor

NIPS
2003

135 views, Information Technology

Learning Bounds for a Generalized Family of Bayesian Posterior Distributions

14 years 11 months ago

Download books.nips.cc

In this paper we obtain convergence bounds for the concentration of Bayesian posterior distributions (around the true distribution) using a novel method that simplifies and enhan...

Tong Zhang

JAIR
2008

119 views

A Multiagent Reinforcement Learning Algorithm with Non-linear Dynamics

14 years 9 months ago

Download www.ece.utk.edu

Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Due to the complexity of the problem, the majority of the previo...

Sherief Abdallah, Victor R. Lesser

ICML
1995
IEEE

184 views, Machine Learning

Residual Algorithms: Reinforcement Learning with Function Approximation

15 years 10 months ago

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III
