Search Sciweavers | Sciweavers

ATAL
2004
Springer

168 views | Intelligent Agents

Product Distribution Theory for Control of Multi-Agent Systems

13 years 11 months ago

Product Distribution (PD) theory is a new framework for controlling Multi-Agent Systems (MAS’s). First we review one motivation of PD theory, as the information-theoretic extens...

Chiu Fan Lee, David H. Wolpert

CCECE
2006
IEEE

155 views | Electrical And Computer Engi...

Regularized Fractal Image Decoding

13 years 12 months ago

Download individual.utoronto.ca

The goal of this paper is to present a new recipe for the fractal image decoding process. In this paper, we explain how fractal-based methods can be internally combined with regul...

Mehran Ebrahimi, Edward R. Vrscay

PR
2007

189 views

Information cut for clustering using a gradient descent approach

13 years 5 months ago

Download www.phys.uit.no

We introduce a new graph cut for clustering which we call the Information Cut. It is derived using Parzen windowing to estimate an information theoretic distance measure between p...

Robert Jenssen, Deniz Erdogmus, Kenneth E. Hild II...

NIPS
1998

140 views | Information Technology

Gradient Descent for General Reinforcement Learning

13 years 7 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcement-learning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

CORR
2006
Springer

113 views | Education

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

13 years 5 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ)...

Manuel Loth, Philippe Preux
