Search Sciweavers | Sciweavers

3837 search results - page 66 / 768

» Learning Approximate Consistencies

134

click to vote

ENGL
2007

148views more ENGL 2007»

A General Reflex Fuzzy Min-Max Neural Network

15 years 1 months ago

Download www.engineeringletters.com

—“A General Reflex Fuzzy Min-Max Neural Network” (GRFMN) is presented. GRFMN is capable to extract the underlying structure of the data by means of supervised, unsupervised a...

Abhijeet V. Nandedkar, Prabir Kumar Biswas

claim paper

Read More »

131

Voted

ATAL
2007
Springer

122views Intelligent Agents» more ATAL 2007»

Reducing the complexity of multiagent reinforcement learning

15 years 8 months ago

Download www.damas.ift.ulaval.ca

It is known that the complexity of the reinforcement learning algorithms, such as Q-learning, may be exponential in the number of environment’s states. It was shown, however, th...

Andriy Burkov, Brahim Chaib-draa

claim paper

Read More »

154

click to vote

ISAAC
2005
Springer

127views Algorithms» more ISAAC 2005»

On Complexity and Approximability of the Labeled Maximum/Perfect Matching Problems

15 years 7 months ago

Download www.lamsade.dauphine.fr

In this paper, we deal with both the complexity and the approximability of the labeled perfect matching problem in bipartite graphs. Given a simple graph G = (V, E) with n vertices...

Jérôme Monnot

claim paper

Read More »

115

click to vote

ML
2002
ACM

154views Machine Learning» more ML 2002»

Technical Update: Least-Squares Temporal Difference Learning

15 years 1 months ago

Download www.research.rutgers.edu

TD() is a popular family of algorithms for approximate policy evaluation in large MDPs. TD() works by incrementally updating the value function after each observed transition. It h...

Justin A. Boyan

claim paper

Read More »

click to vote

ICML
2007
IEEE

162views Machine Learning» more ICML 2007»

Automatic shaping and decomposition of reward functions

16 years 2 months ago

Download www.machinelearning.org

This paper investigates the problem of automatically learning how to restructure the reward function of a Markov decision process so as to speed up reinforcement learning. We begi...

Bhaskara Marthi

claim paper

Read More »

« Prev « First page 66 / 768 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers