Search Sciweavers | Sciweavers

97

AAAI
2008

169views Intelligent Agents» more AAAI 2008»

Perpetual Learning for Non-Cooperative Multiple Agents

15 years 3 days ago

This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic gam...

Luke Dickens

claim paper

Read More »

87

click to vote

EUROCAST
2007
Springer

182views Hardware» more EUROCAST 2007»

A k-NN Based Perception Scheme for Reinforcement Learning

15 years 4 months ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

claim paper

Read More »

97

click to vote

NIPS
2000

149views Information Technology» more NIPS 2000»

Learning Winner-take-all Competition Between Groups of Neurons in Lateral Inhibitory Networks

14 years 11 months ago

Download hebb.mit.edu

It has long been known that lateral inhibition in neural networks can lead to a winner-take-all competition, so that only a single neuron is active at a steady state. Here we show...

Xiaohui Xie, Richard H. R. Hahnloser, H. Sebastian...

claim paper

Read More »

102

click to vote

NECO
2007

258views more NECO 2007»

Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

14 years 9 months ago

Download www.coneural.org

The persistent modiﬁcation of synaptic efﬁcacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spiketiming-dependent plasticity (...

Razvan V. Florian

claim paper

Read More »

99

click to vote

CORR
2000
Springer

120views Education» more CORR 2000»

Scaling Up Inductive Logic Programming by Learning from Interpretations

14 years 9 months ago

Download dtai.cs.kuleuven.be

When comparing inductive logic programming (ILP) and attribute-value learning techniques, there is a trade-off between expressive power and efficiency. Inductive logic programming ...

Hendrik Blockeel, Luc De Raedt, Nico Jacobs, Bart ...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers