Search Sciweavers | Sciweavers

813 search results - page 80 / 163

» Ensemble Algorithms in Reinforcement Learning

136

click to vote

AAAI
2007

85views Intelligent Agents» more AAAI 2007»

Restart Schedules for Ensembles of Problem Instances

15 years 6 months ago

Download www.cs.cmu.edu

The mean running time of a Las Vegas algorithm can often be dramatically reduced by periodically restarting it with a fresh random seed. The optimal restart schedule depends on th...

Matthew J. Streeter, Daniel Golovin, Stephen F. Sm...

claim paper

Read More »

127

click to vote

AGENTS
1999
Springer

105views Security Privacy» more AGENTS 1999»

Team-Partitioned, Opaque-Transition Reinforcement Learning

15 years 8 months ago

Download www.cs.ucf.edu

In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...

Peter Stone, Manuela M. Veloso

claim paper

Read More »

144

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 3 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

174

click to vote

NECO
2007

258views more NECO 2007»

Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

15 years 3 months ago

Download www.coneural.org

The persistent modiﬁcation of synaptic efﬁcacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spiketiming-dependent plasticity (...

Razvan V. Florian

claim paper

Read More »

142

click to vote

ATAL
2009
Springer

167views Intelligent Agents» more ATAL 2009»

Solving multiagent assignment Markov decision processes

15 years 11 months ago

Download www.aamas-conference.org

We consider the setting of multiple collaborative agents trying to complete a set of tasks as assigned by a centralized controller. We propose a scalable method called“Assignmen...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

« Prev « First page 80 / 163 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers