Search Sciweavers | Sciweavers

813 search results - page 48 / 163

» Ensemble Algorithms in Reinforcement Learning

149

Voted

ECAI
2010
Springer

238views Artificial Intelligence» more ECAI 2010»

The Dynamics of Multi-Agent Reinforcement Learning

15 years 5 months ago

Download www.doc.ic.ac.uk

Abstract. Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...

Luke Dickens, Krysia Broda, Alessandra Russo

claim paper

Read More »

180

click to vote

AI
2002
Springer

171views Artificial Intelligence» more AI 2002»

Multiagent learning using a variable learning rate

15 years 4 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

129

click to vote

PKDD
2009
Springer

144views Data Mining» more PKDD 2009»

Compositional Models for Reinforcement Learning

15 years 10 months ago

Download userweb.cs.utexas.edu

Abstract. Innovations such as optimistic exploration, function approximation, and hierarchical decomposition have helped scale reinforcement learning to more complex environments, ...

Nicholas K. Jong, Peter Stone

claim paper

Read More »

119

click to vote

WSC
2008

154views Modeling And Simulation» more WSC 2008»

On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning

15 years 6 months ago

Download www.informs-sim.org

Reinforcement Learning (RL) is a simulation-based technique useful in solving Markov decision processes if their transition probabilities are not easily obtainable or if the probl...

Abhijit Gosavi

claim paper

Read More »

167

click to vote

TAL
2010
Springer

178views Natural Language Processing» more TAL 2010»

Robust Semi-supervised and Ensemble-Based Methods in Word Sense Disambiguation

15 years 2 months ago

Download cst.dk

Mihalcea [1] discusses self-training and co-training in the context of word sense disambiguation and shows that parameter optimization on individual words was important to obtain g...

Anders Søgaard, Anders Johannsen

claim paper

Read More »

« Prev « First page 48 / 163 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers