Search Sciweavers | Sciweavers

41 search results - page 8 / 9

» Heuristic Reinforcement Learning Applied to RoboCup Simulati...

click to vote

NIPS
1997

121views Information Technology» more NIPS 1997»

Generalized Prioritized Sweeping

15 years 27 days ago

Download www.cs.huji.ac.il

Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent’s limited computational resources to achieve a good estimate of the value of ...

David Andre, Nir Friedman, Ronald Parr

claim paper

Read More »

116

click to vote

ATAL
2011
Springer

220views Intelligent Agents» more ATAL 2011»

Using iterated reasoning to predict opponent strategies

13 years 11 months ago

Download paul.rutgers.edu

The ﬁeld of multiagent decision making is extending its tools from classical game theory by embracing reinforcement learning, statistical analysis, and opponent modeling. For ex...

Michael Wunder, Michael Kaisers, John Robert Yaros...

claim paper

Read More »

click to vote

CONTEXT
1999
Springer

77views Modeling and Simulation» more CONTEXT 1999»

The Pragmatic Roots of Context

15 years 3 months ago

Download cfpm.org

When modelling complex systems one can not include all the causal factors, but one has to settle for partial models. This is alright if the factors left out are either so constant...

Bruce Edmonds

claim paper

Read More »

click to vote

ICML
1994
IEEE

152views Machine Learning» more ICML 1994»

A Modular Q-Learning Architecture for Manipulator Task Decomposition

15 years 3 months ago

Download mi.eng.cam.ac.uk

Compositional Q-Learning (CQ-L) (Singh 1992) is a modular approach to learning to performcomposite tasks made up of several elemental tasks by reinforcement learning. Skills acqui...

Chen K. Tham, Richard W. Prager

claim paper

Read More »

click to vote

AAAI
2008

204views Intelligent Agents» more AAAI 2008»

Adaptive Management of Air Traffic Flow: A Multiagent Coordination Approach

15 years 1 months ago

Download www.aaai.org

This paper summarizes recent advances in the application of multiagent coordination algorithms to air traffic flow management. Indeed, air traffic flow management is one of the fu...

Kagan Tumer, Adrian K. Agogino

claim paper

Read More »

« Prev « First page 8 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers