Search Sciweavers | Sciweavers

27 search results - page 3 / 6

» Efficient Behavior Learning Based on State Value Estimation ...

ICML
2003
IEEE

171 views, Machine Learning

Learning To Cooperate in a Social Dilemma: A Satisficing Approach to Bargaining

14 years 6 months ago

Download www.aaai.org

Learning in many multi-agent settings is inherently repeated play. This calls into question the naive application of single play Nash equilibria in multi-agent learning and sugges...

Jeff L. Stimpson, Michael A. Goodrich

ICCAD
1999
IEEE

95 views, Hardware

Dynamic power management using adaptive learning tree

13 years 9 months ago

Download dtl.yonsei.ac.kr

Dynamic Power Management (DPM) is a technique to reduce power consumption of electronic systems by selectively shutting down idle components. The quality of the shutdown control a...

Eui-Young Chung, Luca Benini, Giovanni De Micheli

IROS
2008
IEEE

125 views, Robotics

Dynamic correlation matrix based multi-Q learning for a multi-robot system

13 years 11 months ago

Download www.ece.stevens-tech.edu

Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...

Hongliang Guo, Yan Meng

JMLR
2010

137 views

Importance Sampling for Continuous Time Bayesian Networks

12 years 12 months ago

Download jmlr.csail.mit.edu

A continuous time Bayesian network (CTBN) uses a structured representation to describe a dynamic system with a finite number of states which evolves in continuous time. Exact infe...

Yu Fan, Jing Xu, Christian R. Shelton

NIPS
1996

192 views, Information Technology

Multidimensional Triangulation and Interpolation for Reinforcement Learning

13 years 6 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies
