Search Sciweavers | Sciweavers

267 search results - page 45 / 54

» The Dynamics of Multi-Agent Reinforcement Learning

NN
2006
Springer

140 views, Neural Networks

Neural mechanism for stochastic behaviour during a competitive game

14 years 9 months ago

Download wanglab.med.yale.edu

Previous studies have shown that non-human primates can generate highly stochastic choice behaviour, especially when this is required during a competitive interaction with another...

Alireza Soltani, Daeyeol Lee, Xiao-Jing Wang

IROS
2007
IEEE

172 views, Robotics

Motor control optimization of compliant one-legged locomotion in rough terrain

15 years 3 months ago

Download groups.csail.mit.edu

While underactuated robotic systems are capable of energy efficient and rapid dynamic behavior, we still do not fully understand how body dynamics can be actively used for ada...

Fumiya Iida, Russ Tedrake

ICMLA
2004

114 views, Machine Learning

Planning with predictive state representations

14 years 11 months ago

Download www.eecs.umich.edu

Predictive state representation (PSR) models for controlled dynamical systems have recently been proposed as an alternative to traditional models such as partially observable Mark...

Michael R. James, Satinder P. Singh, Michael L. Li...

CDC
2010
IEEE

160 views, Control Systems

Adaptive bases for Q-learning

14 years 4 months ago

Download webee.technion.ac.il

We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

AINA
2006
IEEE

179 views, Computer Networks

Constrained Flooding: A Robust and Efficient Routing Framework for Wireless Sensor Networks

15 years 1 month ago

Download www.parc.com

Flooding protocols for wireless networks in general have been shown to be very inefficient and therefore are mainly used in network initialization or route discovery and maintenan...

Ying Zhang, Markus P. J. Fromherz
