Sciweavers

53 search results - page 10 / 11
» Shaping multi-agent systems with gradient reinforcement lear...
ANOR
2005
13 years 5 months ago
Entropic Penalties in Finite Games
The main objects here are finite-strategy games in which entropic terms are subtracted from the payoffs. After such subtraction each Nash equilibrium solves an explicit, unconstra...
Sjur Didrik Flåm, E. Cavazzuti
SIGDIAL
2010
13 years 3 months ago
Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy
This paper presents a spoken dialogue framework that helps users in making decisions. Users often do not have a definite goal or criteria for selecting from a list of alternatives...
Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chi...
NETCOOP
2007
Springer
13 years 12 months ago
Load Shared Sequential Routing in MPLS Networks: System and User Optimal Solutions
Recently Gerald Ash has shown through case studies that event dependent routing is attractive in large scale multi-service MPLS networks. In this paper, we consider the application...
Gilles Brunet, Fariba Heidari, Lorne Mason
NIPS
2001
13 years 7 months ago
Linking Motor Learning to Function Approximation: Learning in an Unlearnable Force Field
Reaching movements require the brain to generate motor commands that rely on an internal model of the task's dynamics. Here we consider the errors that subjects make early in...
O. Donchin, Reza Shadmehr
CVPR
2011
IEEE
13 years 1 month ago
Learning and Matching Multiscale Template Descriptors for Real-Time Detection, Localization and Tracking
We describe a system to learn an object template from a video stream, and localize and track the corresponding object in live video. The template is decomposed into a number of lo...
Taehee Lee, Stefano Soatto