Sciweavers

4446 search results - page 208 / 890
» Learning Observer Agents
Sort
View
COLT
2000
Springer
15 years 8 months ago
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (  ¢¡¤£¦¥§  ), and focus on gradient ascent approache...
Peter L. Bartlett, Jonathan Baxter
ATAL
2004
Springer
15 years 10 months ago
Best-Response Multiagent Learning in Non-Stationary Environments
This paper investigates a relatively new direction in Multiagent Reinforcement Learning. Most multiagent learning techniques focus on Nash equilibria as elements of both the learn...
Michael Weinberg, Jeffrey S. Rosenschein
AGENTS
1999
Springer
15 years 8 months ago
Where to Look? Automating Attending Behaviors of Virtual Human Characters
This research proposes a computational framework for generating visual attending behavior in an embodied simulated human agent. Such behaviors directly control eye and head motion...
Sonu Chopra-Khullar, Norman I. Badler
ECAI
2010
Springer
15 years 5 months ago
Kernel Methods for Revealed Preference Analysis
In classical revealed preference analysis we are given a sequence of linear prices (i.e., additive over goods) and an agent's demand at each of the prices. The problem is to d...
Sébastien Lahaie
142
Voted
IDEAL
2000
Springer
15 years 8 months ago
Learning of Virtual Dealers in an Artificial Market: Comparison with Interview Data
Abstract. In this study we used a new agent-based approach, an artificial market approach, to analyze the ways that dealers process the information in financial news. We compared b...
Kiyoshi Izumi, Kazuhiro Ueda