Sciweavers

56 search results - page 9 / 12
» Q-Learning in Continuous State and Action Spaces
Sort
View
93
Voted
ATAL
2008
Springer
15 years 1 months ago
Controlling deliberation in a Markov decision process-based agent
Meta-level control manages the allocation of limited resources to deliberative actions. This paper discusses efforts in adding meta-level control capabilities to a Markov Decision...
George Alexander, Anita Raja, David J. Musliner
ATAL
2005
Springer
15 years 5 months ago
Multi-agent reward analysis for learning in noisy domains
In many multi agent learning problems, it is difficult to determine, a priori, the agent reward structure that will lead to good performance. This problem is particularly pronoun...
Adrian K. Agogino, Kagan Tumer
112
Voted
PUK
2000
15 years 1 months ago
Knowledge-Based Control of Decision Theoretic Planning - Adaptive Planning Model Selection
This paper proposes a new planning architecture for agents operating in uncertain and dynamic environments. Decisiontheoretic planning has been recognized as a useful tool for rea...
Jun Miura, Yoshiaki Shirai
ICML
2001
IEEE
16 years 14 days ago
Direct Policy Search using Paired Statistical Tests
Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...
Malcolm J. A. Strens, Andrew W. Moore
HICSS
2006
IEEE
120views Biometrics» more  HICSS 2006»
15 years 5 months ago
Systems Thinking and Information Literacy: Elements of a Knowledge Enabling Workplace Environment
Dynamic technology-driven circumstances fortify academic librarians’ reconsideration of their professional purposes, processes and relationships. In response, California Polytec...
Mary M. Somerville, Anita Mirijamdotter, Lydia Col...