Sciweavers

451 search results - page 2 / 91
» Performance evaluation with temporal rewards
Sort
View
AAAI
2011
12 years 5 months ago
Optimal Rewards versus Leaf-Evaluation Heuristics in Planning Agents
Planning agents often lack the computational resources needed to build full planning trees for their environments. Agent designers commonly overcome this finite-horizon approxima...
Jonathan Sorg, Satinder P. Singh, Richard L. Lewis
ENTCS
2006
143views more  ENTCS 2006»
13 years 5 months ago
Component-Oriented Specification of Performance Measures
Formal notations for system performance modeling need to be equipped with suitable notations for specifying performance measures. These companion notations have been traditionally...
Alessandro Aldini, Marco Bernardo
PRDC
1999
IEEE
13 years 9 months ago
Availability and Performance Evaluation for Automatic Protection Switching in TDMA Wireless System
In this paper, we compare the availability and performance of a wireless TDMA system with and without automatic protection switching. Stochastic reward net models are constructed ...
Hairong Sun, Yonghuan Cao, Kishor S. Trivedi, Jame...
ICML
1999
IEEE
14 years 6 months ago
Least-Squares Temporal Difference Learning
Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...
Justin A. Boyan
GECCO
2006
Springer
133views Optimization» more  GECCO 2006»
13 years 9 months ago
On-line evolutionary computation for reinforcement learning in stochastic domains
In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...
Shimon Whiteson, Peter Stone