Sciweavers

513 search results - page 92 / 103
» Metric learning for reinforcement learning agents
Sort
View
135
Voted
ICML
1999
IEEE
16 years 1 months ago
Least-Squares Temporal Difference Learning
Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...
Justin A. Boyan
114
Voted
ISPE
2003
15 years 1 months ago
Coordination in utility managed multi-agent groups
A two stage approach to co-ordination in a multi-agent society is presented. The first stage involves agents learning to co-ordinate their activities based on local and global uti...
Fernanda Barbosa, José C. Cunha, Omer F. Ra...
122
Voted
IAT
2010
IEEE
14 years 10 months ago
Selecting Operator Queries Using Expected Myopic Gain
When its human operator cannot continuously supervise (much less teleoperate) an agent, the agent should be able to recognize its limitations and ask for help when it risks making...
Robert Cohn, Michael Maxim, Edmund H. Durfee, Sati...
PE
2011
Springer
215views Optimization» more  PE 2011»
14 years 7 months ago
Energy-aware routing in the Cognitive Packet Network
An energy aware routing protocol (EARP) is proposed to minimise a performance metric that combines the total consumed power in the network and the QoS that is specified for the ...
Toktam Mahmoodi
99
Voted
AAAI
2010
15 years 1 months ago
The Model-Based Approach to Autonomous Behavior: A Personal View
The selection of the action to do next is one of the central problems faced by autonomous agents. In AI, three approaches have been used to address this problem: the programming-b...
Hector Geffner