Sciweavers

288 search results - page 38 / 58
» Risk-averse dynamic programming for Markov decision processe...
Sort
View
TCOM
2008
128views more  TCOM 2008»
15 years 1 months ago
Cross-Layer Rate and Power Adaptation Strategies for IR-HARQ Systems over Fading Channels with Memory: A SMDP-Based Approach
Abstract--Incremental-redundancy hybrid automatic repeatrequest (IR-HARQ) schemes are proposed in several wireless standards for increased throughput-efficiency and greater reliabi...
Ashok K. Karmokar, Dejan V. Djonin, Vijay K. Bharg...
GLOBECOM
2008
IEEE
15 years 7 months ago
Foresighted Resource Reciprocation Strategies in P2P Networks
—We consider peer-to-peer (P2P) networks, where multiple peers are interested in sharing content. While sharing resources, autonomous and self-interested peers need to make decis...
Hyunggon Park, Mihaela van der Schaar
ILP
2007
Springer
15 years 7 months ago
Building Relational World Models for Reinforcement Learning
Abstract. Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...
Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...
NIPS
2008
15 years 2 months ago
MDPs with Non-Deterministic Policies
Markov Decision Processes (MDPs) have been extensively studied and used in the context of planning and decision-making, and many methods exist to find the optimal policy for probl...
Mahdi Milani Fard, Joelle Pineau
ATAL
2003
Springer
15 years 6 months ago
Autonomy and Agent Deliberation
Abstract. An important aspect of agent autonomy is the decision making capability of the agents. We discuss several issues that agents need to deliberate about in order to decide w...
Mehdi Dastani, Frank Dignum, John-Jules Ch. Meyer