Sciweavers

682 search results - page 105 / 137
» One-Counter Markov Decision Processes
Sort
View
136
Voted
ICML
1999
IEEE
16 years 1 months ago
Least-Squares Temporal Difference Learning
Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...
Justin A. Boyan
120
Voted
AIPS
2007
15 years 3 months ago
Learning to Plan Using Harmonic Analysis of Diffusion Models
This paper summarizes research on a new emerging framework for learning to plan using the Markov decision process model (MDP). In this paradigm, two approaches to learning to plan...
Sridhar Mahadevan, Sarah Osentoski, Jeffrey Johns,...
95
Voted
ATAL
2005
Springer
15 years 6 months ago
Using decision-theoretic models to enhance agent system survivability
A survivable agent system depends on the incorporation of many recovery features. However, the optimal use of these features requires the ability to assess the actual state of the...
Anthony R. Cassandra, Marian H. Nodine, Shilpa Bon...
UML
2001
Springer
15 years 5 months ago
UML Modelling and Performance Analysis of Mobile Software Architectures
Modern distributed software applications generally operate in complex and heterogeneous computing environments (like the World Wide Web). Different paradigms (client-server, mobili...
Vincenzo Grassi, Raffaela Mirandola
111
Voted
ICASSP
2009
IEEE
15 years 4 months ago
Evolution of social P2P networks based on the dynamics of heterogeneous multimedia peers
In this paper, we consider social peer-to-peer (P2P) networks, where peers are sharing their resources (i.e., multimedia content and upload bandwidth). In the considered P2P netwo...
Hyunggon Park, Mihaela van der Schaar