Sciweavers

» Policy Gradient in Continuous Time
ICML
2001
IEEE
14 years 6 months ago
Continuous-Time Hierarchical Reinforcement Learning
Hierarchical reinforcement learning (RL) is a general framework which studies how to exploit the structure of actions and tasks to accelerate policy learning in large domains. Pri...
Mohammad Ghavamzadeh, Sridhar Mahadevan
IOR
2010
13 years 4 months ago
No-Holdback Allocation Rules for Continuous-Time Assemble-to-Order Systems
This paper analyzes a class of common-component allocation rules, termed no-holdback (NHB) rules, in continuous-review assemble-to-order (ATO) systems. We assume that component in...
Yingdong Lu, Jing-Sheng Song, Yao Zhao
ATAL
2006
Springer
13 years 9 months ago
Gradient field-based task assignment in an AGV transportation system
Assigning tasks to agents is complex, especially in highly dynamic environments. Typical protocol-based approaches for task assignment such as Contract Net have proven their value...
Danny Weyns, Nelis Boucké, Tom Holvoet
DAIS
2010
13 years 7 months ago
gradienTv: Market-Based P2P Live Media Streaming on the Gradient Overlay
This paper presents gradienTv, a distributed, market-based approach to live streaming. In gradienTv, multiple streaming trees are constructed using a market-based approach, such th...
Amir H. Payberah, Jim Dowling, Fatemeh Rahimian, S...
JCAM
2011
12 years 8 months ago
Numerical solution of linear Volterra integral equations of the second kind with sharp gradients
Collocation methods are a well developed approach for the numerical solution of smooth and weakly-singular Volterra integral equations. In this paper we extend these methods, thro...
Samuel A. Isaacson, Robert M. Kirby