Sciweavers

1080 search results - page 91 / 216
» Problem dependent optimization (PDO)
Sort
View
JMLR
2010
148views more  JMLR 2010»
14 years 11 months ago
A Generalized Path Integral Control Approach to Reinforcement Learning
With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...
Evangelos Theodorou, Jonas Buchli, Stefan Schaal
SIGMETRICS
2012
ACM
248views Hardware» more  SIGMETRICS 2012»
13 years 6 months ago
Pricing cloud bandwidth reservations under demand uncertainty
In a public cloud, bandwidth is traditionally priced in a pay-asyou-go model. Reflecting the recent trend of augmenting cloud computing with bandwidth guarantees, we consider a n...
Di Niu, Chen Feng, Baochun Li
SIAMCO
2008
70views more  SIAMCO 2008»
15 years 4 months ago
Minimal Time Sequential Batch Reactors with Bounded and Impulse Controls for One or More Species
We consider the optimal control problem of feeding in minimal time a tank where several species compete for a single resource, with the objective being to reach a given level of th...
Pedro Gajardo, Héctor Ramírez Cabrer...
CORR
2007
Springer
93views Education» more  CORR 2007»
15 years 4 months ago
Simultaneous Communication of Data and State
We consider the problem of transmitting data at rate R over a state dependent channel p(ylx, s) with the state information available at the sender and at the same time conveying th...
Thomas M. Cover, Young-Han Kim, Arak Sutivong
TPDS
2008
88views more  TPDS 2008»
15 years 4 months ago
A Two-Hop Solution to Solving Topology Mismatch
The efficiency of Peer-to-Peer (P2P) systems is largely dependent on the overlay constructions. Due to the random selection of logical neighbors, there often exists serious topolog...
Yunhao Liu