Sciweavers

373 search results - page 44 / 75
» Covariant Policy Search
TSP
2010
14 years 7 months ago
Distributed learning in multi-armed bandit with multiple players
We formulate and study a decentralized multi-armed bandit (MAB) problem. There are distributed players competing for independent arms. Each arm, when played, offers i.i.d. reward a...
Keqin Liu, Qing Zhao
CDC
2010
IEEE
14 years 4 months ago
Persistent patrol with limited-range on-board sensors
We propose and analyze the Persistent Patrol Problem (PPP). An unmanned aerial vehicle (UAV) moving with constant speed and unbounded acceleration patrols a bounded region of t...
Vu Anh Huynh, John Enright, Emilio Frazzoli
DSN
2008
IEEE
15 years 7 months ago
Scheduling algorithms for unpredictably heterogeneous CMP architectures
In future large-scale multi-core microprocessors, hard errors and process variations will create dynamic heterogeneity, causing performance and power characteristics to differ amo...
Jonathan A. Winter, David H. Albonesi
CSCLP
2006
Springer
15 years 4 months ago
Cost-Based Filtering for Stochastic Inventory Control
Abstract. An interesting class of production/inventory control problems considers a single product and a single stocking location, given a stochastic demand with a known non-statio...
Armagan Tarim, Brahim Hnich, Roberto Rossi, Steven...
PPL
2008
15 years 11 days ago
Using Hardware Multithreading to Overcome Broadcast/Reduction Latency in an Associative SIMD Processor
The latency of broadcast/reduction operations has a significant impact on the performance of SIMD processors. This is especially true for associative programs, which make extensiv...
Kevin Schaffer, Robert A. Walker