Sciweavers

3820 search results - page 332 / 764
» Bounded Model Debugging
ML
2008
ACM
152 views, Machine Learning
15 years 4 months ago
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...
András Antos, Csaba Szepesvári, Ré...
TNN
2008
171 views
15 years 4 months ago
Adaptive Dynamic Inversion via Time-Scale Separation
Abstract. This paper presents a full state feedback adaptive dynamic inversion method for uncertain systems that depend nonlinearly upon the control input. Using a specialized set ...
Naira Hovakimyan, E. Lavretsky, Chengyu Cao
JCO
2007
100 views
15 years 4 months ago
Semi-online scheduling with "end of sequence" information
We study a variant of classical scheduling, which is called scheduling with “end of sequence” information. It is known in advance that the last job has the longest processing ...
Leah Epstein, Deshi Ye
WINET
2008
86 views
15 years 4 months ago
Relay sensor placement in wireless sensor networks
This paper addresses the following relay sensor placement problem: given the set of duty sensors in the plane and the upper bound of the transmission range, compute the minimum nu...
Xiuzhen Cheng, Ding-Zhu Du, Lusheng Wang, Baogang ...
PC
2000
100 views, Management
15 years 4 months ago
Trading accuracy for speed in parallel simulated annealing with simultaneous moves
A common approach to parallelizing simulated annealing is to generate several perturbations to the current solution simultaneously, requiring synchronization to guarantee correct eva...
M. D. Durand, Steve R. White