Sciweavers

604 search results - page 95 / 121
» Stochastic Offline Programming
WSC
2008
14 years 12 months ago
Simulation as a tool for life cycle cost analysis
Life cycle cost is an essential approach to decide on alternative rehabilitation strategies for infrastructure systems. Monte Carlo simulation approach is used to develop a stocha...
Khaled Shahata, Tarek Zayed
CG
2006
Springer
14 years 11 months ago
Feature Construction for Reinforcement Learning in Hearts
Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-Gammon illustrated how the combination of game tree search...
Nathan R. Sturtevant, Adam M. White
NIPS
2008
14 years 11 months ago
The Infinite Factorial Hidden Markov Model
We introduce a new probability distribution over a potentially infinite number of binary Markov chains which we call the Markov Indian buffet process. This process extends the IBP...
Jurgen Van Gael, Yee Whye Teh, Zoubin Ghahramani
FLAIRS
1998
14 years 11 months ago
Optimizing Production Manufacturing Using Reinforcement Learning
Many industrial processes involve making parts with an assembly of machines, where each machine carries out an operation on a part, and the finished product requires a whole series of ...
Sridhar Mahadevan, Georgios Theocharous
AAAI
1994
14 years 11 months ago
Cost-Effective Sensing during Plan Execution
Between sensing the world after every action (as in a reactive plan) and not sensing at all (as in an open-loop plan), lies a continuum of strategies for sensing during plan execut...
Eric A. Hansen