The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...
This paper demonstrates the use of simulation in an evaluative study for the technology of liver transplantation from cost-effectiveness point of view. This study is conducted in ...
Lynne P. Baldwin, Tillal Eldabi, Ray J. Paul, Andr...
Learning during backtrack search is a space-intensive process that records information (such as additional constraints) in order to avoid redundant work. In this paper, we analyze...
Effective task-level control is critical for robots that are to engage in purposeful activity in realworld environments. This paper describes PRSLite, a task-level controller grou...
The goal of this study is to evaluate the potential for using large vocabulary continuous speech recognition as an engine for automatically classifying utterances according to the...
Steve Lowe, Anne Demedts, Larry Gillick, Mark Mand...