takes place in an abstract model, connected to the real game by passing messages back and forth, as Figure 2 illustrates. The game tells the DM when plot points occur, and the DM tells the game when i...
Mark J. Nelson, Michael Mateas, David L. Roberts, ...
Parallel programs that use critical sections and are executed on a shared-memory multiprocessor with a write-invalidate protocol result in invalidation actions that could be elimin...
There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...
This work presents a lookahead-based exploration strategy for a model-based learning agent that enables exploration of the opponent's behavior during interaction in a multi-a...
We present the STack ARchitecture (STAR) automaton. It is a fixed structure, multiaction, reward-penalty learning automaton, characterized by a star-shaped state transiti...