Sciweavers

92 search results - page 19 / 19
» Acting Optimally in Partially Observable Stochastic Domains
Sort
View
ML
1998
ACM
101views Machine Learning» more  ML 1998»
13 years 4 months ago
Elevator Group Control Using Multiple Reinforcement Learning Agents
Recent algorithmic and theoretical advances in reinforcement learning (RL) have attracted widespread interest. RL algorithmshave appeared that approximatedynamic programming on an ...
Robert H. Crites, Andrew G. Barto
ATAL
2005
Springer
13 years 10 months ago
Exploiting belief bounds: practical POMDPs for personal assistant agents
Agents or agent teams deployed to assist humans often face the challenges of monitoring the state of key processes in their environment (including the state of their human users t...
Pradeep Varakantham, Rajiv T. Maheswaran, Milind T...