Sciweavers

948 search results - page 144 / 190
» Modelling Agents as Observable Sources
AAAI
2007
Optimizing Anthrax Outbreak Detection Using Reinforcement Learning
The potentially catastrophic impact of a bioterrorist attack makes developing effective detection methods essential for public health. In the case of anthrax attack, a delay of ho...
Masoumeh T. Izadi, David L. Buckeridge
AAAI
2008
Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning
This paper highlights the crucial role that modern machine learning techniques can play in the optimization of treatment strategies for patients with chronic disorders. In particu...
Arthur Guez, Robert D. Vincent, Massimo Avoli, Joe...
AAAI
2010
Facial Age Estimation by Learning from Label Distributions
One of the main difficulties in facial age estimation is the lack of sufficient training data for many ages. Fortunately, the faces at close ages look similar since aging is a slo...
Xin Geng, Kate Smith-Miles, Zhi-Hua Zhou
AAAI
2006
Compact, Convex Upper Bound Iteration for Approximate POMDP Planning
Partially observable Markov decision processes (POMDPs) are an intuitive and general way to model sequential decision making problems under uncertainty. Unfortunately, even approx...
Tao Wang, Pascal Poupart, Michael H. Bowling, Dale...
ATAL
2010
Springer
MAS-DisCoSim 4 PDP: a testbed for multi-agent solutions to PDPs
This demo illustrates MAS-DisCoSim 4 PDP, a testbed environment for evaluating distributed multi-agent system solutions to pickup and delivery problems (PDPs). PDPs are well-studi...
Jelle Van Gompel, Bart Tuts, Rutger Claes, Mario C...