Sciweavers

12 search results - page 3 / 3
» Biasing Monte-Carlo Simulations through RAVE Values
Sort
View
AAAI
2008
13 years 8 months ago
Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation
Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...
Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...
WSC
2001
13 years 7 months ago
Commander behavior and course of action selection in JWARS
The Joint Warfare System (JWARS) is being equipped with a Commander Model (CM) to perform situation assessment and Course of Action (COA) selection, and a Commander Behavior Model...
Deborah Vakas, John Prince, H. Ric Blacksten, Chuc...