Sciweavers

255 search results - page 10 / 51
» On Online Learning of Decision Lists
Sort
View
EWRL
2008
15 years 3 months ago
Markov Decision Processes with Arbitrary Reward Processes
Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...
Jia Yuan Yu, Shie Mannor, Nahum Shimkin
129
Voted
ICML
2009
IEEE
16 years 2 months ago
Online feature elicitation in interactive optimization
Most models of utility elicitation in decision support and interactive optimization assume a predefined set of "catalog" features over which user preferences are express...
Craig Boutilier, Kevin Regan, Paolo Viappiani
115
Voted
NIPS
2007
15 years 3 months ago
Online Linear Regression and Its Application to Model-Based Reinforcement Learning
We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...
Alexander L. Strehl, Michael L. Littman
ISDA
2010
IEEE
14 years 11 months ago
Intelligent online case-based planning agent model for real-time strategy games
Research in learning and planning in real-time strategy (RTS) games is very interesting in several industries such as military industry, robotics, and most importantly game industr...
Ibrahim Fathy, Mostafa Aref, Omar Enayet, Abdelrah...
AAAI
2010
15 years 3 months ago
To Max or Not to Max: Online Learning for Speeding Up Optimal Planning
It is well known that there cannot be a single "best" heuristic for optimal planning in general. One way of overcoming this is by combining admissible heuristics (e.g. b...
Carmel Domshlak, Erez Karpas, Shaul Markovitch