Sciweavers

408 search results - page 60 / 82
» A lookahead strategy for solving large planning problems
Sort
View
AIPS
2007
14 years 12 months ago
Robust Local Search and Its Application to Generating Robust Schedules
In this paper, we propose an extended local search framework to solve combinatorial optimization problems with data uncertainty. Our approach represents a major departure from sce...
Hoong Chuin Lau, Thomas Ou, Fei Xiao
102
Voted
AAAI
2007
14 years 12 months ago
Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games
In timed, zero-sum games, the goal is to maximize the probability of winning, which is not necessarily the same as maximizing our expected reward. We consider cumulative intermedi...
Colin McMillen, Manuela M. Veloso
AI
2011
Springer
14 years 4 months ago
A unifying action calculus
Abstract McCarthy’s Situation Calculus is arguably the oldest special-purpose knowledge representation formalism, designed to axiomatize knowledge of actions and their effects. ...
Michael Thielscher
ICML
2002
IEEE
15 years 10 months ago
Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs
One of the central challenges in reinforcement learning is to balance the exploration/exploitation tradeoff while scaling up to large problems. Although model-based reinforcement ...
Carlos Guestrin, Relu Patrascu, Dale Schuurmans
P2P
2009
IEEE
120views Communications» more  P2P 2009»
15 years 4 months ago
A Flexible Divide-And-Conquer Protocol for Multi-View Peer-to-Peer Live Streaming
Abstract—Multi-view peer-to-peer (P2P) live streaming systems have recently emerged, where a user can simultaneously watch multiple channels. Previous work on multi-view P2P stre...
Miao Wang, Lisong Xu, Byrav Ramamurthy