Sciweavers

805 search results - page 90 / 161
» The Measurable Space of Stochastic Processes
ICRA
2007
IEEE
15 years 4 months ago
Adaptive Play Q-Learning with Initial Heuristic Approximation
Abstract— Effective coordination of multiple autonomous robots is one of the most important problems in modern robotics. In turn, it is well known that the lea...
Andriy Burkov, Brahim Chaib-draa
3DPVT
2006
IEEE
15 years 1 month ago
Statistical Inference of Biological Structure and Point Spread Functions in 3D Microscopy
We present a novel method for detecting and quantifying 3D structure in stacks of microscopic images captured at incremental focal lengths. We express the image data as stochastic...
Joseph Schlecht, Kobus Barnard, Barry Pryor
AAAI
2007
15 years 5 days ago
Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games
In timed, zero-sum games, the goal is to maximize the probability of winning, which is not necessarily the same as maximizing our expected reward. We consider cumulative intermedi...
Colin McMillen, Manuela M. Veloso
ATAL
2008
Springer
14 years 12 months ago
Searching for approximate equilibria in empirical games
When exploring a game over a large strategy space, it may not be feasible or cost-effective to evaluate the payoff of every relevant strategy profile. For example, determining a p...
Patrick R. Jordan, Yevgeniy Vorobeychik, Michael P...
NIPS
2003
14 years 11 months ago
Approximate Policy Iteration with a Policy Language Bias
We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...
Alan Fern, Sung Wook Yoon, Robert Givan