Sciweavers

8099 search results - page 1326 / 1620
» Higher-Order Task Models
Sort
View
ICML
2005
IEEE
16 years 5 months ago
Exploration and apprenticeship learning in reinforcement learning
We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...
Pieter Abbeel, Andrew Y. Ng
ICML
2002
IEEE
16 years 5 months ago
Hierarchically Optimal Average Reward Reinforcement Learning
Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...
Mohammad Ghavamzadeh, Sridhar Mahadevan
SIGSOFT
2009
ACM
16 years 5 months ago
Static data race detection for concurrent programs with asynchronous calls
A large number of industrial concurrent programs are being designed based on a model which combines threads with event-based communication. These programs consist of several threa...
Vineet Kahlon, Nishant Sinha, Erik Kruus, Yun Zhan...
ISBI
2006
IEEE
16 years 5 months ago
Sketch initialized Snakes for rapid, accurate and repeatable interactive medical image segmentation
We combine a pen and pressure-sensitive tablet input device, and a sketch-based user initialization process, with a general subdivisioncurve Snake to create an intuitive, fast, ac...
Tim McInerney, M. Reza Akhavan Sharif
ISBI
2008
IEEE
16 years 5 months ago
Medial-based Bayesian tracking for vascular segmentation: Application to coronary arteries in 3D CT angiography
We propose a new Bayesian, stochastic tracking algorithm for the segmentation of blood vessels from 3D medical image data. Inspired by the recent developments in particle filterin...
David Lesage, Elsa D. Angelini, Isabelle Bloch, Ga...
« Prev « First page 1326 / 1620 Last » Next »