Search Sciweavers | Sciweavers

27 search results - page 5 / 6

» Functional derivation of a virtual machine for delimited con...

128

ML
2008
ACM

152views Machine Learning» more ML 2008»

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 8 days ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, R&ea...

claim paper

Read More »

141

Voted

ALT
1999
Springer

119views Machine Learning» more ALT 1999»

Extended Stochastic Complexity and Minimax Relative Loss Analysis

15 years 4 months ago

Download www.ibis.t.u-tokyo.ac.jp

We are concerned with the problem of sequential prediction using a givenhypothesis class of continuously-manyprediction strategies. An eectiveperformance measure is the minimax re...

Kenji Yamanishi

claim paper

Read More »

414

click to vote

Publication

4052views

On the Extraction of Curve Skeletons using Gradient Vector Flow

17 years 1 months ago

Download mecca.louisville.edu

In this paper, we propose a new variational framework for computing continuous curve skeletons from discrete objects that are suitable for structural shape representation. We have...

M. Sabry Hassouna, Aly A. Farag

posted by msabry

Read More »

105

click to vote

ATAL
2009
Springer

161views Intelligent Agents» more ATAL 2009»

Learning a model of speaker head nods using gesture corpora

15 years 7 months ago

Download people.ict.usc.edu

During face-to-face conversation, the speaker’s head is continually in motion. These movements serve a variety of important communicative functions. Our goal is to develop a mod...

Jina Lee, Stacy Marsella

claim paper

Read More »

107

click to vote

ICML
2010
IEEE

247views Machine Learning» more ICML 2010»

Inverse Optimal Control with Linearly-Solvable MDPs

15 years 1 months ago

Download www.cs.washington.edu

We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearlysolvable MDPs (LMDPs). Unlike most prior IRL algorit...

Dvijotham Krishnamurthy, Emanuel Todorov

claim paper

Read More »

« Prev « First page 5 / 6 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers