Search Sciweavers | Sciweavers

79 search results - page 15 / 16

» Discrete time process algebra with silent step

Voted

CORR
2006
Springer

113views Education» more CORR 2006»

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

14 years 11 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(), LSTD()...

Manuel Loth, Philippe Preux

claim paper

Read More »

101

click to vote

COMPGEOM
2004
ACM

164views Discrete Geometry» more COMPGEOM 2004»

A 2D kinetic triangulation with near-quadratic topological changes

15 years 5 months ago

Download www.cse.ohio-state.edu

Given a set of n points S in the plane, a triangulation of S is a subdivision of the convex hull into triangles whose vertices are from S. In the kinetic setting, the input point ...

Pankaj K. Agarwal, Yusu Wang, Hai Yu

claim paper

Read More »

136

click to vote

TASE
2010
IEEE

231views Software Engineering» more TASE 2010»

Coverage of a Planar Point Set With Multiple Robots Subject to Geometric Constraints

14 years 6 months ago

Download www.cs.rpi.edu

This paper focuses on the assignment of discrete points among K robots and determining the order in which the points should be processed by the robots, in the presence of geometric...

Nilanjan Chakraborty, Srinivas Akella, John T. Wen

claim paper

Read More »

click to vote

COLT
2008
Springer

179views Machine Learning» more COLT 2008»

Adapting to a Changing Environment: the Brownian Restless Bandits

15 years 1 months ago

Download research.microsoft.com

In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...

Aleksandrs Slivkins, Eli Upfal

claim paper

Read More »

100

Voted

PKDD
2010
Springer

129views Data Mining» more PKDD 2010»

Smarter Sampling in Model-Based Bayesian Reinforcement Learning

14 years 10 months ago

Download www.cs.mcgill.ca

Abstract. Bayesian reinforcement learning (RL) is aimed at making more efﬁcient use of data samples, but typically uses signiﬁcantly more computation. For discrete Markov Decis...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

« Prev « First page 15 / 16 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers