Search Sciweavers | Sciweavers

162 search results - page 18 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...

120

click to vote

NIPS
2004

128views Information Technology» more NIPS 2004»

A Cost-Shaping LP for Bellman Error Minimization with Performance Guarantees

15 years 7 months ago

Download books.nips.cc

We introduce a new algorithm based on linear programming that approximates the differential value function of an average-cost Markov decision process via a linear combination of p...

Daniela Pucci de Farias, Benjamin Van Roy

claim paper

Read More »

136

click to vote

AAAI
2007

100views Intelligent Agents» more AAAI 2007»

Compact Spectral Bases for Value Function Approximation Using Kronecker Factorization

15 years 8 months ago

Download www.cs.umass.edu

A new spectral approach to value function approximation has recently been proposed to automatically construct basis functions from samples. Global basis functions called proto-val...

Jeffrey Johns, Sridhar Mahadevan, Chang Wang

claim paper

Read More »

285

click to vote

Publication

233views

Sparse reward processes

14 years 4 months ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

Read More »

154

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Planning in the Presence of Cost Functions Controlled by an Adversary

16 years 6 months ago

Download www.cs.cmu.edu

We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a rob...

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

claim paper

Read More »

145

click to vote

DMKD
2004
ACM

139views Data Mining» more DMKD 2004»

Iterative record linkage for cleaning and integration

15 years 11 months ago

Download www.cs.washington.edu

Record linkage, the problem of determining when two records refer to the same entity, has applications for both data cleaning (deduplication) and for integrating data from multipl...

Indrajit Bhattacharya, Lise Getoor

claim paper

Read More »

« Prev « First page 18 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers