Markov decision process

We propose a new approach to verification of probabilistic processes for which the model may not be available. We use a technique from Reinforcement Learning to approximate how far...

Josee Desharnais, François Laviolette, Sami...

claim paper

Read More »

95

click to vote

KDD
2010
ACM

282views Data Mining» more KDD 2010»

Optimizing debt collections using constrained reinforcement learning

15 years 5 months ago

Download www.prem-melville.com

In this paper, we propose and develop a novel approach to the problem of optimally managing the tax, and more generally debt, collections processes at ﬁnancial institutions. Our...

Naoki Abe, Prem Melville, Cezar Pendus, Chandan K....

claim paper

Read More »

115

click to vote

ACMICEC
2007
ACM

154views ECommerce» more ACMICEC 2007»

Learning and adaptivity in interactive recommender systems

15 years 5 months ago

Download www.inf.unibz.it

Recommender systems are intelligent E-commerce applications that assist users in a decision-making process by offering personalized product recommendations during an interaction s...

Tariq Mahmood, Francesco Ricci

claim paper

Read More »

112

click to vote

ISCC
2000
IEEE

104views Communications» more ISCC 2000»

Dynamic Routing and Wavelength Assignment Using First Policy Iteration

15 years 5 months ago

Download www.netlab.tkk.fi

With standard assumptions the routing and wavelength assignment problem (RWA) can be viewed as a Markov Decision Process (MDP). The problem, however, deﬁes an exact solution bec...

Esa Hyytiä, Jorma T. Virtamo

claim paper

Read More »

138

click to vote

ICVS
2001
Springer

117views Computer Vision» more ICVS 2001»

Adapting Object Recognition across Domains: A Demonstration

15 years 5 months ago

Download www.cs.colostate.edu

High-level vision systems use object, scene or domain specific knowledge to interpret images. Unfortunately, this knowledge has to be acquired for every domain. This makes it diffi...

Bruce A. Draper, Ulrike Ahlrichs, Dietrich Paulus

claim paper

Read More »

146

click to vote

ICML
2006
IEEE

256views Machine Learning» more ICML 2006»

Automatic basis function construction for approximate dynamic programming and reinforcement learning

15 years 7 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

claim paper

Read More »

118

click to vote

FCCM
2006
IEEE

106views VLSI» more FCCM 2006»

Scalable Hardware Architecture for Real-Time Dynamic Programming Applications

15 years 7 months ago

Download www.ece.utk.edu

Abstract— This paper introduces a novel architecture for performing the core computations required by dynamic programming (DP) techniques. The latter pertain to a vast range of a...

Brad Matthews, Itamar Elhanany

claim paper

Read More »

111

click to vote

VTC
2008
IEEE

173views Communications» more VTC 2008»

Adaptive Call Admission Control with Dynamic Resource Reallocation for Cell-Based Multirate Wireless Systems

15 years 7 months ago

Download www.cc.ntut.edu.tw

—This paper studies the admission control and resource allocation in a cell-based wireless system that supports singlemedia and multirate services. Utilizing the idea of adaptive...

Kai-Wei Ke, Chen-Nien Tsai, Ho-Ting Wu, Chia-Hao H...

claim paper

Read More »

88

click to vote

CDC
2008
IEEE

140views Control Systems» more CDC 2008»

Information state for Markov decision processes with network delays

15 years 7 months ago

Download wsl.stanford.edu

We consider a networked control system, where each subsystem evolves as a Markov decision process (MDP). Each subsystem is coupled to its neighbors via communication links over wh...

Sachin Adlakha, Sanjay Lall, Andrea J. Goldsmith

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers