Search Sciweavers | Sciweavers

118

Voted

AIPS
2009

144views Artificial Intelligence» more AIPS 2009»

Efficient Solutions to Factored MDPs with Imprecise Transition Probabilities

15 years 2 months ago

When modeling real-world decision-theoretic planning problems in the Markov decision process (MDP) framework, it is often impossible to obtain a completely accurate estimate of tr...

Karina Valdivia Delgado, Scott Sanner, Leliane Nun...

claim paper

Read More »

121

click to vote

CORR
2006
Springer

113views Education» more CORR 2006»

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

15 years 1 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(), LSTD()...

Manuel Loth, Philippe Preux

claim paper

Read More »

181

click to vote

CSL
2012
Springer

311views Automated Reasoning» more CSL 2012»

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

13 years 9 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurcícek, Blaise Thomson, Steve Young

claim paper

Read More »

99

click to vote

MANSCI
2008

116views more MANSCI 2008»

Call Center Outsourcing: Coordinating Staffing Level and Service Quality

15 years 1 months ago

Download faculty.washington.edu

In this paper, we study the contracting issues in an outsourcing supply chain consisting of a user company and a call center that does outsourcing work for the user company. We mo...

Z. Justin Ren, Yong-Pin Zhou

claim paper

Read More »

110

click to vote

COR
2007

86views more COR 2007»

Sourcing with random yields and stochastic demand: A newsvendor approach

15 years 1 months ago

Download web.njit.edu

We studied a supplier selection problem, where a buyer, while facing random demand, is to decide ordering quantities from a set of suppliers with different yields and prices.We pr...

Shitao Yang, Jian Yang, Layek Abdel-Malek

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers