Search Sciweavers | Sciweavers

24 search results - page 3 / 5

» Recommendation as a Stochastic Sequential Decision Problem

click to vote

JAIR
2011

144views more JAIR 2011»

Non-Deterministic Policies in Markovian Decision Processes

14 years 6 months ago

Download www.jair.org

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

116

click to vote

JAIR
2008

107views more JAIR 2008»

Planning with Durative Actions in Stochastic Domains

14 years 11 months ago

Download www.cs.washington.edu

Probabilistic planning problems are typically modeled as a Markov Decision Process (MDP). MDPs, while an otherwise expressive model, allow only for sequential, non-durative action...

Mausam, Daniel S. Weld

claim paper

Read More »

101

click to vote

SAC
2005
ACM

149views Applied Computing» more SAC 2005»

Stochastic scheduling of active support vector learning algorithms

15 years 5 months ago

Download www-users.cs.umn.edu

Active learning is a generic approach to accelerate training of classiﬁers in order to achieve a higher accuracy with a small number of training examples. In the past, simple ac...

Gaurav Pandey, Himanshu Gupta, Pabitra Mitra

claim paper

Read More »

click to vote

AAMAS
2010
Springer

129views Intelligent Agents» more AAMAS 2010»

Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs

14 years 11 months ago

Download anytime.cs.umass.edu

POMDPs and their decentralized multiagent counterparts, DEC-POMDPs, offer a rich framework for sequential decision making under uncertainty. Their computational complexity, howeve...

Christopher Amato, Daniel S. Bernstein, Shlomo Zil...

claim paper

Read More »

105

click to vote

CORR
2011
Springer

202views Education» more CORR 2011»

Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems

14 years 6 months ago

Download www.ualberta.ca

The analysis of online least squares estimation is at the heart of many stochastic sequential decision-making problems. We employ tools from the self-normalized processes to provi...

Yasin Abbasi-Yadkori, Dávid Pál, Csa...

claim paper

Read More »

« Prev « First page 3 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers