Search Sciweavers | Sciweavers

24 search results - page 3 / 5

» Recommendation as a Stochastic Sequential Decision Problem

click to vote

JAIR
2011

144views more JAIR 2011»

Non-Deterministic Policies in Markovian Decision Processes

13 years 5 days ago

Download www.jair.org

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

click to vote

JAIR
2008

107views more JAIR 2008»

Planning with Durative Actions in Stochastic Domains

13 years 5 months ago

Download www.cs.washington.edu

Probabilistic planning problems are typically modeled as a Markov Decision Process (MDP). MDPs, while an otherwise expressive model, allow only for sequential, non-durative action...

Mausam, Daniel S. Weld

claim paper

Read More »

click to vote

SAC
2005
ACM

149views Applied Computing» more SAC 2005»

Stochastic scheduling of active support vector learning algorithms

13 years 10 months ago

Download www-users.cs.umn.edu

Active learning is a generic approach to accelerate training of classiﬁers in order to achieve a higher accuracy with a small number of training examples. In the past, simple ac...

Gaurav Pandey, Himanshu Gupta, Pabitra Mitra

claim paper

Read More »

click to vote

AAMAS
2010
Springer

129views Intelligent Agents» more AAMAS 2010»

Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs

13 years 5 months ago

Download anytime.cs.umass.edu

POMDPs and their decentralized multiagent counterparts, DEC-POMDPs, offer a rich framework for sequential decision making under uncertainty. Their computational complexity, howeve...

Christopher Amato, Daniel S. Bernstein, Shlomo Zil...

claim paper

Read More »

click to vote

CORR
2011
Springer

202views Education» more CORR 2011»

Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems

13 years 6 days ago

Download www.ualberta.ca

The analysis of online least squares estimation is at the heart of many stochastic sequential decision-making problems. We employ tools from the self-normalized processes to provi...

Yasin Abbasi-Yadkori, Dávid Pál, Csa...

claim paper

Read More »

« Prev « First page 3 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers