Search Sciweavers | Sciweavers

92 search results - page 6 / 19

» Acting Optimally in Partially Observable Stochastic Domains

ICML
1995
IEEE

213 views, Machine Learning

Learning Policies for Partially Observable Environments: Scaling Up

16 years 2 months ago

Download reference.kfupm.edu.sa

Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...

Michael L. Littman, Anthony R. Cassandra, Leslie P...

ATAL
2004
Springer

120 views, Intelligent Agents

Communication for Improving Policy Computation in Distributed POMDPs

15 years 7 months ago

Download teamcore.usc.edu

Distributed Partially Observable Markov Decision Problems (POMDPs) are emerging as a popular approach for modeling multiagent teamwork where a group of agents work together to joi...

Ranjit Nair, Milind Tambe, Maayan Roth, Makoto Yok...

AMC
2008

88 views

Stopping rules for box-constrained stochastic global optimization

15 years 2 months ago

Download zeus.cs.uoi.gr

We present three new stopping rules for Multistart based methods. The first uses a device that enables the determination of the coverage of the bounded search domain. The second i...

Isaac E. Lagaris, Ioannis G. Tsoulos

DATE
2008
IEEE

136 views, Hardware

A Framework of Stochastic Power Management Using Hidden Markov Model

15 years 8 months ago

Download www.date-conference.com

The effectiveness of stochastic power management relies on the accurate system and workload model and effective policy optimization. Workload modeling is a machine learning proce...

Ying Tan, Qinru Qiu

FOCS
2007
IEEE

157 views, Theoretical Computer Science

Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards

15 years 8 months ago

Download www.cis.upenn.edu

We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...

Sudipto Guha, Kamesh Munagala
