Search Sciweavers | Sciweavers

166 search results - page 31 / 34

ICML
2009
IEEE

148 views, Machine Learning

Predictive representations for policy gradient in POMDPs

16 years 2 days ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

NIPS
1998

137 views, Information Technology

Risk Sensitive Reinforcement Learning

15 years 19 days ago

Download www.cs.cmu.edu

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch

NOMS
2008
IEEE

108 views, Communications

Autonomic QoS optimization of real-time internet audio using loss prediction and stochastic control

15 years 5 months ago

Download www.mnlab.cs.depaul.edu

Quality of Internet audio is highly sensitive to packet loss caused by congestion in the links. Packet loss for audio is normally rectified by adding redundancy using Forward ...

Lopa Roychoudhuri, Ehab S. Al-Shaer

COLT
2008
Springer

179 views, Machine Learning

Adapting to a Changing Environment: the Brownian Restless Bandits

15 years 1 month ago

Download research.microsoft.com

In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...

Aleksandrs Slivkins, Eli Upfal

AGENTS
2001
Springer

309 views, Security Privacy

Adjustable autonomy in real-world multi-agent environments

15 years 3 months ago

Download www.cs.cmu.edu

Through adjustable autonomy (AA), an agent can dynamically vary the degree to which it acts autonomously, allowing it to exploit human abilities to improve its performance, but wi...

Paul Scerri, David V. Pynadath, Milind Tambe
