Search Sciweavers | Sciweavers

682 search results - page 121 / 137

» One-Counter Markov Decision Processes

Voted

ACMACE
2008
ACM

106views Human Computer Interaction» more ACMACE 2008»

AIRSF: a new entertainment adaptive framework for stress free air travels

15 years 2 months ago

Download www.idemployee.id.tue.nl

In this paper, we present a new entertainment adaptive framework AIRSF for stress free air travels. Based on the passenger's current and target comfort states, user entertain...

Hao Liu, Jun Hu, Matthias Rauterberg

claim paper

Read More »

117

click to vote

ATAL
2008
Springer

134views Intelligent Agents» more ATAL 2008»

MB-AIM-FSI: a model based framework for exploiting gradient ascent multiagent learners in strategic interactions

15 years 2 months ago

Download www.cs.utexas.edu

Future agent applications will increasingly represent human users autonomously or semi-autonomously in strategic interactions with similar entities. Hence, there is a growing need...

Doran Chakraborty, Sandip Sen

claim paper

Read More »

Voted

ATAL
2008
Springer

104views Intelligent Agents» more ATAL 2008»

Expediting RL by using graphical structures

15 years 2 months ago

Download www.cs.washington.edu

The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...

Peng Dai, Alexander L. Strehl, Judy Goldsmith

claim paper

Read More »

Voted

AAAI
2010

163views Intelligent Agents» more AAAI 2010»

Structured Parameter Elicitation

15 years 2 months ago

Download motion.comp.nus.edu.sg

The behavior of a complex system often depends on parameters whose values are unknown in advance. To operate effectively, an autonomous agent must actively gather information on t...

Li Ling Ko, David Hsu, Wee Sun Lee, Sylvie C. W. O...

claim paper

Read More »

108

click to vote

AAAI
2010

185views Intelligent Agents» more AAAI 2010»

Symbolic Dynamic Programming for First-order POMDPs

15 years 2 months ago

Download www-kd.iai.uni-bonn.de

Partially-observable Markov decision processes (POMDPs) provide a powerful model for sequential decision-making problems with partially-observed state and are known to have (appro...

Scott Sanner, Kristian Kersting

claim paper

Read More »

« Prev « First page 121 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers