Search Sciweavers | Sciweavers

149

AIPS
2007

104views Artificial Intelligence» more AIPS 2007»

Discovering Relational Domain Features for Probabilistic Planning

15 years 8 months ago

In sequential decision-making problems formulated as Markov decision processes, state-value function approximation using domain features is a critical technique for scaling up the...

Jia-Hong Wu, Robert Givan

claim paper

Read More »

142

click to vote

ACMACE
2008
ACM

106views Human Computer Interaction» more ACMACE 2008»

AIRSF: a new entertainment adaptive framework for stress free air travels

15 years 8 months ago

Download www.idemployee.id.tue.nl

In this paper, we present a new entertainment adaptive framework AIRSF for stress free air travels. Based on the passenger's current and target comfort states, user entertain...

Hao Liu, Jun Hu, Matthias Rauterberg

claim paper

Read More »

172

click to vote

ATAL
2008
Springer

134views Intelligent Agents» more ATAL 2008»

MB-AIM-FSI: a model based framework for exploiting gradient ascent multiagent learners in strategic interactions

15 years 8 months ago

Download www.cs.utexas.edu

Future agent applications will increasingly represent human users autonomously or semi-autonomously in strategic interactions with similar entities. Hence, there is a growing need...

Doran Chakraborty, Sandip Sen

claim paper

Read More »

143

click to vote

ATAL
2008
Springer

104views Intelligent Agents» more ATAL 2008»

Expediting RL by using graphical structures

15 years 8 months ago

Download www.cs.washington.edu

The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...

Peng Dai, Alexander L. Strehl, Judy Goldsmith

claim paper

Read More »

185

click to vote

CPAIOR
2008
Springer

198views Operations Research» more CPAIOR 2008»

Amsaa: A Multistep Anticipatory Algorithm for Online Stochastic Combinatorial Optimization

15 years 7 months ago

Download cs.brown.edu

The one-step anticipatory algorithm (1s-AA) is an online algorithm making decisions under uncertainty by ignoring future non-anticipativity constraints. It makes near-optimal decis...

Luc Mercier, Pascal Van Hentenryck

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers