Search Sciweavers | Sciweavers

» Metrics for Finite Markov Decision Processes

AIPS
2008

Criticality Metrics for Distributed Plan and Schedule Management

15 years 3 months ago

Download www.aaai.org

We address the problem of coordinating the plans and schedules for a team of agents in an uncertain and dynamic environment. Bounded rationality, bounded communication, subjectivi...

Rajiv T. Maheswaran, Pedro A. Szekely

NIPS
2008

Bayesian Model of Behaviour in Economic Games

15 years 2 months ago

Download www.gatsby.ucl.ac.uk

Classical game theoretic approaches that make strong rationality assumptions have difficulty modeling human behaviour in economic games. We investigate the role of finite levels o...

Debajyoti Ray, Brooks King-Casas, P. Read Montague...

INFOCOM
2011
IEEE

A high-throughput routing metric for reliable multicast in multi-rate wireless mesh networks

14 years 4 months ago

Download www.cse.unsw.edu.au

We propose a routing metric for enabling high-throughput reliable multicast in multi-rate wireless mesh networks. This new multicast routing metric, called expected multi...

Xin Zhao, Jun Guo, Chun Tung Chou, Archan Misra, S...

FLAIRS
2001

Probabilistic Planning for Behavior-Based Robots

15 years 2 months ago

Download www.atrash.com

Partially Observable Markov Decision Process models (POMDPs) have been applied to low-level robot control. We show how to use POMDPs differently, namely for sensor planning in the ...

Amin Atrash, Sven Koenig

ICML
2006
IEEE

PAC model-free reinforcement learning

16 years 2 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...
