Search Sciweavers | Sciweavers

2990 search results - page 553 / 598

» Hidden Markov processes

176

click to vote

NIPS
2003

145views Information Technology» more NIPS 2003»

A Nonlinear Predictive State Representation

15 years 7 months ago

Download books.nips.cc

Predictive state representations (PSRs) use predictions of a set of tests to represent the state of controlled dynamical systems. One reason why this representation is exciting as...

Matthew R. Rudary, Satinder P. Singh

claim paper

Read More »

145

click to vote

SERP
2003

105views Software Engineering» more SERP 2003»

Reliability Modeling Using UML

15 years 7 months ago

Download xcr.cenit.latech.edu

System reliability has become an increasingly important benchmark in measuring service continuity. As part of many service level agreements, system performance is gauged by how lo...

Chokchai Leangsuksun, Hertong Song, Lixin Shen

claim paper

Read More »

136

click to vote

AIPS
2000

129views Artificial Intelligence» more AIPS 2000»

Representations of Decision-Theoretic Planning Tasks

15 years 7 months ago

Download www.aaai.org

Goal-directed Markov Decision Process models (GDMDPs) are good models for many decision-theoretic planning tasks. They have been used in conjunction with two different reward stru...

Sven Koenig, Yaxin Liu

claim paper

Read More »

191

click to vote

UAI
2000

136views Artificial Intelligence» more UAI 2000»

Fast Planning in Stochastic Games

15 years 7 months ago

Download www.cis.upenn.edu

Stochastic games generalize Markov decision processes MDPs to a multiagent setting by allowing the state transitions to depend jointly on all player actions, and having rewards de...

Michael J. Kearns, Yishay Mansour, Satinder P. Sin...

claim paper

Read More »

174

click to vote

ATAL
2010
Springer

115views Intelligent Agents» more ATAL 2010»

Self-organization for coordinating decentralized reinforcement learning

15 years 6 months ago

Download www.cs.umass.edu

Decentralized reinforcement learning (DRL) has been applied to a number of distributed applications. However, one of the main challenges faced by DRL is its convergence. Previous ...

Chongjie Zhang, Victor R. Lesser, Sherief Abdallah

claim paper

Read More »

« Prev « First page 553 / 598 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers