Search Sciweavers | Sciweavers

829 search results - page 16 / 166

» A time aggregation approach to Markov decision processes

109

Voted

UAI
2000

133views Artificial Intelligence» more UAI 2000»

PEGASUS: A policy search method for large MDPs and POMDPs

15 years 3 months ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

claim paper

Read More »

122

click to vote

ENTCS
2008

91views more ENTCS 2008»

Expressing Priorities and External Probabilities in Process Algebra via Mixed Open/Closed Systems

15 years 1 months ago

Download www.cs.unibo.it

Defining operational semantics for a process algebra is often based either on labeled transition systems that account for interaction with a context or on the so-called reduction ...

Mario Bravetti

claim paper

Read More »

121

click to vote

ISSS
1999
IEEE

121views Hardware» more ISSS 1999»

Event-Driven Power Management of Portable Systems

15 years 6 months ago

Download si2.epfl.ch

The policy optimization problem for dynamic power management has received considerable attention in the recent past. We formulate policy optimization as a constrained optimization...

Tajana Simunic, Giovanni De Micheli, Luca Benini

claim paper

Read More »

128

Voted

ICANN
2001
Springer

123views Neural Networks» more ICANN 2001»

Market-Based Reinforcement Learning in Partially Observable Worlds

15 years 6 months ago

Download www.hutter1.net

Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...

Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber

claim paper

Read More »

139

click to vote

SIAMSC
2008

148views more SIAMSC 2008»

Multilevel Adaptive Aggregation for Markov Chains, with Application to Web Ranking

15 years 1 months ago

Download amath.colorado.edu

A multilevel adaptive aggregation method for calculating the stationary probability vector of an irreducible stochastic matrix is described. The method is a special case of the ada...

Hans De Sterck, Thomas A. Manteuffel, Stephen F. M...

claim paper

Read More »

« Prev « First page 16 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers