Search Sciweavers | Sciweavers

771 search results - page 14 / 155

» Markov Decision Processes with Arbitrary Reward Processes

JAIR
2006

Decision-Theoretic Planning with non-Markovian Rewards

15 years 1 month ago

Download www.jair.org

A decision process in which rewards depend on history rather than merely on the current state is called a decision process with non-Markovian rewards (NMRDP). In decision-theoretic...

Sylvie Thiébaux, Charles Gretton, John K. S...

GLOBECOM
2008
IEEE

Foresighted Resource Reciprocation Strategies in P2P Networks

15 years 8 months ago

Download medianetlab.ee.ucla.edu

We consider peer-to-peer (P2P) networks, where multiple peers are interested in sharing content. While sharing resources, autonomous and self-interested peers need to make decis...

Hyunggon Park, Mihaela van der Schaar

ICMAS
2000

Communication in Multi-Agent Markov Decision Processes

15 years 3 months ago

Download mas.cs.umass.edu

In this paper, we formulate an agent's decision process under the framework of Markov decision processes, and in particular, the multi-agent extension to Markov decision process...

Ping Xuan, Victor R. Lesser, Shlomo Zilberstein

ICML
2006
IEEE

Qualitative reinforcement learning

16 years 2 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

ICRA
2007
IEEE

A formal framework for robot learning and control under model uncertainty

15 years 8 months ago

Download www.cs.mcgill.ca

While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup
