Search Sciweavers | Sciweavers

40 search results - page 2 / 8

» Markov decision process (MDP) framework for optimizing softw...

IJCAI
2007

154 views, Artificial Intelligence

A Hybridized Planner for Stochastic Domains

13 years 6 months ago

Download www.ijcai.org

Markov Decision Processes are a powerful framework for planning under uncertainty, but current algorithms have difficulties scaling to large problems. We present a novel probabil...

Mausam, Piergiorgio Bertoli, Daniel S. Weld

ICML
1994
IEEE

152 views, Machine Learning

Markov Games as a Framework for Multi-Agent Reinforcement Learning

13 years 8 months ago

Download www.cs.rutgers.edu

In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....

Michael L. Littman

JAIR
2006

122 views

Solving Factored MDPs with Hybrid State and Action Variables

13 years 5 months ago

Download www.jair.org

Efficient representations and solutions for large decision problems with continuous and discrete variables are among the most important challenges faced by the designers of automa...

Branislav Kveton, Milos Hauskrecht, Carlos Guestri...

KDD
2010
ACM

282 views, Data Mining

Optimizing debt collections using constrained reinforcement learning

13 years 9 months ago

Download www.prem-melville.com

In this paper, we propose and develop a novel approach to the problem of optimally managing the tax, and more generally debt, collections processes at financial institutions. Our...

Naoki Abe, Prem Melville, Cezar Pendus, Chandan K....

GLOBECOM
2008
IEEE

133 views, Communications

Foresighted Resource Reciprocation Strategies in P2P Networks

13 years 11 months ago

Download medianetlab.ee.ucla.edu

We consider peer-to-peer (P2P) networks, where multiple peers are interested in sharing content. While sharing resources, autonomous and self-interested peers need to make decis...

Hyunggon Park, Mihaela van der Schaar
