Search Sciweavers | Sciweavers

371 search results - page 41 / 75

» The Complexity of Decentralized Control of Markov Decision P...

click to vote

AAAI
2006

108views Intelligent Agents» more AAAI 2006»

Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains

15 years 1 months ago

Download www.eecs.umich.edu

We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...

Vishal Soni, Satinder P. Singh

claim paper

Read More »

click to vote

ATAL
2006
Springer

107views Intelligent Agents» more ATAL 2006»

Winning back the CUP for distributed POMDPs: planning over continuous belief spaces

15 years 3 months ago

Download teamcore.usc.edu

Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are evolving as a popular approach for modeling multiagent systems, and many different algorithms ha...

Pradeep Varakantham, Ranjit Nair, Milind Tambe, Ma...

claim paper

Read More »

click to vote

NAACL
2007

125views Computational Linguistics» more NAACL 2007»

Comparing User Simulation Models For Dialog Strategy Learning

15 years 1 months ago

Download www.cs.pitt.edu

This paper explores what kind of user simulation model is suitable for developing a training corpus for using Markov Decision Processes (MDPs) to automatically learn dialog strate...

Hua Ai, Joel R. Tetreault, Diane J. Litman

claim paper

Read More »

click to vote

ICMCS
2006
IEEE

219views Multimedia» more ICMCS 2006»

Analysis of Multi-User Congestion Control for Video Streaming Over Wireless Networks

15 years 5 months ago

Download www.stanford.edu

When multiple video sources are live-encoded and transmitted over a common wireless network, each stream needs to adapt its encoding parameters to wireless channel ﬂuctuations, ...

Xiaoqing Zhu, Bernd Girod

claim paper

Read More »

click to vote

HYBRID
2010
Springer

136views Control Systems» more HYBRID 2010»

On a control algorithm for time-varying processor availability

15 years 6 months ago

Download www2.ee.kth.se

We consider an anytime control algorithm for the situation when the processor resource availability is time-varying. The basic idea is to calculate the components of the control i...

Vijay Gupta

claim paper

Read More »

« Prev « First page 41 / 75 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers