Search Sciweavers | Sciweavers

21 search results - page 2 / 5

» Markov Decision Processes with Multiple Long-Run Average Obj...

WINET
2010

127views more WINET 2010»

A Markov Decision Process based flow assignment framework for heterogeneous network access

13 years 3 months ago

Download www.stanford.edu

We consider a scenario where devices with multiple networking capabilities access networks with heterogeneous characteristics. In such a setting, we address the problem of effici...

Jatinder Pal Singh, Tansu Alpcan, Piyush Agrawal, ...

EUROCAST
2007
Springer

182views Hardware» more EUROCAST 2007»

A k-NN Based Perception Scheme for Reinforcement Learning

13 years 11 months ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

ICIP
2010
IEEE

177views Image Processing» more ICIP 2010»

Distributed classification of multiple observations by consensus

13 years 3 months ago

Download lts4www.epfl.ch

We consider the problem of distributed classification of multiple observations of the same object that are collected in an ad-hoc network of vision sensors. Assuming that each sen...

Effrosini Kokiopoulou, Pascal Frossard

NIPS
2001

131views Information Technology» more NIPS 2001»

The Steering Approach for Multi-Criteria Reinforcement Learning

13 years 6 months ago

Download books.nips.cc

We consider the problem of learning to attain multiple goals in a dynamic environment, which is initially unknown. In addition, the environment may contain arbitrarily varying ele...

Shie Mannor, Nahum Shimkin

ICDCS
2010
IEEE

167views Distributed And Parallel Com...» more ICDCS 2010»

Stochastic Steepest-Descent Optimization of Multiple-Objective Mobile Sensor Coverage

13 years 9 months ago

Download www.cs.purdue.edu

We propose a steepest descent method to compute optimal control parameters for balancing between multiple performance objectives in stateless stochastic scheduling, wherein the ...

Chris Y. T. Ma, David K. Y. Yau, Nung Kwan Yip, Na...
