Search Sciweavers | Sciweavers

232 search results - page 2 / 47

» Learning all optimal policies with multiple criteria

EOR
2007

133 views | more EOR 2007 »

Using genetic algorithm for dynamic and multiple criteria web-site optimizations

13 years 5 months ago

Download www.lamsade.dauphine.fr

In today’s competitive electronic marketplace, companies try to create long-lasting relations with their online customers. Log files and registration forms generate millions of...

Arben Asllani, Alireza Lari

ICASSP
2010
IEEE

224 views | Signal Processing » | more ICASSP 2010 »

Distributed learning in cognitive radio networks: Multi-armed bandit with distributed multiple players

13 years 5 months ago

Download www.ece.ucdavis.edu

We consider a cognitive radio network with distributed multiple secondary users, where each user independently searches for spectrum opportunities in multiple channels without e...

Keqin Liu, Qing Zhao

IJCAI
2003

130 views | Artificial Intelligence » | more IJCAI 2003 »

Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

13 years 6 months ago

Download www.cc.gatech.edu

We present a new algorithm, GM-Sarsa(0), for finding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processe...

Nathan Sprague, Dana H. Ballard

SECON
2008
IEEE

174 views | Communications » | more SECON 2008 »

Optimal Buffer Management Policies for Delay Tolerant Networks

13 years 11 months ago

Download people.ee.ethz.ch

Delay Tolerant Networks are wireless networks where disconnections may occur frequently due to propagation phenomena, node mobility, and power outages. Propagation delays may al...

Amir Krifa, Chadi Barakat, Thrasyvoulos Spyropoulo...

ICML
2003
IEEE

165 views | Machine Learning » | more ICML 2003 »

The Cross Entropy Method for Fast Policy Search

14 years 6 months ago

Download www.hpl.hp.com

We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...

Shie Mannor, Reuven Y. Rubinstein, Yohai Gat
