Search Sciweavers | Sciweavers

238 search results - page 42 / 48

» Value-Function Approximations for Partially Observable Marko...

168

click to vote

MOBICOM
2009
ACM

174views Communications» more MOBICOM 2009»

Interference management via rate splitting and HARQ over time-varying fading channels

15 years 12 months ago

Download web.njit.edu

The coexistence of two unlicensed links is considered, where one link interferes with the transmission of the other, over a timevarying, block-fading channel. In the absence of fa...

Marco Levorato, Osvaldo Simeone, Urbashi Mitra

claim paper

Read More »

175

click to vote

GECCO
2008
Springer

179views Optimization» more GECCO 2008»

Emergent architecture in self organized swarm systems for military applications

15 years 6 months ago

Download www.cs.bham.ac.uk

Many sectors of the military are interested in Self-Organized (SO) systems because of their ﬂexibility, versatility and economics. The military is researching and employing auto...

Dustin J. Nowak, Gary B. Lamont, Gilbert L. Peters...

claim paper

Read More »

141

click to vote

AAAI
2007

117views Intelligent Agents» more AAAI 2007»

Optimizing Anthrax Outbreak Detection Using Reinforcement Learning

15 years 7 months ago

Download www.aaai.org

The potentially catastrophic impact of a bioterrorist attack makes developing effective detection methods essential for public health. In the case of anthrax attack, a delay of ho...

Masoumeh T. Izadi, David L. Buckeridge

claim paper

Read More »

156

click to vote

IJCAI
2001

174views Artificial Intelligence» more IJCAI 2001»

Complexity of Probabilistic Planning under Average Rewards

15 years 6 months ago

Download www.informatik.uni-freiburg.de

A general and expressive model of sequential decision making under uncertainty is provided by the Markov decision processes (MDPs) framework. Complex applications with very large ...

Jussi Rintanen

claim paper

Read More »

173

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

16 years 6 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

« Prev « First page 42 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers