Search Sciweavers | Sciweavers

96

ATAL
2008
Springer

138views Intelligent Agents» more ATAL 2008»

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

14 years 11 months ago

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

76

click to vote

ALGOSENSORS
2006
Springer

92views Sensor Networks» more ALGOSENSORS 2006»

Area Based Beaconless Reliable Broadcasting in Sensor Networks

15 years 1 months ago

Download www.site.uottawa.ca

: We consider the broadcasting problem in sensor networks where the nodes have no prior knowledge of their neighbourhood. We describe several Area-based Beaconless Broadcasting Alg...

Francisco Javier Ovalle-Martínez, Amiya Nay...

claim paper

Read More »

70

click to vote

CP
2006
Springer

121views Artificial Intelligence» more CP 2006»

A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem

15 years 1 months ago

Download www.cs.cmu.edu

The max k-armed bandit problem is a recently-introduced online optimization problem with practical applications to heuristic search. Given a set of k slot machines, each yielding p...

Matthew J. Streeter, Stephen F. Smith

claim paper

Read More »

76

click to vote

NIPS
2007

135views Information Technology» more NIPS 2007»

The Price of Bandit Information for Online Optimization

14 years 11 months ago

Download books.nips.cc

In the online linear optimization problem, a learner must choose, in each round, a decision from a set D ⊂ Rn in order to minimize an (unknown and changing) linear cost function...

Varsha Dani, Thomas P. Hayes, Sham Kakade

claim paper

Read More »

75

click to vote

ICML
2003
IEEE

122views Machine Learning» more ICML 2003»

Online Feature Selection using Grafting

15 years 10 months ago

Download www.hpl.hp.com

In the standard feature selection problem, we are given a fixed set of candidate features for use in a learning problem, and must select a subset that will be used to train a mode...

Simon Perkins, James Theiler

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers