Search Sciweavers | Sciweavers

44 search results - page 8 / 9

» Learning and Approximating the Optimal Strategy to Commit To

AI
1998
Springer

177 views, Artificial Intelligence

Model-Based Average Reward Reinforcement Learning

13 years 5 months ago

Download web.engr.oregonstate.edu

Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...

Prasad Tadepalli, DoKyeong Ok

ICML
2009
IEEE

142 views, Machine Learning

Robust bounds for classification via selective sampling

14 years 6 months ago

Download homes.dsi.unimi.it

We introduce a new algorithm for binary classification in the selective sampling protocol. Our algorithm uses Regularized Least Squares (RLS) as base classifier, and for this reas...

Nicolò Cesa-Bianchi, Claudio Gentile, Franc...

STOC
2005
ACM

129 views, Algorithms

Learning with attribute costs

14 years 6 months ago

Download www.math.tau.ac.il

We study an extension of the "standard" learning models to settings where observing the value of an attribute has an associated cost (which might be different for differ...

Haim Kaplan, Eyal Kushilevitz, Yishay Mansour

COCO
2005
Springer

99 views, Algorithms

On the Complexity of Succinct Zero-Sum Games

13 years 11 months ago

Download www.cs.caltech.edu

We study the complexity of solving succinct zero-sum games, i.e., the games whose payoff matrix M is given implicitly by a Boolean circuit C such that M(i, j) = C(i, j). We comple...

Lance Fortnow, Russell Impagliazzo, Valentine Kaba...

TSP
2012

366 views, Artificial Intelligence

Sensing and Probing Cardinalities for Active Cognitive Radios

12 years 1 month ago

Download www1.i2r.a-star.edu.sg

In a cognitive radio network, opportunistic spectrum access (OSA) to the underutilized spectrum involves not only sensing the spectrum occupancy but also probing the channel qua...

Thang Van Nguyen, Hyundong Shin, Tony Q. S. Quek, ...
