Search Sciweavers | Sciweavers

224

ICAART
2010
INSTICC

509views Intelligent Agents» more ICAART 2010»

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

16 years 1 months ago

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...

Christos Dimitrakakis

posted by olethros

Read More »

124

click to vote

ECAI
2006
Springer

89views Artificial Intelligence» more ECAI 2006»

Learning by Automatic Option Discovery from Conditionally Terminating Sequences

15 years 7 months ago

Download www.ceng.metu.edu.tr

Abstract. This paper proposes a novel approach to discover options in the form of conditionally terminating sequences, and shows how they can be integrated into reinforcement learn...

Sertan Girgin, Faruk Polat, Reda Alhajj

claim paper

Read More »

91

click to vote

EUSFLAT
2003

121views Fuzzy Logic» more EUSFLAT 2003»

An adaptive learning algorithm for a neo fuzzy neuron

15 years 5 months ago

Download www.eusflat.org

In the paper, a new optimal learning algorithm for a neo-fuzzy neuron (NFN) is proposed. The algorithm is characteristic in that it provides online tuning of not only the synaptic...

Yevgeniy Bodyanskiy, Illya Kokshenev, Vitaliy Kolo...

claim paper

Read More »

140

click to vote

SIGECOM
2004
ACM

165views ECommerce» more SIGECOM 2004»

Adaptive limited-supply online auctions

15 years 9 months ago

Download www.cs.cornell.edu

We study a limited-supply online auction problem, in which an auctioneer has k goods to sell and bidders arrive and depart dynamically. We suppose that agent valuations are drawn ...

Mohammad Taghi Hajiaghayi, Robert D. Kleinberg, Da...

claim paper

Read More »

127

click to vote

COLT
2004
Springer

78views Machine Learning» more COLT 2004»

Online Geometric Optimization in the Bandit Setting Against an Adaptive Adversary

15 years 9 months ago

Download www.cs.cmu.edu

We give an algorithm for the bandit version of a very general online optimization problem considered by Kalai and Vempala [1], for the case of an adaptive adversary. In this proble...

H. Brendan McMahan, Avrim Blum

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers