Search Sciweavers | Sciweavers

29 search results - page 4 / 6

» Balancing Exploration and Exploitation: A New Algorithm for ...

click to vote

TSP
2012

366views Artificial Intelligence» more TSP 2012»

Sensing and Probing Cardinalities for Active Cognitive Radios

12 years 1 months ago

Download www1.i2r.a-star.edu.sg

—In a cognitive radio network, opportunistic spectrum access (OSA) to the underutilized spectrum involves not only sensing the spectrum occupancy but also probing the channel qua...

Thang Van Nguyen, Hyundong Shin, Tony Q. S. Quek, ...

claim paper

Read More »

click to vote

ML
2002
ACM

121views Machine Learning» more ML 2002»

Near-Optimal Reinforcement Learning in Polynomial Time

13 years 5 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

click to vote

ICML
2006
IEEE

136views Machine Learning» more ICML 2006»

An analytic solution to discrete Bayesian reinforcement learning

14 years 6 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

claim paper

Read More »

click to vote

EUROCAST
2007
Springer

182views Hardware» more EUROCAST 2007»

A k-NN Based Perception Scheme for Reinforcement Learning

13 years 12 months ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

claim paper

Read More »

click to vote

ATAL
2010
Springer

152views Intelligent Agents» more ATAL 2010»

Learning context conditions for BDI plan selection

13 years 6 months ago

Download www.cs.rmit.edu.au

An important drawback to the popular Belief, Desire, and Intentions (BDI) paradigm is that such systems include no element of learning from experience. In particular, the so-calle...

Dhirendra Singh, Sebastian Sardiña, Lin Pad...

claim paper

Read More »

« Prev « First page 4 / 6 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers