Search Sciweavers | Sciweavers

771 search results - page 129 / 155

» Markov Decision Processes with Arbitrary Reward Processes

123

click to vote

ICML
2006
IEEE

136views Machine Learning» more ICML 2006»

An analytic solution to discrete Bayesian reinforcement learning

16 years 2 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

claim paper

Read More »

104

Voted

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Planning in the Presence of Cost Functions Controlled by an Adversary

16 years 2 months ago

Download www.cs.cmu.edu

We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a rob...

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

claim paper

Read More »

110

Voted

ICML
2002
IEEE

128views Machine Learning» more ICML 2002»

Pruning Improves Heuristic Search for Cost-Sensitive Learning

16 years 2 months ago

Download web.engr.oregonstate.edu

This paper addresses cost-sensitive classification in the setting where there are costs for measuring each attribute as well as costs for misclassification errors. We show how to ...

Valentina Bayer Zubek, Thomas G. Dietterich

claim paper

Read More »

123

click to vote

MOBIHOC
2008
ACM

136views Computer Networks» more MOBIHOC 2008»

Routing in a cyclic mobispace

16 years 1 months ago

Download www.cse.fau.edu

A key challenge of routing in delay tolerant networks (DTNs) is to find routes that have high delivery rates and low endto-end delays. When oracles are not available for future co...

Cong Liu, Jie Wu

claim paper

Read More »

114

click to vote

HYBRID
2010
Springer

136views Control Systems» more HYBRID 2010»

On a control algorithm for time-varying processor availability

15 years 8 months ago

Download www2.ee.kth.se

We consider an anytime control algorithm for the situation when the processor resource availability is time-varying. The basic idea is to calculate the components of the control i...

Vijay Gupta

claim paper

Read More »

« Prev « First page 129 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers