Search Sciweavers | Sciweavers

166 search results - page 9 / 34

» Online model learning in adversarial Markov decision process...

click to vote

ATAL
2005
Springer

124views Intelligent Agents» more ATAL 2005»

Rapid on-line temporal sequence prediction by an adaptive agent

15 years 4 months ago

Download gandalf.psych.umn.edu

Robust sequence prediction is an essential component of an intelligent agent acting in a dynamic world. We consider the case of near-future event prediction by an online learning ...

Steven Jensen, Daniel Boley, Maria L. Gini, Paul R...

claim paper

Read More »

click to vote

ICMLA
2009

185views Machine Learning» more ICMLA 2009»

Automatic Feature Selection for Model-Based Reinforcement Learning in Factored MDPs

14 years 9 months ago

Download staff.science.uva.nl

Abstract--Feature selection is an important challenge in machine learning. Unfortunately, most methods for automating feature selection are designed for supervised learning tasks a...

Mark Kroon, Shimon Whiteson

claim paper

Read More »

click to vote

ATAL
2008
Springer

134views Intelligent Agents» more ATAL 2008»

MB-AIM-FSI: a model based framework for exploiting gradient ascent multiagent learners in strategic interactions

15 years 1 months ago

Download www.cs.utexas.edu

Future agent applications will increasingly represent human users autonomously or semi-autonomously in strategic interactions with similar entities. Hence, there is a growing need...

Doran Chakraborty, Sandip Sen

claim paper

Read More »

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 3 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

click to vote

KDD
2010
ACM

282views Data Mining» more KDD 2010»

Optimizing debt collections using constrained reinforcement learning

15 years 3 months ago

Download www.prem-melville.com

In this paper, we propose and develop a novel approach to the problem of optimally managing the tax, and more generally debt, collections processes at ﬁnancial institutions. Our...

Naoki Abe, Prem Melville, Cezar Pendus, Chandan K....

claim paper

Read More »

« Prev « First page 9 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers