Search Sciweavers | Sciweavers

166 search results - page 17 / 34

» Online model learning in adversarial Markov decision process...

102

click to vote

CORR
2008
Springer

173views Education» more CORR 2008»

Decomposition Principles and Online Learning in Cross-Layer Optimization for Delay-Sensitive Applications

14 years 11 months ago

Download documents.scribd.com

In this paper, we propose a general cross-layer optimization framework in which we explicitly consider both the heterogeneous and dynamically changing characteristics of delay-sens...

Fangwen Fu, Mihaela van der Schaar

claim paper

Read More »

153

click to vote

ILP
2007
Springer

283views Automated Reasoning» more ILP 2007»

Building Relational World Models for Reinforcement Learning

15 years 5 months ago

Download ftp.cs.wisc.edu

Abstract. Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...

Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...

claim paper

Read More »

118

click to vote

ECML
2007
Springer

170views Machine Learning» more ECML 2007»

Sequence Labeling with Reinforcement Learning and Ranking Algorithms

15 years 1 months ago

Download nieme.lip6.fr

Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatic involve the generic task of sequence labeling. In many cases, the aim is to assi...

Francis Maes, Ludovic Denoyer, Patrick Gallinari

claim paper

Read More »

121

click to vote

ICML
1995
IEEE

213views Machine Learning» more ICML 1995»

Learning Policies for Partially Observable Environments: Scaling Up

16 years 5 days ago

Download reference.kfupm.edu.sa

Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...

Michael L. Littman, Anthony R. Cassandra, Leslie P...

claim paper

Read More »

109

click to vote

ICPR
2004
IEEE

120views computer vision» more ICPR 2004»

Joint Spatial and Temporal Structure Learning for Task based Control

16 years 14 days ago

Download www.cogs.susx.ac.uk

We present an example of a joint spatial and temporal task learning algorithm that results in a generative model that has applications for on-line visual control. We review work o...

Hilary Buxton, Kingsley Sage

claim paper

Read More »

« Prev « First page 17 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers