Search Sciweavers | Sciweavers

270 search results - page 37 / 54

» Estimation of non-stationary Markov Chain transition models

CORR
2010
Springer

105 views, Education

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

14 years 10 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

CVPR
2001
IEEE

256 views, Computer Vision

Texture Replacement in Real Images

16 years 1 month ago

Download vision.cse.psu.edu

Texture replacement in real images has many applications, such as interior design, digital movie making and computer graphics. The goal is to replace some specified texture patter...

Yanghai Tsin, Yanxi Liu, Visvanathan Ramesh

AIPS
2007

174 views, Artificial Intelligence

Learning to Plan Using Harmonic Analysis of Diffusion Models

15 years 2 months ago

Download www.cs.umass.edu

This paper summarizes research on a new emerging framework for learning to plan using the Markov decision process model (MDP). In this paradigm, two approaches to learning to plan...

Sridhar Mahadevan, Sarah Osentoski, Jeffrey Johns,...

ICML
1999
IEEE

168 views, Machine Learning

Least-Squares Temporal Difference Learning

16 years 18 days ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

SPAA
1990
ACM

134 views, Distributed And Parallel Com...

Analysis of Multithreaded Architectures for Parallel Computing

15 years 3 months ago

Download www.cs.berkeley.edu

Multithreading has been proposed as an architectural strategy for tolerating latency in multiprocessors and, through limited empirical studies, shown to offer promise. This paper ...

Rafael H. Saavedra-Barrera, David E. Culler, Thors...
