Search Sciweavers | Sciweavers

ML
2002
ACM

A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes

15 years 4 months ago

An issue that is critical for the application of Markov decision processes (MDPs) to realistic problems is how the complexity of planning scales with the size of the MDP. In stochas...

Michael J. Kearns, Yishay Mansour, Andrew Y. Ng

ML
2002
ACM

Finite-time Analysis of the Multiarmed Bandit Problem

15 years 4 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

ML
2002
ACM

Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts

15 years 4 months ago

Download www.cs.ou.edu

The execution order of a block of computer instructions on a pipelined machine can make a difference in running time by a factor of two or more. Compilers use heuristic schedulers...

Amy McGovern, J. Eliot B. Moss, Andrew G. Barto

ML
2002
ACM

Bayesian Treed Models

15 years 4 months ago

Download math.acadiau.ca

When simple parametric models such as linear regression fail to adequately approximate a relationship across an entire set of data, an alternative may be to consider a partition o...

Hugh A. Chipman, Edward I. George, Robert E. McCul...

ML
2002
ACM

Structural Modelling with Sparse Kernels

15 years 4 months ago

Download users.ecs.soton.ac.uk

A widely acknowledged drawback of many statistical modelling techniques, commonly used in machine learning, is that the resulting model is extremely difficult to interpret. A numb...

Steve R. Gunn, Jaz S. Kandola
