Search Sciweavers | Sciweavers

109

ICML
2006
IEEE

161views Machine Learning» more ICML 2006»

Bayesian regression with input noise for high dimensional data

16 years 4 months ago

This paper examines high dimensional regression with noise-contaminated input and output data. Goals of such learning problems include optimal prediction with noiseless query poin...

Jo-Anne Ting, Aaron D'Souza, Stefan Schaal

claim paper

Read More »

124

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Planning in the Presence of Cost Functions Controlled by an Adversary

16 years 4 months ago

Download www.cs.cmu.edu

We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a rob...

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

claim paper

Read More »

119

Voted

ICML
2002
IEEE

127views Machine Learning» more ICML 2002»

Action Refinement in Reinforcement Learning by Probability Smoothing

16 years 4 months ago

Download www.cs.berkeley.edu

In many reinforcement learning applications, the set of possible actions can be partitioned by the programmer into subsets of similar actions. This paper presents a technique for ...

Carles Sierra, Dídac Busquets, Ramon L&oacu...

claim paper

Read More »

137

Voted

ICML
1996
IEEE

119views Machine Learning» more ICML 1996»

Searching for Structure in Multiple Streams of Data

16 years 4 months ago

Download www.cs.arizona.edu

Finding structure in multiple streams of data is an important problem. Consider the streams of data owing from a robot's sensors, the monitors in an intensive care unit, or p...

Tim Oates, Paul R. Cohen

claim paper

Read More »

153

Voted

ICML
1995
IEEE

213views Machine Learning» more ICML 1995»

Learning Policies for Partially Observable Environments: Scaling Up

16 years 4 months ago

Download reference.kfupm.edu.sa

Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...

Michael L. Littman, Anthony R. Cassandra, Leslie P...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers