Search Sciweavers | Sciweavers

453 search results - page 36 / 91

» Learning from actions not taken: a multiagent learning algor...

123

click to vote

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

16 years 17 days ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

click to vote

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

15 years 25 days ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

113

click to vote

SBIA
2000
Springer

172views Artificial Intelligence» more SBIA 2000»

User profiling with Case-Based Reasoning and Bayesian Networks

15 years 3 months ago

Download users.exa.unicen.edu.ar

Agent technology provides many services to users. The tasks in which agents are involved include information filtering, information retrieval, user's tasks automation, browsin...

Silvia N. Schiaffino, Analía Amandi

claim paper

Read More »

click to vote

ECML
2005
Springer

101views Machine Learning» more ECML 2005»

Model-Based Online Learning of POMDPs

15 years 5 months ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difﬁcult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

claim paper

Read More »

click to vote

ESANN
2008

136views Neural Networks» more ESANN 2008»

Generalized matrix learning vector quantizer for the analysis of spectral data

15 years 1 months ago

Download www.dice.ucl.ac.be

The analysis of spectral data constitutes new challenges for machine learning algorithms due to the functional nature of the data. Special attention is paid to the metric used in t...

Petra Schneider, Frank-Michael Schleif, Thomas Vil...

claim paper

Read More »

« Prev « First page 36 / 91 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers