Search Sciweavers | Sciweavers

76

ICML
2003
IEEE

146views Machine Learning» more ICML 2003»

TD(0) Converges Provably Faster than the Residual Gradient Algorithm

15 years 10 months ago

In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...

Ralf Schoknecht, Artur Merke

claim paper

Read More »

93

click to vote

ACL
2011

181views Computational Linguistics» more ACL 2011»

Semi-supervised latent variable models for sentence-level sentiment analysis

14 years 1 months ago

Download ryanmcd.com

We derive two variants of a semi-supervised model for ﬁne-grained sentiment analysis. Both models leverage abundant natural supervision in the form of review ratings, as well as...

Oscar Täckström, Ryan T. McDonald

claim paper

Read More »

90

click to vote

SIGIR
2011
ACM

189views Information Technology» more SIGIR 2011»

Fast context-aware recommendations with factorization machines

14 years 17 days ago

Download www.inf.uni-konstanz.de

The situation in which a choice is made is an important information for recommender systems. Context-aware recommenders take this information into account to make predictions. So ...

Steffen Rendle, Zeno Gantner, Christoph Freudentha...

claim paper

Read More »

57

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

15 years 10 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

69

click to vote

IJCNN
2006
IEEE

121views Neural Networks» more IJCNN 2006»

Learning a Rendezvous Task with Dynamic Joint Action Perception

15 years 3 months ago

Download axon.cs.byu.edu

Abstract— Groups of reinforcement learning agents interacting in a common environment often fail to learn optimal behaviors. Poor performance is particularly common in environmen...

Nancy Fulda, Dan Ventura

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers