Search Sciweavers | Sciweavers

582 search results - page 66 / 117

» Gaussian Processes in Reinforcement Learning

144

click to vote

ACMICEC
2007
ACM

154views ECommerce» more ACMICEC 2007»

Learning and adaptivity in interactive recommender systems

15 years 8 months ago

Download www.inf.unibz.it

Recommender systems are intelligent E-commerce applications that assist users in a decision-making process by offering personalized product recommendations during an interaction s...

Tariq Mahmood, Francesco Ricci

claim paper

Read More »

166

Voted

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

16 years 5 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

146

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

15 years 6 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

114

Voted

AAAI
2006

118views Intelligent Agents» more AAAI 2006»

Hard Constrained Semi-Markov Decision Processes

15 years 6 months ago

Download www.aaai.org

In multiple criteria Markov Decision Processes (MDP) where multiple costs are incurred at every decision point, current methods solve them by minimising the expected primary cost ...

Wai-Leong Yeow, Chen-Khong Tham, Wai-Choong Wong

claim paper

Read More »

145

Voted

AAMAS
2002
Springer

128views Intelligent Agents» more AAMAS 2002»

Cooperative Learning Using Advice Exchange

15 years 4 months ago

Download iscte.pt

Abstract. One of the main questions concerning learning in a Multi-Agent System's environment is: "(How) can agents benefit from mutual interaction during the learning pr...

Luís Nunes, Eugenio Oliveira

claim paper

Read More »

« Prev « First page 66 / 117 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers