Sciweavers

1227 search results - page 67 / 246
» Learning Rates for Q-Learning
Sort
View
AAAI
2007
15 years 5 months ago
RETALIATE: Learning Winning Policies in First-Person Shooter Games
In this paper we present RETALIATE, an online reinforcement learning algorithm for developing winning policies in team firstperson shooter games. RETALIATE has three crucial chara...
Megan Smith, Stephen Lee-Urban, Hector Muño...
106
Voted
EMNLP
2008
15 years 4 months ago
Learning to Predict Code-Switching Points
Predicting possible code-switching points can help develop more accurate methods for automatically processing mixed-language text, such as multilingual language models for speech ...
Thamar Solorio, Yang Liu
116
Voted
NIPS
2007
15 years 4 months ago
An online Hebbian learning rule that performs Independent Component Analysis
Independent component analysis (ICA) is a powerful method to decouple signals. Most of the algorithms performing ICA do not consider the temporal correlations of the signal, but o...
Claudia Clopath, André Longtin, Wulfram Ger...
145
Voted
GRC
2008
IEEE
15 years 4 months ago
Neighborhood Smoothing Embedding for Noisy Manifold Learning
Manifold learning can discover the structure of high dimensional data and provides understanding of multidimensional patterns by preserving the local geometric characteristics. Ho...
Guisheng Chen, Junsong Yin, Deyi Li
160
Voted
ML
2008
ACM
152views Machine Learning» more  ML 2008»
15 years 3 months ago
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...
András Antos, Csaba Szepesvári, R&ea...