Sciweavers

2861 search results - page 266 / 573
» Parallel Online Learning
Sort
View
NIPS
2000
15 years 5 months ago
Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task
The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...
Brian Sallans, Geoffrey E. Hinton
ICANN
2010
Springer
15 years 5 months ago
Reinforcement Learning Based Neural Controllers for Dynamic Processes without Exploration
Abstract. In this paper we present a Reinforcement Learning (RL) approach with the capability to train neural adaptive controllers for complex control problems without expensive on...
Frank-Florian Steege, André Hartmann, Erik ...
ICASSP
2010
IEEE
15 years 4 months ago
Learning deep rhetorical structure for extractive speech summarization
Extractive summarization of conference and lecture speech is useful for online learning and references. We show for the first time that deep(er) rhetorical parsing of conference ...
Justin Jian Zhang, Pascale Fung
IJLT
2006
89views more  IJLT 2006»
15 years 4 months ago
How do you know they are learning? The importance of alignment in higher education
: The success of any learning environment is determined by the degree to which there is adequate alignment among eight critical factors: 1) goals, 2) content, 3) instructional desi...
Thomas C. Reeves
MMS
2006
15 years 4 months ago
Support vector machine active learning for music retrieval
Searching and organizing growing digital music collections requires a computational model of music similarity. This paper describes a system for performing flexible music similarit...
Michael I. Mandel, Graham E. Poliner, Daniel P. W....