Sciweavers

1227 search results - page 42 / 246
» Learning Rates for Q-Learning
Sort
View
SDM
2007
SIAM
198views Data Mining» more  SDM 2007»
14 years 11 months ago
Learning from Time-Changing Data with Adaptive Windowing
We present a new approach for dealing with distribution change and concept drift when learning from data sequences that may vary with time. We use sliding windows whose size, inst...
Albert Bifet, Ricard Gavaldà
COLING
1992
14 years 11 months ago
Syntactic Ambiguity Resolution Using A Discrimination and Robustness Oriented Adaptive Learning Algorithm
In this paper, a discrimination and robusmess oriented adaptive learning procedure is proposed to deal with the task of syntactic ambiguity resolution. Owing to the problem of ins...
Tung-Hui Chiang, Yi-Chung Lin, Keh-Yih Su
COLT
2000
Springer
15 years 2 months ago
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (  ¢¡¤£¦¥§  ), and focus on gradient ascent approache...
Peter L. Bartlett, Jonathan Baxter
SIGCSE
1994
ACM
172views Education» more  SIGCSE 1994»
15 years 1 months ago
Collaborative learning in an introductory computer science course
An experiment in collaborative learning was conducted in two introductory programming courses at Loyola College in Maryland. Data collected included background information on stud...
Roberta Evans Sabin, Edward P. Sabin
CONNECTION
2006
101views more  CONNECTION 2006»
14 years 9 months ago
Learning acceptable windows of contingency
By learning a range of possible times over which the effect of an action can take place, a robot can reason more effectively about causal and contingent relationships in the world...
Kevin Gold, Brian Scassellati