Sciweavers

2011 search results - page 174 / 403
» Universal Reinforcement Learning
Sort
View
IADIS
2004
15 years 6 months ago
Organizing Decentralized Support for a Virtual Learning Environment
Aiming at educational innovation and optimalisation, the University of Leuven (K.U.Leuven) introduced a Virtual
Joke Tisaun, Herman Buelens, Jan Vanthienen
GECCO
2009
Springer
150views Optimization» more  GECCO 2009»
15 years 11 months ago
Discrete dynamical genetic programming in XCS
A number of representation schemes have been presented for use within Learning Classifier Systems, ranging from binary encodings to neural networks. This paper presents results fr...
Richard Preen, Larry Bull
119
Voted
TSMC
2002
136views more  TSMC 2002»
15 years 4 months ago
Expertness based cooperative Q-learning
By using other agents' experiences and knowledge, a learning agent may learn faster, make fewer mistakes, and create some rules for unseen situations. These benefits would be ...
Majid Nili Ahmadabadi, Masoud Asadpour
137
Voted
IROS
2006
IEEE
187views Robotics» more  IROS 2006»
15 years 10 months ago
Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic
— Recently, many researchers on humanoid robotics are interested in Quasi-Passive-Dynamic Walking (Quasi-PDW) which is similar to human walking. It is desirable that control para...
Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To...
PKDD
2009
Springer
184views Data Mining» more  PKDD 2009»
15 years 9 months ago
Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm
Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...
Philippe Rolet, Michèle Sebag, Olivier Teyt...