Sciweavers

326 search results - page 36 / 66
» Reinforcement Learning Based on On-Line EM Algorithm
Sort
View
ATAL
2007
Springer
15 years 4 months ago
Dynamic task allocation within an open service-oriented MAS architecture
A MAS architecture consisting of service centers is proposed. Within each service center, a mediator coordinates service delivery by allocating individual tasks to corresponding t...
Ivan Jureta, Stéphane Faulkner, Youssef Ach...
JMLR
2010
119views more  JMLR 2010»
14 years 4 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
ECML
2005
Springer
15 years 3 months ago
Model-Based Online Learning of POMDPs
Abstract. Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...
Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony
AAAI
2007
15 years 4 days ago
Active Imitation Learning
Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...
Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao
IJCNN
2007
IEEE
15 years 4 months ago
A Constructive-Fuzzy System Modeling for Time Series Forecasting
— This paper suggests a constructive fuzzy system modeling for time series prediction. The model proposed is based on Takagi-Sugeno system and it comprises two phases. First, a f...
Ivette Luna, Secundino Soares, Rosangela Ballini