Search Sciweavers | Sciweavers

326 search results - page 36 / 66

» Reinforcement Learning Based on On-Line EM Algorithm

ATAL
2007
Springer

108 views | Intelligent Agents

Dynamic task allocation within an open service-oriented MAS architecture

16 years 1 month ago

Download www.isys.ucl.ac.be

A MAS architecture consisting of service centers is proposed. Within each service center, a mediator coordinates service delivery by allocating individual tasks to corresponding t...

Ivan Jureta, Stéphane Faulkner, Youssef Ach...

JMLR
2010

119 views

A Convergent Online Single Time Scale Actor Critic Algorithm

15 years 1 month ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

ECML
2005
Springer

101 views | Machine Learning

Model-Based Online Learning of POMDPs

16 years 11 days ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

AAAI
2007

104 views | Intelligent Agents

Active Imitation Learning

15 years 9 months ago

Download www.cs.washington.edu

Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...

Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao

IJCNN
2007
IEEE

140 views | Neural Networks

A Constructive-Fuzzy System Modeling for Time Series Forecasting

16 years 1 month ago

Download www.neural-forecasting-competition.com

This paper suggests a constructive fuzzy system modeling approach for time series prediction. The proposed model is based on the Takagi-Sugeno system and comprises two phases. First, a f...

Ivette Luna, Secundino Soares, Rosangela Ballini
