Sciweavers

69 search results - page 13 / 14
» PAC-Bayesian Policy Evaluation for Reinforcement Learning
Sort
View
AAAI
2000
13 years 7 months ago
ADVISOR: A Machine Learning Architecture for Intelligent Tutor Construction
We have constructed ADVISOR, a two-agent machine learning architecture for intelligent tutoring systems (ITS). The purpose of this architecture is to centralize the reasoning of a...
Joseph Beck, Beverly Park Woolf, Carole R. Beal
JNW
2006
63views more  JNW 2006»
13 years 6 months ago
MAC Contention in a Wireless LAN with Noncooperative Anonymous Stations
In ad hoc wireless LANs populated by mutually impenetrable groups of anonymous stations, honest stations are prone to "bandwidth stealing" by selfish stations. The proble...
Jerzy Konorski
IAT
2010
IEEE
13 years 4 months ago
Multiagent Meta-level Control for a Network of Weather Radars
It is crucial for embedded systems to adapt to the dynamics of open environments. This adaptation process becomes especially challenging in the context of multiagent systems. In t...
Shanjun Cheng, Anita Raja, Victor R. Lesser
JMLR
2006
124views more  JMLR 2006»
13 years 6 months ago
Policy Gradient in Continuous Time
Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...
Rémi Munos
CORR
2010
Springer
152views Education» more  CORR 2010»
13 years 6 months ago
Neuroevolutionary optimization
Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...
Eva Volná