Sciweavers

397 search results - page 67 / 80
» Reinforcement Learning with Hierarchies of Machines
ML
2002
ACM
Machine Learning
14 years 9 months ago
Finite-time Analysis of the Multiarmed Bandit Problem
Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...
Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...
ATAL
2009
Springer
15 years 4 months ago
Comparing trust mechanisms for monitoring aggregator nodes in sensor networks
Sensor nodes are often used to collect data from locations inaccessible or hazardous for humans. As they are not under normal supervision, these nodes are particularly susceptible...
Oly Mistry, Anil Gürsel, Sandip Sen
ALT
2004
Springer
15 years 6 months ago
Prediction with Expert Advice by Following the Perturbed Leader for General Weights
When applying aggregating strategies to Prediction with Expert Advice, the learning rate must be adaptively tuned. The natural choice of complexity/current loss renders the analys...
Marcus Hutter, Jan Poland
ALT
2009
Springer
15 years 6 months ago
Iterative Learning from Texts and Counterexamples Using Additional Information
Abstract. A variant of iterative learning in the limit (cf. [LZ96]) is studied when a learner gets negative examples refuting conjectures containing data in excess of the target la...
Sanjay Jain, Efim B. Kinber
BMEI
2008
IEEE
15 years 4 months ago
A Retrospective Comparative Study of Three Data Modelling Techniques in Anticoagulation Therapy
Three types of data modelling technique are applied retrospectively to individual patients’ anticoagulation therapy data to predict their future levels of anticoagulation. The r...
Simon McDonald, Costas S. Xydeas, Plamen P. Angelo...