Search Sciweavers | Sciweavers

397 search results - page 67 / 80

» Reinforcement Learning with Hierarchies of Machines

click to vote

ML
2002
ACM

133views Machine Learning» more ML 2002»

Finite-time Analysis of the Multiarmed Bandit Problem

14 years 11 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

claim paper

Read More »

click to vote

ATAL
2009
Springer

166views Intelligent Agents» more ATAL 2009»

Comparing trust mechanisms for monitoring aggregator nodes in sensor networks

15 years 6 months ago

Download www.aamas-conference.org

Sensor nodes are often used to collect data from locations inaccessible or hazardous for humans. As they are not under normal supervision, these nodes are particularly susceptible...

Oly Mistry, Anil Gürsel, Sandip Sen

claim paper

Read More »

click to vote

ALT
2004
Springer

120views Machine Learning» more ALT 2004»

Prediction with Expert Advice by Following the Perturbed Leader for General Weights

15 years 8 months ago

Download www.idsia.ch

When applying aggregating strategies to Prediction with Expert Advice, the learning rate must be adaptively tuned. The natural choice of complexity/current loss renders the analys...

Marcus Hutter, Jan Poland

claim paper

Read More »

click to vote

ALT
2009
Springer

161views Machine Learning» more ALT 2009»

Iterative Learning from Texts and Counterexamples Using Additional Information

15 years 8 months ago

Download www.comp.nus.edu.sg

Abstract. A variant of iterative learning in the limit (cf. [LZ96]) is studied when a learner gets negative examples refuting conjectures containing data in excess of the target la...

Sanjay Jain, Efim B. Kinber

claim paper

Read More »

click to vote

BMEI
2008
IEEE

153views Biomedical Imaging» more BMEI 2008»

A Retrospective Comparative Study of Three Data Modelling Techniques in Anticoagulation Therapy

15 years 6 months ago

Download eprints.lancs.ac.uk

Three types of data modelling technique are applied retrospectively to individual patients’ anticoagulation therapy data to predict their future levels of anticoagulation. The r...

Simon McDonald, Costas S. Xydeas, Plamen P. Angelo...

claim paper

Read More »

« Prev « First page 67 / 80 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers