Search Sciweavers | Sciweavers

326 search results - page 38 / 66

» Reinforcement Learning Based on On-Line EM Algorithm

click to vote

ICML
2003
IEEE

137views Machine Learning» more ICML 2003»

Learning Mixture Models with the Latent Maximum Entropy Principle

15 years 10 months ago

Download www.hpl.hp.com

We present a new approach to estimating mixture models based on a new inference principle we have proposed: the latent maximum entropy principle (LME). LME is different both from ...

Shaojun Wang, Dale Schuurmans, Fuchun Peng, Yunxin...

claim paper

Read More »

click to vote

ADHOCNETS
2010
Springer

276views Computer Networks» more ADHOCNETS 2010»

DCLA: A Duty-Cycle Learning Algorithm for IEEE 802.15.4 Beacon-Enabled WSNs

14 years 6 months ago

Download www.aws.cit.ie

The current specification for IEEE 802.15.4 beacon-enabled networks does not define how active and sleep schedules should be configured in order to achieve the optimal network perf...

Rodolfo de Paz Alberola, Dirk Pesch

claim paper

Read More »

click to vote

NIPS
2007

80views Information Technology» more NIPS 2007»

Stable Dual Dynamic Programming

14 years 11 months ago

Download webdocs.cs.ualberta.ca

Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions i...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

click to vote

JAIR
2011

144views more JAIR 2011»

Non-Deterministic Policies in Markovian Decision Processes

14 years 4 months ago

Download www.jair.org

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

click to vote

NIPS
1998

138views Information Technology» more NIPS 1998»

Call-Based Fraud Detection in Mobile Communication Networks Using a Hierarchical Regime-Switching Model

14 years 11 months ago

Download lib.tkk.fi

Fraud causes substantial losses to telecommunication carriers. Detection systems which automatically detect illegal use of the network can be used to alleviate the problem. Previo...

Jaakko Hollmén, Volker Tresp

claim paper

Read More »

« Prev « First page 38 / 66 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers