Search Sciweavers | Sciweavers

3412 search results - page 174 / 683

» Efficient Reinforcement Learning

101

Voted

AINA
2007
IEEE

144views Computer Networks» more AINA 2007»

Efficient Battery Management for Sensor Lifetime

15 years 7 months ago

Download www.waset.org

Abstract--Monitoring and automatic control of building environment is a crucial application of Wireless Sensor Network (WSN) in which maximizing network lifetime is a key challenge...

Malka Halgamuge

claim paper

Read More »

153

Voted

DIGITEL
2008
IEEE

236views Artificial Intelligence» more DIGITEL 2008»

Adaptive Educational Games: Providing Non-invasive Personalised Learning Experiences

15 years 5 months ago

Download www.mendeley.com

Educational games have the potential to provide intrinsically motivating learning experiences that immerse and engage the learner. However, the much heralded benefits of education...

Neil Peirce, Owen Conlan, Vincent Wade

claim paper

Read More »

128

Voted

NIPS
1996

134views Information Technology» more NIPS 1996»

Why did TD-Gammon Work?

15 years 5 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

159

Voted

GECCO
2008
Springer

144views Optimization» more GECCO 2008»

Self-adaptive constructivism in Neural XCS and XCSF

15 years 4 months ago

Download www.cems.uwe.ac.uk

For artificial entities to achieve high degrees of autonomy they will need to display appropriate adaptability. In this sense adaptability includes representational flexibility gu...

Gerard David Howard, Larry Bull, Pier Luca Lanzi

claim paper

Read More »

120

Voted

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 4 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

« Prev « First page 174 / 683 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers