Search Sciweavers | Sciweavers

1233 search results - page 225 / 247

» Reinforcement Learning in MirrorBot

116

click to vote

IAT
2005
IEEE

138views Intelligent Agents» more IAT 2005»

Multiagent Reputation Management to Achieve Robust Software Using Redundancy

15 years 7 months ago

Download www.cse.sc.edu

This paper explains the building of robust software using multiagent reputation. One of the major goals of software engineering is to achieve robust software. Our hypothesis is th...

Rajesh Turlapati, Michael N. Huhns

claim paper

Read More »

click to vote

ICML
2009
IEEE

131views Machine Learning» more ICML 2009»

Monte-Carlo simulation balancing

16 years 2 months ago

Download www.cs.ualberta.ca

In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...

David Silver, Gerald Tesauro

claim paper

Read More »

127

click to vote

ICML
2001
IEEE

159views Machine Learning» more ICML 2001»

Direct Policy Search using Paired Statistical Tests

16 years 2 months ago

Download www.autonlab.org

Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...

Malcolm J. A. Strens, Andrew W. Moore

claim paper

Read More »

106

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

15 years 7 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

130

click to vote

ACSE
2000
ACM

271views Theoretical Computer Science» more ACSE 2000»

The information environments program - a new design based IT degree

15 years 5 months ago

Download www.itee.uq.edu.au

The University of Queensland has recently established a new design-focused, studio-based IT degree at a new “flexible-learning” campus. The Bachelor of Information Environment...

Michael Docherty, Peter Sutton, Margot Brereton, S...

claim paper

Read More »

« Prev « First page 225 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers