Search Sciweavers | Sciweavers

358 search results - page 24 / 72

» Online Testing with Reinforcement Learning

134

Voted

AGENTS
1999
Springer

105views Security Privacy» more AGENTS 1999»

Team-Partitioned, Opaque-Transition Reinforcement Learning

15 years 9 months ago

Download www.cs.ucf.edu

In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...

Peter Stone, Manuela M. Veloso

claim paper

Read More »

150

click to vote

CAINE
2008

127views Computer Science» more CAINE 2008»

Scripted Artificially Intelligent Basic Online Tactical Simulation

15 years 6 months ago

Download www.cse.unr.edu

For many years, introductory Computer Science courses have followed the same teaching paradigms. These paradigms utilize only simple console windows; more interactive approaches t...

Jesse D. Phillips, Roger V. Hoang, Joseph D. Mahsm...

claim paper

Read More »

148

Voted

ECML
2004
Springer

154views Machine Learning» more ECML 2004»

Experiments in Value Function Approximation with Sparse Support Vector Regression

15 years 10 months ago

Download userweb.cs.utexas.edu

Abstract. We present ﬁrst experiments using Support Vector Regression as function approximator for an on-line, sarsa-like reinforcement learner. To overcome the batch nature of S...

Tobias Jung, Thomas Uthmann

claim paper

Read More »

132

click to vote

ICML
2003
IEEE

119views Machine Learning» more ICML 2003»

Testing Exchangeability On-Line

16 years 5 months ago

Download www.hpl.hp.com

The majority of theoretical work in machine learning is done under the assumption of exchangeability: essentially, it is assumed that the examples are generated from the same prob...

Vladimir Vovk, Ilia Nouretdinov, Alexander Gammerm...

claim paper

Read More »

193

click to vote

ECML
2006
Springer

146views Machine Learning» more ECML 2006»

Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

15 years 8 months ago

Download www.montefiore.ulg.ac.be

We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJ...

Sébastien Jodogne, Justus H. Piater

claim paper

Read More »

« Prev « First page 24 / 72 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers