Search Sciweavers | Sciweavers

3274 search results - page 425 / 655

» Using Learning in a Control Agent


ECML
2006
Springer

146 views, Machine Learning

Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

15 years 10 months ago

Download www.montefiore.ulg.ac.be

We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJ...

Sébastien Jodogne, Justus H. Piater



COLT
2010
Springer

183 views, Machine Learning

Regret Minimization With Concept Drift

15 years 5 months ago

Download www.seas.upenn.edu

In standard online learning, the goal of the learner is to maintain an average loss that is "not too big" compared to the loss of the best-performing function in a fixed...

Koby Crammer, Yishay Mansour, Eyal Even-Dar, Jenni...



JSS
2007

115 views

A case study in re-engineering to enforce architectural control flow and data sharing

15 years 7 months ago

Download www.cs.cmu.edu

Without rigorous software development and maintenance, software tends to lose its original architectural structure and become difficult to understand and modify. ArchJava, a recen...

Marwan Abi-Antoun, Jonathan Aldrich, Wesley Coelho



SBIA
2004
Springer

113 views, Artificial Intelligence

Learning with Drift Detection

16 years 12 days ago

Download www2.mat.ua.pt

Most of the work in machine learning assumes that examples are generated at random according to some stationary probability distribution. In this work we study the problem...

João Gama, Pedro Medas, Gladys Castillo, Pe...



NIPS
1996

192 views, Information Technology

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 8 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

