Search Sciweavers | Sciweavers


» Argument based machine learning


ICML
2002
IEEE


Hierarchically Optimal Average Reward Reinforcement Learning

16 years 5 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan


ICALT
2005
IEEE


Building Repositories of Learning Objects in Specialized Domains: The Chasqui Approach

15 years 10 months ago

Download www.e-ucm.es

In this paper we describe the Chasqui approach to the construction of repositories of learning objects (LO) in specific knowledge areas. This approach is the result of our experie...

José Luis Sierra, Alfredo Fernández-...


COLT
2006
Springer


A Randomized Online Learning Algorithm for Better Variance Control

15 years 8 months ago

Download certis.enpc.fr

We propose a sequential randomized algorithm, which at each step concentrates on functions having both low risk and low variance with respect to the previous step prediction functi...

Jean-Yves Audibert


ICML
2010
IEEE


Nonparametric Return Distribution Approximation for Reinforcement Learning

15 years 5 months ago

Download www.icml2010.org

Standard Reinforcement Learning (RL) aims to optimize decision-making rules in terms of the expected return. However, especially for risk-management purposes, other criteria such ...

Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashim...


AIEDAM
1998


Learning to set up numerical optimizations of engineering designs

15 years 4 months ago

Download ti.arc.nasa.gov

Gradient-based numerical optimization of complex engineering designs offers the promise of rapidly producing better designs. However, such methods generally assume that the object...

Mark Schwabacher, Thomas Ellman, Haym Hirsh
