Search Sciweavers | Sciweavers

2905 search results - page 334 / 581

» Learning in Hyperlinked Environments

125

click to vote

CORR
2011
Springer

161views Education» more CORR 2011»

Doubly Robust Policy Evaluation and Learning

14 years 6 months ago

Download www.icml-2011.org

We study decision making in environments where the reward is only partially observed, but can be modeled as a function of an action and an observed context. This setting, known as...

Miroslav Dudík, John Langford, Lihong Li

claim paper

Read More »

123

click to vote

CE
2005

170views more CE 2005»

Development of an environmental virtual field laboratory

15 years 3 months ago

Download soils.ifas.ufl.edu

Laboratory exercises, field observations and field trips are a fundamental part of many earth science and environmental science courses. Field observations and field trips can be ...

V. Ramasundaram, S. Grunwald, A. Mangeot, N. B. Co...

claim paper

Read More »

100

click to vote

ICML
2009
IEEE

117views Machine Learning» more ICML 2009»

K-means in space: a radiation sensitivity evaluation

16 years 3 months ago

Download www.wkiri.com

Spacecraft increasingly employ onboard data analysis to inform further data collection and prioritization decisions. However, many spacecraft operate in high-radiation environment...

Kiri L. Wagstaff, Benjamin Bornstein

claim paper

Read More »

122

click to vote

ICML
1995
IEEE

155views Machine Learning» more ICML 1995»

Stable Function Approximation in Dynamic Programming

16 years 3 months ago

Download www.ri.cmu.edu

The success ofreinforcement learninginpractical problems depends on the ability to combine function approximation with temporal di erence methods such as value iteration. Experime...

Geoffrey J. Gordon

claim paper

Read More »

102

click to vote

ALT
2006
Springer

109views Machine Learning» more ALT 2006»

General Discounting Versus Average Reward

16 years 1 days ago

Download www.idsia.ch

Consider an agent interacting with an environment in cycles. In every interaction cycle the agent is rewarded for its performance. We compare the average reward U from cycle 1 to ...

Marcus Hutter

claim paper

Read More »

« Prev « First page 334 / 581 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers