Sciweavers

2905 search results - page 334 / 581
» Learning in Hyperlinked Environments
CORR
2011
Springer
161 views
14 years 6 months ago
Doubly Robust Policy Evaluation and Learning
We study decision making in environments where the reward is only partially observed, but can be modeled as a function of an action and an observed context. This setting, known as...
Miroslav Dudík, John Langford, Lihong Li
CE
2005
170 views
15 years 3 months ago
Development of an environmental virtual field laboratory
Laboratory exercises, field observations and field trips are a fundamental part of many earth science and environmental science courses. Field observations and field trips can be ...
V. Ramasundaram, S. Grunwald, A. Mangeot, N. B. Co...
ICML
2009
IEEE
16 years 3 months ago
K-means in space: a radiation sensitivity evaluation
Spacecraft increasingly employ onboard data analysis to inform further data collection and prioritization decisions. However, many spacecraft operate in high-radiation environment...
Kiri L. Wagstaff, Benjamin Bornstein
ICML
1995
IEEE
16 years 3 months ago
Stable Function Approximation in Dynamic Programming
The success of reinforcement learning in practical problems depends on the ability to combine function approximation with temporal difference methods such as value iteration. Experime...
Geoffrey J. Gordon
ALT
2006
Springer
16 years 1 day ago
General Discounting Versus Average Reward
Consider an agent interacting with an environment in cycles. In every interaction cycle the agent is rewarded for its performance. We compare the average reward U from cycle 1 to ...
Marcus Hutter