Sciweavers

908 search results - page 99 / 182
» Interactive regret minimization
Sort
View
NIPS
1997
15 years 7 months ago
Nonparametric Model-Based Reinforcement Learning
This paper describes some of the interactions of model learning algorithms and planning algorithms we have found in exploring model-based reinforcement learning. The paper focuses...
Christopher G. Atkeson
CADE
2010
Springer
15 years 7 months ago
Sledgehammer: Judgement Day
Abstract. Sledgehammer, a component of the interactive theorem prover Isabelle, finds proofs in higher-order logic by calling the automated provers for first-order logic E, SPASS a...
Sascha Böhme, Tobias Nipkow
156
Voted
CGF
2007
97views more  CGF 2007»
15 years 6 months ago
Volume Preservation of Multiresolution Meshes
Geometric constraints have proved to be efficient for enhancing the realism of shape animation. The present paper addresses the computation and the preservation of the volume enc...
Basile Sauvage, Stefanie Hahmann, Georges-Pierre B...
CORR
2007
Springer
73views Education» more  CORR 2007»
15 years 6 months ago
Universal Reinforcement Learning
—We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can influence futu...
Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...
JIB
2006
102views more  JIB 2006»
15 years 6 months ago
3D image and graph based Computation of Protein Surface
The accessible surface of a macromolecule is a significant determinant of its action. The interaction between biomolecules or protein-ligand is dependent on their surfaces rather ...
A. Ranganath, K. C. Shet, N. Vidyavathi