Sciweavers

25 search results - page 3 / 5
» Learning in the Limit with Adversarial Disturbances
Sort
View
ICML
2008
IEEE
14 years 6 months ago
A worst-case comparison between temporal difference and residual gradient with linear function approximation
Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...
Lihong Li
STOC
2005
ACM
129views Algorithms» more  STOC 2005»
14 years 5 months ago
Learning with attribute costs
We study an extension of the "standard" learning models to settings where observing the value of an attribute has an associated cost (which might be different for differ...
Haim Kaplan, Eyal Kushilevitz, Yishay Mansour
ICML
2009
IEEE
14 years 6 months ago
Piecewise-stationary bandit problems with side observations
We consider a sequential decision problem where the rewards are generated by a piecewise-stationary distribution. However, the different reward distributions are unknown and may c...
Jia Yuan Yu, Shie Mannor
JSAC
2010
188views more  JSAC 2010»
13 years 8 hour ago
Random-walk based approach to detect clone attacks in wireless sensor networks
Abstract--Wireless sensor networks (WSNs) deployed in hostile environments are vulnerable to clone attacks. In such attack, an adversary compromises a few nodes, replicates them, a...
Yingpei Zeng, Jiannong Cao, Shigeng Zhang, Shanqin...
ECTEL
2006
Springer
13 years 9 months ago
Guided and Interactive Factory Tours for Schools
School education today aims at improving the integration of school and professional life. A popular way to provide first hand experiences to students are guided factory tours. Comp...
Andreas Kaibel, Andreas Auwärter, Milos Kravc...