Sciweavers

1235 search results - page 238 / 247
» Reinforcement learning in a nutshell
Sort
View
ICCBR
2010
Springer
14 years 10 months ago
A General Introspective Reasoning Approach to Web Search for Case Adaptation
Abstract. Acquiring adaptation knowledge for case-based reasoning systems is a challenging problem. Such knowledge is typically elicited from domain experts or extracted from the c...
David B. Leake, Jay H. Powell
102
Voted
NN
2010
Springer
125views Neural Networks» more  NN 2010»
14 years 10 months ago
Parameter-exploring policy gradients
We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...
Frank Sehnke, Christian Osendorfer, Thomas Rü...
118
Voted
IAT
2010
IEEE
14 years 10 months ago
Selecting Operator Queries Using Expected Myopic Gain
When its human operator cannot continuously supervise (much less teleoperate) an agent, the agent should be able to recognize its limitations and ask for help when it risks making...
Robert Cohn, Michael Maxim, Edmund H. Durfee, Sati...
158
Voted
PKDD
2010
Springer
164views Data Mining» more  PKDD 2010»
14 years 10 months ago
Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations
Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...
Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...
130
Voted
GLOBECOM
2009
IEEE
14 years 10 months ago
Cooperative Communications with Relay Selection for QoS Provisioning in Wireless Sensor Networks
Abstract--Cooperative communications have been demonstrated to be effective in combating the multiple fading effects in wireless networks, and improving the network performance in ...
Xuedong Liang, Ilangko Balasingham, Victor C. M. L...