Search Sciweavers | Sciweavers

1235 search results - page 238 / 247

» Reinforcement learning in a nutshell

162

click to vote

ICCBR
2010
Springer

229views Automated Reasoning» more ICCBR 2010»

A General Introspective Reasoning Approach to Web Search for Case Adaptation

15 years 3 months ago

Download www.cs.indiana.edu

Abstract. Acquiring adaptation knowledge for case-based reasoning systems is a challenging problem. Such knowledge is typically elicited from domain experts or extracted from the c...

David B. Leake, Jay H. Powell

claim paper

Read More »

146

click to vote

NN
2010
Springer

125views Neural Networks» more NN 2010»

Parameter-exploring policy gradients

15 years 3 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

claim paper

Read More »

168

click to vote

IAT
2010
IEEE

167views Intelligent Agents» more IAT 2010»

Selecting Operator Queries Using Expected Myopic Gain

15 years 2 months ago

Download www.eecs.umich.edu

When its human operator cannot continuously supervise (much less teleoperate) an agent, the agent should be able to recognize its limitations and ask for help when it risks making...

Robert Cohn, Michael Maxim, Edmund H. Durfee, Sati...

claim paper

Read More »

208

click to vote

PKDD
2010
Springer

164views Data Mining» more PKDD 2010»

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

15 years 2 months ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

claim paper

Read More »

179

Voted

GLOBECOM
2009
IEEE

253views Communications» more GLOBECOM 2009»

Cooperative Communications with Relay Selection for QoS Provisioning in Wireless Sensor Networks

15 years 2 months ago

Download mmlab.snu.ac.kr

Abstract--Cooperative communications have been demonstrated to be effective in combating the multiple fading effects in wireless networks, and improving the network performance in ...

Xuedong Liang, Ilangko Balasingham, Victor C. M. L...

claim paper

Read More »

« Prev « First page 238 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers