Search Sciweavers | Sciweavers

813 search results - page 105 / 163

» Ensemble Algorithms in Reinforcement Learning

143

click to vote

AGI
2008

136views Artificial Intelligence» more AGI 2008»

An Integrative Methodology for Teaching Embodied Non-Linguistic Agents, Applied to Virtual Animals in Second Life

15 years 5 months ago

Download www.novamente.net

A teaching methodology called Imitative-Reinforcement-Corrective (IRC) learning is described, and proposed as a general approach for teaching embodied non-linguistic AGI systems. I...

Ben Goertzel, Cassio Pennachin, Nil Geisweiller, M...

claim paper

Read More »

168

click to vote

CSE
2008
IEEE

172views Theoretical Computer Science» more CSE 2008»

Adaptation to Dynamic Resource Availability in Ad Hoc Grids through a Learning Mechanism

15 years 11 months ago

Download ce.et.tudelft.nl

Ad-hoc Grids are highly heterogeneous and dynamic networks, one of the main challenges of resource allocation in such environments is to ﬁnd mechanisms which do not rely on the ...

Behnaz Pourebrahimi, Koen Bertels

claim paper

Read More »

153

click to vote

AGENTS
1999
Springer

126views Security Privacy» more AGENTS 1999»

General Principles of Learning-Based Multi-Agent Systems

15 years 8 months ago

Download web.engr.oregonstate.edu

We consider the problem of how to design large decentralized multiagent systems (MAS’s) in an automated fashion, with little or no hand-tuning. Our approach has each agent run a...

David Wolpert, Kevin R. Wheeler, Kagan Tumer

claim paper

Read More »

133

click to vote

IJCNN
2007
IEEE

110views Neural Networks» more IJCNN 2007»

Optimizing 0/1 Loss for Perceptrons by Random Coordinate Descent

15 years 10 months ago

Download www.work.caltech.edu

—The 0/1 loss is an important cost function for perceptrons. Nevertheless it cannot be easily minimized by most existing perceptron learning algorithms. In this paper, we propose...

Ling Li, Hsuan-Tien Lin

claim paper

Read More »

131

click to vote

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 5 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

« Prev « First page 105 / 163 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers