Search Sciweavers | Sciweavers

1246 search results - page 138 / 250

» Online testing with model programs

click to vote

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 2 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

113

Voted

WOA
2007

88views Intelligent Agents» more WOA 2007»

A Swarm Intelligence Method Applied to Manufacturing Scheduling

15 years 1 months ago

Download woa07.disi.unige.it

—In this paper we present a multi-agent search technique to face the NP-hard single machine total weighted tardiness scheduling problem in presence of sequence-dependent setup ti...

Davide Anghinolfi, Antonio Boccalatte, Alberto Gro...

claim paper

Read More »

109

Voted

ICRA
2010
IEEE

148views Robotics» more ICRA 2010»

Body schema acquisition through active learning

14 years 11 months ago

Download users.isr.ist.utl.pt

— We present an active learning algorithm for the problem of body schema learning, i.e. estimating a kinematic model of a serial robot. The learning process is done online using ...

Ruben Martinez-Cantin, Manuel Lopes, Luis Montesan...

claim paper

Read More »

Voted

AAAI
2012

190views Intelligent Agents» more AAAI 2012»

Evaluating Resistance to False-Name Manipulations in Elections

13 years 3 months ago

Download people.seas.harvard.edu

In many mechanisms (especially online mechanisms), a strategic agent can inﬂuence the outcome by creating multiple false identities. We consider voting settings where the mechan...

Bo Waggoner, Lirong Xia, Vincent Conitzer

claim paper

Read More »

Voted

JCP
2008

126views more JCP 2008»

Hardware/Software Co-design Approach for an ADALINE Based Adaptive Control System

15 years 23 days ago

Download saturn.ee.psu.ac.th

Abstract--In this paper, we report some results on hardware and software co-design of an adaptive linear neuron (ADALINE) based control system. A discrete-time Proportional-Integra...

Shouling He, Xuping Xu

claim paper

Read More »

« Prev « First page 138 / 250 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers