Search Sciweavers | Sciweavers

ATAL
2008
Springer

146 views · Intelligent Agents

Adaptive Kanerva-based function approximation for multi-agent systems

15 years 8 months ago

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving large-scale instanc...

Cheng Wu, Waleed Meleis


CAINE
2008

127 views · Computer Science

Scripted Artificially Intelligent Basic Online Tactical Simulation

15 years 7 months ago

Download www.cse.unr.edu

For many years, introductory Computer Science courses have followed the same teaching paradigms. These paradigms utilize only simple console windows; more interactive approaches t...

Jesse D. Phillips, Roger V. Hoang, Joseph D. Mahsm...


SASO
2008
IEEE

125 views · Control Systems

Self-Adaptive Dissemination of Data in Dynamic Sensor Networks

16 years 8 days ago

Download www.datafusionlab.org

The distribution of data in large dynamic wireless sensor networks presents a difficult problem due to node mobility, link failures, and traffic congestion. In this paper, we pr...

David Dorsey, Bjorn Jay Carandang, Moshe Kam, Chri...


ICRA
2007
IEEE

155 views · Robotics

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

16 years 6 days ago

Download sugiyama-www.cs.titech.ac.jp

The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...


ICANN
2007
Springer

95 views · Neural Networks

Solving Deep Memory POMDPs with Recurrent Policy Gradients

16 years 1 day ago

Download www.idsia.ch

This paper presents Recurrent Policy Gradients, a model-free reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...
