Search Sciweavers | Sciweavers

847 search results - page 88 / 170

» Learning Action Selection Network of Intelligent Agent

127

click to vote

AAAI
2007

84views Intelligent Agents» more AAAI 2007»

Stochastic Optimization for Collision Selection in High Energy Physics

15 years 5 months ago

Download www.aaai.org

Artiﬁcial intelligence has begun to play a critical role in basic science research. In high energy physics, AI methods can aid precision measurements that elucidate the underlyi...

Shimon Whiteson, Daniel Whiteson

claim paper

Read More »

111

click to vote

PRIMA
2009
Springer

102views Intelligent Agents» more PRIMA 2009»

Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

15 years 10 months ago

Download teamcore.usc.edu

In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...

Itsuki Noda

claim paper

Read More »

134

Voted

AAAI
1998

129views Intelligent Agents» more AAAI 1998»

Solving Very Large Weakly Coupled Markov Decision Processes

15 years 4 months ago

Download www.cs.toronto.edu

We present a technique for computing approximately optimal solutions to stochastic resource allocation problems modeled as Markov decision processes (MDPs). We exploit two key pro...

Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, L...

claim paper

Read More »

119

Voted

IAT
2003
IEEE

111views Intelligent Agents» more IAT 2003»

MPIAB: A Novel Agent Architecture for Parallel Processing

15 years 8 months ago

Download wiki.cogkit.org

This paper presents MPIAB, an agent based architecture for parallel processing. The architecture is developed to model the functions of standard MPI using java agents. It remedies...

Shahram Rahimi, Ajay Narayanan, Meha Sabharwal

claim paper

Read More »

135

click to vote

ICML
2005
IEEE

196views Machine Learning» more ICML 2005»

Bayesian sparse sampling for on-line reward optimization

16 years 4 months ago

Download www.cs.ualberta.ca

We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

« Prev « First page 88 / 170 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers