Search Sciweavers | Sciweavers

88

AAAI
2010

171views Intelligent Agents» more AAAI 2010»

Multi-Agent Learning with Policy Prediction

15 years 1 months ago

Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...

Chongjie Zhang, Victor R. Lesser

claim paper

Read More »

94

click to vote

FLAIRS
2004

119views Artificial Intelligence» more FLAIRS 2004»

Recurrent Neural Networks and Pitch Representations for Music Tasks

15 years 1 months ago

Download maven.smith.edu

We present results from experiments in using several pitch representations for jazz-oriented musical tasks performed by a recurrent neural network. We have run experiments with se...

Judy A. Franklin

claim paper

Read More »

103

click to vote

NN
1998
Springer

108views Neural Networks» more NN 1998»

How embedded memory in recurrent neural network architectures helps learning long-term temporal dependencies

14 years 11 months ago

Download clgiles.ist.psu.edu

Learning long-term temporal dependencies with recurrent neural networks can be a difﬁcult problem. It has recently been shown that a class of recurrent neural networks called NA...

Tsungnan Lin, Bill G. Horne, C. Lee Giles

claim paper

Read More »

77

click to vote

ICML
2009
IEEE

131views Machine Learning» more ICML 2009»

Monte-Carlo simulation balancing

16 years 16 days ago

Download www.cs.ualberta.ca

In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...

David Silver, Gerald Tesauro

claim paper

Read More »

103

click to vote

ORL
2008

68views more ORL 2008»

On polynomial cases of the unichain classification problem for Markov Decision Processes

14 years 11 months ago

Download www.ams.sunysb.edu

The unichain classification problem detects whether a finite state and action MDP is unichain under all deterministic policies. This problem is NP-hard [11]. This paper provides p...

Eugene A. Feinberg, Fenghsu Yang

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers