Search Sciweavers | Sciweavers

166 search results - page 23 / 34

» Learning of Event-Recording Automata

128

click to vote

ATAL
2008
Springer

115views Intelligent Agents» more ATAL 2008»

Switching dynamics of multi-agent learning

15 years 5 months ago

Download www.ifaamas.org

This paper presents the dynamics of multi-agent reinforcement learning in multiple state problems. We extend previous work that formally modelled the relation between reinforcemen...

Peter Vrancx, Karl Tuyls, Ronald L. Westra

claim paper

Read More »

109

click to vote

ECAL
2001
Springer

118views Artificial Intelligence» more ECAL 2001»

Pareto Optimality in Coevolutionary Learning

15 years 7 months ago

Download www.demo.cs.brandeis.edu

We develop a novel coevolutionary algorithm based upon the concept of Pareto optimality. The Pareto criterion is core to conventional multi-objective optimization (MOO) algorithms....

Sevan G. Ficici, Jordan B. Pollack

claim paper

Read More »

101

click to vote

AAAI
1996

118views Intelligent Agents» more AAAI 1996»

Learning Models of Intelligent Agents

15 years 4 months ago

Download www.cs.technion.ac.il

Agents that operate in a multi-agent system need an efficient strategy to handle their encounters with other agents involved. Searching for an optimal interactive strategy is a ha...

David Carmel, Shaul Markovitch

claim paper

Read More »

125

click to vote

ICML
2004
IEEE

197views Machine Learning» more ICML 2004»

Distribution kernels based on moments of counts

16 years 4 months ago

Download www.cs.nyu.edu

Many applications in text and speech processing require the analysis of distributions of variable-length sequences. We recently introduced a general kernel framework, rational ker...

Corinna Cortes, Mehryar Mohri

claim paper

Read More »

138

click to vote

ICML
2010
IEEE

188views Machine Learning» more ICML 2010»

Constructing States for Reinforcement Learning

15 years 1 months ago

Download www.icml2010.org

POMDPs are the models of choice for reinforcement learning (RL) tasks where the environment cannot be observed directly. In many applications we need to learn the POMDP structure ...

M. M. Hassan Mahmud

claim paper

Read More »

« Prev « First page 23 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers