Search Sciweavers | Sciweavers

449 search results - page 27 / 90

» Finding Structure in Reinforcement Learning

ATAL
2004
Springer

97 views, Intelligent Agents

Unifying Temporal and Structural Credit Assignment Problems

16 years 25 days ago

Download ti.arc.nasa.gov

Single-agent reinforcement learners in time-extended domains and multi-agent systems share a common dilemma known as the credit assignment problem. Multi-agent systems have the st...

Adrian K. Agogino, Kagan Tumer

ACMICEC
2007
ACM

102 views, ECommerce

Learning to trade with insider information

15 years 11 months ago

Download www.cs.rpi.edu

This paper introduces algorithms for learning how to trade using insider (superior) information in Kyle's model of financial markets. Prior results in finance theory relied o...

Sanmay Das

ICML
2003
IEEE

124 views, Machine Learning

Exploration in Metric State Spaces

16 years 8 months ago

Download www.cis.upenn.edu

We present metric-E³, a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

AAMAS
2005
Springer

174 views, Intelligent Agents

Cooperative Multi-Agent Learning: The State of the Art

15 years 7 months ago

Download cs.gmu.edu

Cooperative multi-agent systems are ones in which several agents attempt, through their interaction, to jointly solve tasks or to maximize utility. Due to the interactions among t...

Liviu Panait, Sean Luke

ISNN
2007
Springer

116 views, Neural Networks

Online Dynamic Value System for Machine Learning

16 years 1 month ago

Download www.ent.ohiou.edu

A novel online dynamic value system for machine learning is proposed in this paper. The proposed system has a dual network structure: data processing network (DPN) and information ...

Haibo He, Janusz A. Starzyk
