Search Sciweavers | Sciweavers

85 search results - page 14 / 17

» Markov Games as a Framework for Multi-Agent Reinforcement Le...

GECCO
2009
Springer

Apply ant colony optimization to Tetris

Tetris is a falling block game where the player’s objective is to arrange a sequence of different shaped tetrominoes smoothly in order to survive. In the intelligence games, ag...

Xingguo Chen, Hao Wang, Weiwei Wang, Yinghuan Shi,...

JSAC
2010

Online learning in autonomic multi-hop wireless networks for transmitting mission-critical applications

In this paper, we study how to optimize the transmission decisions of nodes aimed at supporting mission-critical applications, such as surveillance, security monitoring,...

Hsien-Po Shiang, Mihaela van der Schaar

AAAI
2012

Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains

We present the first real-world benchmark for sequentially-optimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...

Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...

AAMAS
2010
Springer

Coordinated learning in multiagent MDPs with infinite state-space

In this paper we address the problem of simultaneous learning and coordination in multiagent Markov decision problems (MMDPs) with infinite state-spaces. We separate this ...

Francisco S. Melo, M. Isabel Ribeiro

ICML
2001
IEEE

Off-Policy Temporal Difference Learning with Function Approximation

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
