Search Sciweavers | Sciweavers

85 search results - page 17 / 17

» Markov Games as a Framework for Multi-Agent Reinforcement Le...

click to vote

ATAL
2008
Springer

124views Intelligent Agents» more ATAL 2008»

Social reward shaping in the prisoner's dilemma

13 years 6 months ago

Download www.aamas-conference.org

Reward shaping is a well-known technique applied to help reinforcement-learning agents converge more quickly to nearoptimal behavior. In this paper, we introduce social reward sha...

Monica Babes, Enrique Munoz de Cote, Michael L. Li...

claim paper

Read More »

click to vote

ATAL
2004
Springer

97views Intelligent Agents» more ATAL 2004»

Unifying Temporal and Structural Credit Assignment Problems

13 years 9 months ago

Download ti.arc.nasa.gov

Single-agent reinforcement learners in time-extended domains and multi-agent systems share a common dilemma known as the credit assignment problem. Multi-agent systems have the st...

Adrian K. Agogino, Kagan Tumer

claim paper

Read More »

click to vote

CL
2000
Springer

156views Automated Reasoning» more CL 2000»

Logic, Knowledge Representation, and Bayesian Decision Theory

13 years 8 months ago

Download people.cs.ubc.ca

In this paper I give a brief overview of recent work on uncertainty inAI, and relate it to logical representations. Bayesian decision theory and logic are both normative frameworks...

David Poole

claim paper

Read More »

click to vote

ICAC
2005
IEEE

108views Applied Computing» more ICAC 2005»

Self-Optimizing Architecture for QoS Provisioning in Differentiated Services

13 years 10 months ago

Download csdl2.computer.org

This paper presents a scalable and self-optimizing architecture for Quality-of-Service (QoS) provisioning in the Differentiated Services (DiffServ) framework. The proposed archite...

Daniel Yagan, Chen-Khong Tham

claim paper

Read More »

click to vote

JAIR
2011

187views more JAIR 2011»

A Monte-Carlo AIXI Approximation

12 years 11 months ago

Download www.hutter1.net

This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...

Joel Veness, Kee Siong Ng, Marcus Hutter, William ...

claim paper

Read More »

« Prev « First page 17 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers