Search Sciweavers | Sciweavers

148

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

15 years 5 months ago

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

143

click to vote

ATAL
2010
Springer

170views Intelligent Agents» more ATAL 2010»

Joint process games: from ratings to wikis

15 years 5 months ago

Download www.aamas-conference.org

We introduce a game setting called a joint process, where the history of actions determine the state, and the state and agent properties determine the payoff. This setting is a sp...

Michael Munie, Yoav Shoham

claim paper

Read More »

120

click to vote

MM
2010
ACM

123views Multimedia» more MM 2010»

Coming together: negotiated content by multi-agents

15 years 4 months ago

Download www.sfu.ca

In this paper, we describe a software system that generates unique musical compositions in realtime, created by four autonomous multi-agents. Given no explicit musical data, agent...

Arne Eigenfeldt

claim paper

Read More »

124

click to vote

MM
2010
ACM

137views Multimedia» more MM 2010»

Coming together: composition by negotiation

15 years 4 months ago

Download metacreation.net

In this paper, we describe a software system that generates unique musical compositions in realtime, created by four autonomous multi-agents. Given no explicit musical data, agent...

Arne Eigenfeldt

claim paper

Read More »

129

click to vote

CORR
2010
Springer

188views Education» more CORR 2010»

A unified framework for high-dimensional analysis of $M$-estimators with decomposable regularizers

15 years 4 months ago

Download www.eecs.berkeley.edu

High-dimensional statistical inference deals with models in which the the number of parameters p is comparable to or larger than the sample size n. Since it is usually impossible ...

Sahand Negahban, Pradeep Ravikumar, Martin J. Wain...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers