Search Sciweavers | Sciweavers

We present and derive a new stick-breaking construction of the beta process. The construction is closely related to a special case of the stick-breaking construction of the Dirich...

John William Paisley, Aimee Zaas, Christopher W. W...

ICML
1994
IEEE

Markov Games as a Framework for Multi-Agent Reinforcement Learning

Download www.cs.rutgers.edu

In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....

Michael L. Littman
