Search Sciweavers | Sciweavers

94 search results - page 17 / 19

» Sequential cost-sensitive decision making with reinforcement...

click to vote

NN
2006
Springer

140views Neural Networks» more NN 2006»

Neural mechanism for stochastic behaviour during a competitive game

13 years 5 months ago

Download wanglab.med.yale.edu

Previous studies have shown that non-human primates can generate highly stochastic choice behaviour, especially when this is required during a competitive interaction with another...

Alireza Soltani, Daeyeol Lee, Xiao-Jing Wang

claim paper

Read More »

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

14 years 6 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

click to vote

COLT
2007
Springer

104views Machine Learning» more COLT 2007»

Observational Learning in Random Networks

13 years 12 months ago

Download www.as.inf.ethz.ch

In the standard model of observational learning, n agents sequentially decide between two alternatives a or b, one of which is objectively superior. Their choice is based on a stoc...

Julian Lorenz, Martin Marciniszyn, Angelika Steger

claim paper

Read More »

click to vote

ATAL
2009
Springer

170views Intelligent Agents» more ATAL 2009»

Bounded rationality via recursion

14 years 11 days ago

Download www.ifaamas.org

Current trends in model construction in the ﬁeld of agentbased computational economics base behavior of agents on either game theoretic procedures (e.g. belief learning, ﬁctit...

Maciej Latek, Robert L. Axtell, Bogumil Kaminski

claim paper

Read More »

click to vote

ICMLA
2009

181views Machine Learning» more ICMLA 2009»

Sensitivity Analysis of POMDP Value Functions

13 years 3 months ago

Download www.cs.cmu.edu

In sequential decision making under uncertainty, as in many other modeling endeavors, researchers observe a dynamical system and collect data measuring its behavior over time. The...

Stéphane Ross, Masoumeh T. Izadi, Mark Merc...

claim paper

Read More »

« Prev « First page 17 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers