Search Sciweavers | Sciweavers

1310 search results - page 67 / 262

» Progressive Optimization in Action

151

click to vote

ICML
2007
IEEE

172views Machine Learning» more ICML 2007»

Conditional random fields for multi-agent reinforcement learning

16 years 5 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...

claim paper

Read More »

145

click to vote

DAGM
2006
Springer

121views Image Processing» more DAGM 2006»

Handling Camera Movement Constraints in Reinforcement Learning Based Active Object Recognition

15 years 8 months ago

Download www5.informatik.uni-erlangen.de

In real world scenes, objects to be classified are usually not visible from every direction, since they are almost always positioned on some kind of opaque plane. When moving a cam...

Christian Derichs, Heinrich Niemann

claim paper

Read More »

142

click to vote

AAAI
1998

129views Intelligent Agents» more AAAI 1998»

Solving Very Large Weakly Coupled Markov Decision Processes

15 years 5 months ago

Download www.cs.toronto.edu

We present a technique for computing approximately optimal solutions to stochastic resource allocation problems modeled as Markov decision processes (MDPs). We exploit two key pro...

Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, L...

claim paper

Read More »

142

click to vote

AAAI
2010

185views Intelligent Agents» more AAAI 2010»

Symbolic Dynamic Programming for First-order POMDPs

15 years 5 months ago

Download www-kd.iai.uni-bonn.de

Partially-observable Markov decision processes (POMDPs) provide a powerful model for sequential decision-making problems with partially-observed state and are known to have (appro...

Scott Sanner, Kristian Kersting

claim paper

Read More »

145

click to vote

IAT
2007
IEEE

92views Intelligent Agents» more IAT 2007»

Noise Tolerance in Reinforcement Learning Algorithms

15 years 10 months ago

Download www.ppgia.pucpr.br

This paper proposes a mechanism of noise tolerance for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning algorithms may receive and accumula...

Richardson Ribeiro, Alessandro L. Koerich, Fabr&ia...

claim paper

Read More »

« Prev « First page 67 / 262 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers