Search Sciweavers | Sciweavers

166 search results - page 11 / 34

» Online model learning in adversarial Markov decision process...

Publication

Sparse reward processes

13 years 10 months ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained duri...

Christos Dimitrakakis

ATAL
2005
Springer

Intelligent Agents

Modeling task allocation using a decision theoretic model

15 years 4 months ago

Download dis.cs.umass.edu

Mediation is the process of decomposing a task into subtasks, finding agents suitable for these subtasks and negotiating with agents to obtain commitments to execute these subtas...

Sherief Abdallah, Victor R. Lesser

EMMCVPR
2001
Springer

Computer Vision

A Hierarchical Markov Random Field Model for Figure-Ground Segregation

15 years 3 months ago

Download www-2.cs.cmu.edu

To segregate overlapping objects into depth layers requires the integration of local occlusion cues distributed over the entire image into a global percept. We propose to model thi...

Stella X. Yu, Tai Sing Lee, Takeo Kanade

ICML
2004
IEEE

Machine Learning

Learning low dimensional predictive representations

16 years 3 days ago

Download www.cs.cmu.edu

Predictive state representations (PSRs) have recently been proposed as an alternative to partially observable Markov decision processes (POMDPs) for representing the state of a dy...

Matthew Rosencrantz, Geoffrey J. Gordon, Sebastian...

SOFSEM
2010
Springer

Theoretical Computer Science

Regret Minimization and Job Scheduling

15 years 8 months ago

Download eprints.pascal-network.org

Regret minimization has proven to be a very powerful tool in both computational learning theory and online algorithms. Regret minimization algorithms can guarantee, for a single de...

Yishay Mansour
