Search Sciweavers | Sciweavers

252 search results - page 10 / 51

» Learning Partially Observable Action Models: Efficient Algor...

126

click to vote

CVPR
2008
IEEE

304views Computer Vision» more CVPR 2008»

Context and observation driven latent variable model for human pose estimation

16 years 1 months ago

Download www.fxpal.com

Current approaches to pose estimation and tracking can be classified into two categories: generative and discriminative. While generative approaches can accurately determine human...

Abhinav Gupta, Trista Chen, Francine Chen, Don Kim...

claim paper

Read More »

112

Voted

ATAL
2010
Springer

146views Intelligent Agents» more ATAL 2010»

PAC-MDP learning with knowledge-based admissible models

14 years 11 months ago

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko

claim paper

Read More »

click to vote

ML
2006
ACM

113views Machine Learning» more ML 2006»

Learning to bid in bridge

14 years 11 months ago

Download www.cs.technion.ac.il

Bridge bidding is considered to be one of the most difficult problems for game-playing programs. It involves four agents rather than two, including a cooperative agent. In additio...

Asaf Amit, Shaul Markovitch

claim paper

Read More »

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 13 days ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

103

Voted

ECAI
2010
Springer

238views Artificial Intelligence» more ECAI 2010»

The Dynamics of Multi-Agent Reinforcement Learning

15 years 22 days ago

Download www.doc.ic.ac.uk

Abstract. Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...

Luke Dickens, Krysia Broda, Alessandra Russo

claim paper

Read More »

« Prev « First page 10 / 51 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers