Search Sciweavers | Sciweavers

2005 search results - page 246 / 401

» Decisive Markov Chains

144

click to vote

FLAIRS
2008

115views Artificial Intelligence» more FLAIRS 2008»

State Space Compression with Predictive Representations

15 years 4 months ago

Download www.aaai.org

Current studies have demonstrated that the representational power of predictive state representations (PSRs) is at least equal to the one of partially observable Markov decision p...

Abdeslam Boularias, Masoumeh T. Izadi, Brahim Chai...

claim paper

Read More »

135

click to vote

ATAL
2008
Springer

138views Intelligent Agents» more ATAL 2008»

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

15 years 3 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

click to vote

AAAI
2010

172views Intelligent Agents» more AAAI 2010»

Using Bisimulation for Policy Transfer in MDPs

15 years 3 months ago

Download www.cs.mcgill.ca

Knowledge transfer has been suggested as a useful approach for solving large Markov Decision Processes. The main idea is to compute a decision-making policy in one environment and...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

click to vote

IJCAI
2007

176views Artificial Intelligence» more IJCAI 2007»

Opponent Modeling in Scrabble

15 years 3 months ago

Download www.ijcai.org

Computers have already eclipsed the level of human play in competitive Scrabble, but there remains room for improvement. In particular, there is much to be gained by incorporating...

Mark Richards, Eyal Amir

claim paper

Read More »

138

click to vote

NIPS
2007

207views Information Technology» more NIPS 2007»

Bayes-Adaptive POMDPs

15 years 3 months ago

Download books.nips.cc

Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

« Prev « First page 246 / 401 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers