Search Sciweavers | Sciweavers

» Learning Partially Observable Action Models: Efficient Algor...

ECML
2005
Springer

120 views, Machine Learning

Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

15 years 5 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...

Masoumeh T. Izadi, Doina Precup

ATAL
2007
Springer

143 views, Intelligent Agents

On discovery and learning of models with predictive representations of state for agents with continuous actions and observations

15 years 3 months ago

Download web.mit.edu

Models of agent-environment interaction that use predictive state representations (PSRs) have mainly focused on the case of discrete observations and actions. The theory of discre...

David Wingate, Satinder P. Singh

ATAL
2010
Springer

171 views, Intelligent Agents

Closing the learning-planning loop with predictive state representations

15 years 21 days ago

Download www.cs.cmu.edu

A central problem in artificial intelligence is to choose actions to maximize reward in a partially observable, uncertain environment. To do so, we must learn an accurate model of ...

Byron Boots, Sajid M. Siddiqi, Geoffrey J. Gordon

ICML
2009
IEEE

226 views, Machine Learning

Large margin training for hidden Markov models with partially observed states

16 years 12 days ago

Download webia.lip6.fr

Large margin learning of Continuous Density HMMs with a partially labeled dataset has been extensively studied in the speech and handwriting recognition fields. Yet due to the non...

Thierry Artières, Trinh Minh Tri Do

AAAI
2012

205 views, Intelligent Agents

Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains

13 years 2 months ago

Download www.intelligence.tuc.gr

We present the first real-world benchmark for sequentially-optimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...

Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...
