Search Sciweavers | Sciweavers

656 search results - page 103 / 132

» Complexity of finite-horizon Markov decision process problem...

112

click to vote

ICML
2009
IEEE

148views Machine Learning» more ICML 2009»

Predictive representations for policy gradient in POMDPs

16 years 2 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

121

Voted

ICML
2007
IEEE

172views Machine Learning» more ICML 2007»

Conditional random fields for multi-agent reinforcement learning

16 years 2 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...

claim paper

Read More »

110

Voted

ICML
2002
IEEE

128views Machine Learning» more ICML 2002»

Pruning Improves Heuristic Search for Cost-Sensitive Learning

16 years 2 months ago

Download web.engr.oregonstate.edu

This paper addresses cost-sensitive classification in the setting where there are costs for measuring each attribute as well as costs for misclassification errors. We show how to ...

Valentina Bayer Zubek, Thomas G. Dietterich

claim paper

Read More »

112

click to vote

CDC
2008
IEEE

197views Control Systems» more CDC 2008»

Dynamic spectrum access policies for cognitive radio

15 years 8 months ago

Download www.ifp.illinois.edu

—We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooper...

Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli

claim paper

Read More »

104

click to vote

CDC
2008
IEEE

204views Control Systems» more CDC 2008»

Dynamic ping optimization for surveillance in multistatic sonar buoy networks with energy constraints

15 years 8 months ago

Download www.cs.jhu.edu

— In this paper we study the problem of dynamic optimization of ping schedule in an active sonar buoy network deployed to provide persistent surveillance of a littoral area throu...

Anshu Saksena, I-Jeng Wang

claim paper

Read More »

« Prev « First page 103 / 132 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers