Search Sciweavers | Sciweavers

166 search results - page 28 / 34

» Online model learning in adversarial Markov decision process...

NOSSDAV
2010
Springer

221views Computer Networks» more NOSSDAV 2010»

RTP-miner: a real-time security framework for RTP fuzzing attacks

15 years 4 months ago

Download nexginrc.org

Real-time Transport Protocol (RTP) is a widely adopted standard for transmission of multimedia traffic in Internet telephony (commonly known as VoIP). Therefore, it is a hot poten...

M. Ali Akbar, Muddassar Farooq

ICML
2002
IEEE

128views Machine Learning» more ICML 2002»

Pruning Improves Heuristic Search for Cost-Sensitive Learning

16 years 3 days ago

Download web.engr.oregonstate.edu

This paper addresses cost-sensitive classification in the setting where there are costs for measuring each attribute as well as costs for misclassification errors. We show how to ...

Valentina Bayer Zubek, Thomas G. Dietterich

ATAL
2010
Springer

115views Intelligent Agents» more ATAL 2010»

Self-organization for coordinating decentralized reinforcement learning

15 years 13 days ago

Download www.cs.umass.edu

Decentralized reinforcement learning (DRL) has been applied to a number of distributed applications. However, one of the main challenges faced by DRL is its convergence. Previous ...

Chongjie Zhang, Victor R. Lesser, Sherief Abdallah

ICML
2007
IEEE

139views Machine Learning» more ICML 2007»

Learning state-action basis functions for hierarchical MDPs

16 years 4 days ago

Download www.machinelearning.org

This paper introduces a new approach to action-value function approximation by learning basis functions from a spectral decomposition of the state-action manifold. This paper exten...

Sarah Osentoski, Sridhar Mahadevan

FLAIRS
2004

140views Artificial Intelligence» more FLAIRS 2004»

State Space Reduction For Hierarchical Reinforcement Learning

15 years 21 days ago

Download ranger.uta.edu

This paper provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as ε-reduction, ...

Mehran Asadi, Manfred Huber
