Search Sciweavers | Sciweavers

513 search results - page 52 / 103

» Metric learning for reinforcement learning agents

118

Voted

ICANN
2001
Springer

123views Neural Networks» more ICANN 2001»

Market-Based Reinforcement Learning in Partially Observable Worlds

15 years 5 months ago

Download www.hutter1.net

Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...

Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber

claim paper

Read More »

Voted

AAAI
2010

134views Intelligent Agents» more AAAI 2010»

Reinforcement Learning Via Practice and Critique Advice

15 years 2 months ago

Download web.engr.oregonstate.edu

We consider the problem of incorporating end-user advice into reinforcement learning (RL). In our setting, the learner alternates between practicing, where learning is based on ac...

Kshitij Judah, Saikat Roy, Alan Fern, Thomas G. Di...

claim paper

Read More »

Voted

AAAI
2007

117views Intelligent Agents» more AAAI 2007»

Optimizing Anthrax Outbreak Detection Using Reinforcement Learning

15 years 2 months ago

Download www.aaai.org

The potentially catastrophic impact of a bioterrorist attack makes developing effective detection methods essential for public health. In the case of anthrax attack, a delay of ho...

Masoumeh T. Izadi, David L. Buckeridge

claim paper

Read More »

Voted

AAAI
2008

103views Intelligent Agents» more AAAI 2008»

Reinforcement Learning for Vulnerability Assessment in Peer-to-Peer Networks

15 years 2 months ago

Download web.engr.oregonstate.edu

Proactive assessment of computer-network vulnerability to unknown future attacks is an important but unsolved computer security problem where AI techniques have significant impact...

Scott Dejmal, Alan Fern, Thinh Nguyen

claim paper

Read More »

126

click to vote

AAAI
2012

205views Intelligent Agents» more AAAI 2012»

Kernel-Based Reinforcement Learning on Representative States

13 years 2 months ago

Download www.bkveton.com

Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...

Branislav Kveton, Georgios Theocharous

claim paper

Read More »

« Prev « First page 52 / 103 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers