Search Sciweavers | Sciweavers

AAAI
2007

Intelligent Agents

Optimizing Anthrax Outbreak Detection Using Reinforcement Learning

15 years 1 month ago

The potentially catastrophic impact of a bioterrorist attack makes developing effective detection methods essential for public health. In the case of anthrax attack, a delay of ho...

Masoumeh T. Izadi, David L. Buckeridge

AAAI
2008

Intelligent Agents

Reinforcement Learning for Vulnerability Assessment in Peer-to-Peer Networks

15 years 1 month ago

Download web.engr.oregonstate.edu

Proactive assessment of computer-network vulnerability to unknown future attacks is an important but unsolved computer security problem where AI techniques have significant impact...

Scott Dejmal, Alan Fern, Thinh Nguyen

ESAW
2008
Springer

Intelligent Agents

Contribution to the Control of a MAS's Global Behaviour: Reinforcement Learning Tools

15 years 19 days ago

Download hal.archives-ouvertes.fr

Reactive multi-agent systems present global behaviours uneasily linked to their local dynamics. When it comes to controlling such a system, usual analytical tools are difficult to ...

François Klein, Christine Bourjot, Vincent ...

AAAI
2006

Intelligent Agents

On the Difficulty of Modular Reinforcement Learning for Real-World Partial Programming

15 years 8 days ago

Download www.cc.gatech.edu

In recent years there has been a great deal of interest in "modular reinforcement learning" (MRL). Typically, problems are decomposed into concurrent subgoals, allowing ...

Sooraj Bhat, Charles Lee Isbell Jr., Michael Matea...

NIPS
2001

Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 7 days ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
