Search Sciweavers | Sciweavers

250 search results - page 39 / 50

» Learning action effects in partially observable domains

Voted

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

14 years 11 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

120

click to vote

CVPR
2012
IEEE

238views Computer Vision» more CVPR 2012»

Understanding collective crowd behaviors: Learning a Mixture model of Dynamic pedestrian-Agents

13 years 2 months ago

Download personal.ie.cuhk.edu.hk

In this paper, a new Mixture model of Dynamic pedestrian-Agents (MDA) is proposed to learn the collective behavior patterns of pedestrians in crowded scenes. Collective behaviors ...

Bolei Zhou, Xiaogang Wang, Xiaoou Tang

claim paper

Read More »

Voted

AAAI
2012

215views Intelligent Agents» more AAAI 2012»

POMDPs Make Better Hackers: Accounting for Uncertainty in Penetration Testing

13 years 2 months ago

Download corelabs.coresecurity.com

Penetration Testing is a methodology for assessing network security, by generating and executing possible hacking attacks. Doing so automatically allows for regular and systematic...

Carlos Sarraute, Olivier Buffet, Jörg Hoffman...

claim paper

Read More »

155

click to vote

CSL
2012
Springer

311views Automated Reasoning» more CSL 2012»

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

13 years 7 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurcícek, Blaise Thomson, Steve Young

claim paper

Read More »

click to vote

AIPS
2006

109views Artificial Intelligence» more AIPS 2006»

Fast Probabilistic Planning through Weighted Model Counting

15 years 1 months ago

Download www.aaai.org

We present a new algorithm for probabilistic planning with no observability. Our algorithm, called Probabilistic-FF, extends the heuristic forward-search machinery of Conformant-F...

Carmel Domshlak, Jörg Hoffmann

claim paper

Read More »

« Prev « First page 39 / 50 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers