Search Sciweavers | Sciweavers

81 search results - page 16 / 17

» Partially observable Markov decision processes for spoken di...

click to vote

AAAI
2010

163views Intelligent Agents» more AAAI 2010»

Structured Parameter Elicitation

13 years 6 months ago

Download motion.comp.nus.edu.sg

The behavior of a complex system often depends on parameters whose values are unknown in advance. To operate effectively, an autonomous agent must actively gather information on t...

Li Ling Ko, David Hsu, Wee Sun Lee, Sylvie C. W. O...

claim paper

Read More »

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

13 years 6 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

click to vote

ATAL
2010
Springer

157views Intelligent Agents» more ATAL 2010»

Augmenting appearance-based localization and navigation using belief update

13 years 6 months ago

Download www.aamas-conference.org

Appearance-based localization compares the current image taken from a robot's camera to a set of pre-recorded images in order to estimate the current location of the robot. S...

George Chrysanthakopoulos, Guy Shani

claim paper

Read More »

click to vote

DSN
2009
IEEE

131views Computer Networks» more DSN 2009»

RRE: A game-theoretic intrusion Response and Recovery Engine

13 years 3 months ago

Download netfiles.uiuc.edu

Preserving the availability and integrity of networked computing systems in the face of fast-spreading intrusions requires advances not only in detection algorithms, but also in a...

Saman A. Zonouz, Himanshu Khurana, William H. Sand...

claim paper

Read More »

click to vote

CISS
2008
IEEE

100views Information Technology» more CISS 2008»

Rate adaptation via link-layer feedback for goodput maximization over a time-varying channel

13 years 11 months ago

Download www.ece.osu.edu

Abstract—We consider adapting the transmission rate to maximize the goodput, i.e., the amount of data transmitted without error, over a continuous Markov ﬂat-fading wireless ch...

Rohit Aggarwal, Phil Schniter, Can Emre Koksal

claim paper

Read More »

« Prev « First page 16 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers