Sciweavers

2108 search results - page 131 / 422

» Tracking in Reinforcement Learning

100

BC
2008

An implementation of reinforcement learning based on spike timing dependent plasticity

15 years 4 months ago

Download www.proberts.net

Patrick D. Roberts, Roberto A. Santiago, Gerardo L...

74

COLING
2008

Hybrid Reinforcement/Supervised Learning of Dialogue Policies from Fixed Data Sets

15 years 4 months ago

Download www.aclweb.org

James Henderson, Oliver Lemon, Kallirroi Georgila

112

FGCS
2008

A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments

15 years 3 months ago

Download labs.oracle.com

David Vengerov

119

IJAIT
2008

A Hybrid Multiagent Reinforcement Learning Approach Using Strategies and Fusion

15 years 3 months ago

Download lpis.csd.auth.gr

Ioannis Partalas, Ioannis Feneris, Ioannis P. Vlah...

133

CORR
2007
Springer

Universal Reinforcement Learning

15 years 3 months ago

Download www.stanford.edu

We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can influence futu...

Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...
