Search Sciweavers | Sciweavers

1233 search results - page 73 / 247

» Reinforcement Learning in MirrorBot

Voted

BC
2008

56views more BC 2008»

An implementation of reinforcement learning based on spike timing dependent plasticity

15 years 3 months ago

Download www.proberts.net

Patrick D. Roberts, Roberto A. Santiago, Gerardo L...

claim paper

Read More »

Voted

COLING
2008

108views Computational Linguistics» more COLING 2008»

Hybrid Reinforcement/Supervised Learning of Dialogue Policies from Fixed Data Sets

15 years 3 months ago

Download www.aclweb.org

James Henderson, Oliver Lemon, Kallirroi Georgila

claim paper

Read More »

110

Voted

FGCS
2008

68views more FGCS 2008»

A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments

15 years 3 months ago

Download labs.oracle.com

David Vengerov

claim paper

Read More »

117

Voted

IJAIT
2008

60views more IJAIT 2008»

A Hybrid Multiagent Reinforcement Learning Approach Using Strategies and Fusion

15 years 3 months ago

Download lpis.csd.auth.gr

Ioannis Partalas, Ioannis Feneris, Ioannis P. Vlah...

claim paper

Read More »

132

Voted

CORR
2007
Springer

73views Education» more CORR 2007»

Universal Reinforcement Learning

15 years 3 months ago

Download www.stanford.edu

—We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can inﬂuence futu...

Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...

claim paper

Read More »

« Prev « First page 73 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers