Search Sciweavers | Sciweavers

4345 search results - page 120 / 869

» Relational Reinforcement Learning

133

Voted

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

15 years 9 months ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

141

Voted

HICSS
2000
IEEE

134views Biometrics» more HICSS 2000»

Peer-to-Peer Valuation as a Mechanism for Reinforcing Active Learning in Virtual Communities: Actualizing Social Exchange Theory

15 years 7 months ago

Download www.bus.iastate.edu

As knowledge becomes the primary focus of work in many industries, virtual communities and groups are emerging as part of new organizational forms. Within these virtual forms, eff...

Amrit Tiwana, Ashley A. Bush

claim paper

Read More »

132

click to vote

AAAI
2008

199views Intelligent Agents» more AAAI 2008»

Maximum Entropy Inverse Reinforcement Learning

15 years 5 months ago

Download www.andrew.cmu.edu

Recent research has shown the benefit of framing problems of imitation learning as solutions to Markov Decision Problems. This approach reduces learning to the problem of recoveri...

Brian Ziebart, Andrew L. Maas, J. Andrew Bagnell, ...

claim paper

Read More »

108

Voted

NECO
2007

87views more NECO 2007»

Reinforcement Learning State Estimator

15 years 2 months ago

Download www.nc.irp.oist.jp

cal networks in the learning of abstract and effector-specific representations of motor sequences. Neuroimage. 32, 714-727. (Neuroimage Editor’s Choice Award, 2006) Daw, N. D. Do...

Jun Morimoto, Kenji Doya

claim paper

Read More »

160

click to vote

AAAI
2011

202views Intelligent Agents» more AAAI 2011»

Value Function Approximation in Reinforcement Learning Using the Fourier Basis

14 years 3 months ago

Download people.csail.mit.edu

We describe the Fourier Basis, a linear value function approximation scheme based on the Fourier Series. We empirically evaluate its properties, and demonstrate that it performs w...

George Konidaris, Sarah Osentoski, Philip Thomas

claim paper

Read More »

« Prev « First page 120 / 869 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers