Sciweavers

109 search results - page 13 / 22
» Policy teaching through reward function learning
Sort
View
CORR
2010
Springer
171views Education» more  CORR 2010»
14 years 6 months ago
Online Learning in Opportunistic Spectrum Access: A Restless Bandit Approach
We consider an opportunistic spectrum access (OSA) problem where the time-varying condition of each channel (e.g., as a result of random fading or certain primary users' activ...
Cem Tekin, Mingyan Liu
ICML
2003
IEEE
16 years 15 days ago
Principled Methods for Advising Reinforcement Learning Agents
An important issue in reinforcement learning is how to incorporate expert knowledge in a principled manner, especially as we scale up to real-world tasks. In this paper, we presen...
Eric Wiewiora, Garrison W. Cottrell, Charles Elkan
COLING
2010
14 years 6 months ago
Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes
This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...
Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...
IADIS
2004
15 years 1 months ago
Addressing the Effective Use of Learning Objects Through Teacher Education
This paper describes the development and evaluation of a curriculum designed to help teachers learn about and integrate digital library functionalities and learning objects into t...
Mimi Recker
ATAL
2008
Springer
15 years 1 months ago
Teaching multi-robot coordination using demonstration of communication and state sharing
Solutions to complex tasks often require the cooperation of multiple robots, however, developing multi-robot policies can present many challenges. In this work, we introduce teach...
Sonia Chernova, Manuela M. Veloso