Sciweavers

10 search results - page 2 / 2
» Policy Transfer using Reward Shaping
ATAL
2008
Springer
13 years 7 months ago
Transfer of task representation in reinforcement learning using policy-based proto-value functions
Reinforcement Learning research is traditionally devoted to solving single-task problems. Therefore, anytime a new task is faced, learning must be restarted from scratch. Recently, ...
Eliseo Ferrante, Alessandro Lazaric, Marcello Rest...
ATAL
2010
Springer
13 years 6 months ago
Combining manual feedback with subsequent MDP reward signals for reinforcement learning
As learning agents move from research labs to the real world, it is increasingly important that human users, including those without programming skills, be able to teach agents de...
W. Bradley Knox, Peter Stone

Publication
165 views
15 years 3 months ago
A Survey of the Use-It-Or-Lose-It Policies for the ABR Service in ATM Networks
The Available Bit Rate (ABR) service has been developed to support data applications over Asynchronous Transfer Mode (ATM). The ABR service uses a closed-loop rate-based traffic ma...
Shivkumar Kalyanaraman, Raj Jain, Rohit Goyal, Son...

Publication
162 views
15 years 4 months ago
Use-it or Lose-it Policies for the Available Bit Rate (ABR) Service in ATM Networks
The Available Bit Rate (ABR) service has been developed to support 21st century data applications over Asynchronous Transfer Mode (ATM). The ABR service uses a closed-loop rate-bas...
Shivkumar Kalyanaraman, Raj Jain, Rohit Goyal, Son...
NIPS
2008
13 years 7 months ago
Goal-directed decision making in prefrontal cortex: a computational framework
Research in animal learning and behavioral neuroscience has distinguished between two forms of action control: a habit-based form, which relies on stored action values, and a goal...
Matthew Botvinick, James An