Sciweavers

1176 search results - page 9 / 236
» Sparse reward processes
Sort
View
AAMAS
2011
Springer
14 years 6 months ago
Optimizing coalition formation for tasks with dynamically evolving rewards and nondeterministic action effects
We consider a problem domain where coalitions of agents are formed in order to execute tasks. Each task is assigned at most one coalition of agents, and the coalition can be reorg...
Majid Ali Khan, Damla Turgut, Ladislau Böl&ou...
SIGECOM
2009
ACM
114views ECommerce» more  SIGECOM 2009»
15 years 6 months ago
Policy teaching through reward function learning
Policy teaching considers a Markov Decision Process setting in which an interested party aims to influence an agent’s decisions by providing limited incentives. In this paper, ...
Haoqi Zhang, David C. Parkes, Yiling Chen
SAC
2009
ACM
15 years 4 months ago
Leveraging OWL for GIS interoperability: rewards and pitfalls
Information systems often require combining datasets available in different formats, and geographical information systems are no exception. While semantic technologies have been u...
Serge Boucher, Esteban Zimányi
FOCS
2003
IEEE
15 years 5 months ago
Approximation Algorithms for Orienteering and Discounted-Reward TSP
In this paper, we give the rst constant-factor approximationalgorithmfor the rooted Orienteering problem, as well as a new problem that we call the Discounted-Reward TSP, motivate...
Avrim Blum, Shuchi Chawla, David R. Karger, Terran...
ATAL
2010
Springer
15 years 25 days ago
Combining manual feedback with subsequent MDP reward signals for reinforcement learning
As learning agents move from research labs to the real world, it is increasingly important that human users, including those without programming skills, be able to teach agents de...
W. Bradley Knox, Peter Stone