Sciweavers

WSDM
2012
ACM
Selecting actions for resource-bounded information extraction using reinforcement learning
Given a database with missing or uncertain content, our goal is to correct and fill the database by extracting specific information from a large corpus such as the Web, and to d...
Pallika H. Kanani, Andrew K. McCallum
IROS
2008
IEEE
Dynamic correlation matrix based multi-Q learning for a multi-robot system
Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...
Hongliang Guo, Yan Meng
SCAI
2008
Fast Learning in an Actor-Critic Architecture with Reward and Punishment
Abstract. A reinforcement architecture is introduced that consists of three complementary learning systems with different generalization abilities. The ACTOR learns state-action as...
Christian Balkenius, Stefan Winberg