Sciweavers

1455 search results for "Exploiting Myopic Learning" - page 75 / 291
RAS
2010
15 years 1 month ago
Probabilistic Policy Reuse for inter-task transfer learning
Policy Reuse is a reinforcement learning technique that efficiently learns a new policy by using past similar learned policies. The Policy Reuse learner improves its exploration b...
Fernando Fernández, Javier García, M...
CVPR
2011
IEEE
14 years 6 months ago
Multi-label Learning with Incomplete Class Assignments
We consider a special type of multi-label learning where class assignments of training examples are incomplete. As an example, an instance whose true class assignment is (c1, c2, ...
Serhat Bucak, Rong Jin, Anil Jain
AAAI
2011
14 years 2 months ago
Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs
In many multi-agent applications such as distributed sensor nets, a network of agents act collaboratively under uncertainty and local interactions. Networked Distributed POMDP (ND...
Chongjie Zhang, Victor R. Lesser
EACL
2009
ACL Anthology
16 years 3 months ago
Sentiment Summarization: Evaluating and Learning User Preferences
We present the results of a large-scale, end-to-end human evaluation of various sentiment summarization models. The evaluation shows that users have a strong preference for summar...
Kevin Lerman, Sasha Blair-Goldensohn, Ryan T. McDo...
ICTAI
2009
IEEE
15 years 9 months ago
Learning for Dynamic Subsumption
This paper presents an original dynamic subsumption technique for Boolean CNF formulae. It exploits simple and sufficient conditions to detect, during conflict analysis, clauses...
Youssef Hamadi, Saïd Jabbour, Lakhdar Sais