Sciweavers

453 search results - page 30 / 91
» Learning from actions not taken: a multiagent learning algor...
Sort
View
82
Voted
ACL
2003
14 years 11 months ago
TotalRecall: A Bilingual Concordance for Computer Assisted Translation and Language Learning
This paper describes a Web-based English-Chinese concordance system, TotalRecall, developed to promote translation reuse and encourage authentic and idiomatic use in second langua...
Jian-Cheng Wu, Kevin C. Yeh, Thomas C. Chuang, Wen...
ICML
2003
IEEE
15 years 10 months ago
Hierarchical Policy Gradient Algorithms
Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...
Mohammad Ghavamzadeh, Sridhar Mahadevan
BMCBI
2007
133views more  BMCBI 2007»
14 years 9 months ago
Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data
Background: Gene expression measurements during the development of the fly Drosophila melanogaster are routinely used to find functional modules of temporally co-expressed genes. ...
Ivan G. Costa, Roland Krause, Lennart Opitz, Alexa...
APIN
2004
81views more  APIN 2004»
14 years 9 months ago
Learning Generalized Policies from Planning Examples Using Concept Languages
In this paper we are concerned with the problem of learning how to solve planning problems in one domain given a number of solved instances. This problem is formulated as the probl...
Mario Martin, Hector Geffner
ECML
2007
Springer
15 years 3 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber