Search Sciweavers | Sciweavers

453 search results - page 30 / 91

» Learning from actions not taken: a multiagent learning algor...

ACL
2003

97 views, Computational Linguistics

TotalRecall: A Bilingual Concordance for Computer Assisted Translation and Language Learning

15 years 1 month ago

Download acl.ldc.upenn.edu

This paper describes a Web-based English-Chinese concordance system, TotalRecall, developed to promote translation reuse and encourage authentic and idiomatic use in second langua...

Jian-Cheng Wu, Kevin C. Yeh, Thomas C. Chuang, Wen...

ICML
2003
IEEE

151 views, Machine Learning

Hierarchical Policy Gradient Algorithms

16 years 16 days ago

Download www.hpl.hp.com

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

BMCBI
2007

133 views

Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data

14 years 11 months ago

Download www.biomedcentral.com

Background: Gene expression measurements during the development of the fly Drosophila melanogaster are routinely used to find functional modules of temporally co-expressed genes. ...

Ivan G. Costa, Roland Krause, Lennart Opitz, Alexa...

APIN
2004

81 views

Learning Generalized Policies from Planning Examples Using Concept Languages

14 years 11 months ago

Download www.dtic.upf.edu

In this paper we are concerned with the problem of learning how to solve planning problems in one domain given a number of solved instances. This problem is formulated as the probl...

Mario Martin, Hector Geffner

ECML
2007
Springer

192 views, Machine Learning

Policy Gradient Critics

15 years 6 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber
