Search Sciweavers | Sciweavers

360 search results - page 5 / 72

» Learning Evaluation Functions for Large Acyclic Domains

Voted

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

15 years 3 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

click to vote

BMCBI
2006

78views more BMCBI 2006»

An evaluation of human protein-protein interaction data in the public domain

14 years 11 months ago

Download www.biomedcentral.com

Background: Protein-protein interaction (PPI) databases have become a major resource for investigating biological networks and pathways in cells. A number of publicly available re...

Suresh Mathivanan, Balamurugan Periaswamy, T. K. B...

claim paper

Read More »

click to vote

AAAI
2006

108views Intelligent Agents» more AAAI 2006»

Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains

15 years 1 months ago

Download www.eecs.umich.edu

We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...

Vishal Soni, Satinder P. Singh

claim paper

Read More »

click to vote

CORR
2010
Springer

152views Education» more CORR 2010»

Neuroevolutionary optimization

14 years 11 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná

claim paper

Read More »

125

Voted

ACML
2009
Springer

300views Machine Learning» more ACML 2009»

Learning Algorithms for Domain Adaptation

15 years 4 months ago

Download www.cs.cmu.edu

A fundamental assumption for any machine learning task is to have training and test data instances drawn from the same distribution while having a sufﬁciently large number of tra...

Manas A. Pathak, Eric Nyberg

claim paper

Read More »

« Prev « First page 5 / 72 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers