Search Sciweavers | Sciweavers

43 search results - page 8 / 9

» Autonomous Inter-Task Transfer in Reinforcement Learning Dom...

click to vote

AAAI
2007

104views Intelligent Agents» more AAAI 2007»

Active Imitation Learning

13 years 8 months ago

Download www.cs.washington.edu

Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...

Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao

claim paper

Read More »

click to vote

GECCO
2005
Springer

162views Optimization» more GECCO 2005»

An autonomous explore/exploit strategy

13 years 11 months ago

Download www.personal.reading.ac.uk

In reinforcement learning problems it has been considered that neither exploitation nor exploration can be pursued exclusively without failing at the task. The optimal balance bet...

Alex McMahon, Dan Scott, William N. L. Browne

claim paper

Read More »

click to vote

ML
2000
ACM

150views Machine Learning» more ML 2000»

Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web

13 years 5 months ago

Download informatics.indiana.edu

This paper discusses a novel distributed adaptive algorithm and representation used to construct populations of adaptive Web agents. These InfoSpiders browse networked information ...

Filippo Menczer, Richard K. Belew

claim paper

Read More »

click to vote

IAT
2010
IEEE

167views Intelligent Agents» more IAT 2010»

Selecting Operator Queries Using Expected Myopic Gain

13 years 3 months ago

Download www.eecs.umich.edu

When its human operator cannot continuously supervise (much less teleoperate) an agent, the agent should be able to recognize its limitations and ask for help when it risks making...

Robert Cohn, Michael Maxim, Edmund H. Durfee, Sati...

claim paper

Read More »

click to vote

JAIR
2008

148views more JAIR 2008»

Learning Partially Observable Deterministic Action Models

13 years 6 months ago

Download www.jair.org

We present exact algorithms for identifying deterministic-actions' effects and preconditions in dynamic partially observable domains. They apply when one does not know the ac...

Eyal Amir, Allen Chang

claim paper

Read More »

« Prev « First page 8 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers