Search Sciweavers | Sciweavers

» Metric learning for reinforcement learning agents

AAAI
2007

104 views · Intelligent Agents

Active Imitation Learning

Download www.cs.washington.edu

Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...

Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao

AAAI
2007

68 views · Intelligent Agents

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

Download www.aaai.org

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends a UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz

ICCBR
2009
Springer

134 views · Automated Reasoning

Improving Reinforcement Learning by Using Case Based Heuristics

Download www.iiia.csic.es

This work presents a new approach that allows the use of cases in a case base as heuristics to speed up Reinforcement Learning algorithms, combining Case Based Reasoning (CBR) and ...

Reinaldo A. C. Bianchi, Raquel Ros, Ramon Ló...

PRICAI
2000
Springer

127 views · Artificial Intelligence

Constructing an Autonomous Agent with an Interdependent Heuristics

Download www.ai.sanken.osaka-u.ac.jp

When we construct an agent by integrating modules, there appear troubles concerning the autonomy of the agent if we introduce a heuristics that dominates the whole agent. Thus, we ...

Koichi Moriyama, Masayuki Numao

AAAI
1994

185 views · Intelligent Agents

Learning to Coordinate without Sharing Information

Download www.agent.ai

Researchers in the field of Distributed Artificial Intelligence (DAI) have been developing efficient mechanisms to coordinate the activities of multiple autonomous agents. The need for...

Sandip Sen, Mahendra Sekaran, John Hale
