Search Sciweavers | Sciweavers

» Metric learning for reinforcement learning agents

FLAIRS
2006

109 views, Artificial Intelligence

Refining Human Behavior Models in a Context-based Architecture

15 years 1 month ago

Download www.aaai.org

This paper describes an investigation into the refinement of context-based human behavior models through the use of experiential learning. Specifically, a tactical agent was endow...

David Aihe, Avelino J. Gonzalez

AAAI
2000

147 views, Intelligent Agents

ADVISOR: A Machine Learning Architecture for Intelligent Tutor Construction

15 years 1 month ago

Download www.aaai.org

We have constructed ADVISOR, a two-agent machine learning architecture for intelligent tutoring systems (ITS). The purpose of this architecture is to centralize the reasoning of a...

Joseph Beck, Beverly Park Woolf, Carole R. Beal

ACL
1998

129 views, Computational Linguistics

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for Email

15 years 1 month ago

Download acl.eldoc.ub.rug.nl

This paper describes a novel method by which a dialogue agent can learn to choose an optimal dialogue strategy. While it is widely agreed that dialogue strategies should be formul...

Marilyn A. Walker, Jeanne Frommer, Shrikanth Naray...

IJCNN
2008
IEEE

202 views, Neural Networks

Learning to select relevant perspective in a dynamic environment

15 years 7 months ago

Download www.cs.qub.ac.uk

When an agent observes its environment, there are two important characteristics of the perceived information. One is the relevance of information and the other is redundancy. T...

Zhihui Luo, David A. Bell, Barry McCollum, Qingxia...

ICML
2000
IEEE

153 views, Machine Learning

Eligibility Traces for Off-Policy Policy Evaluation

16 years 1 month ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh
