Sciweavers

355 search results - page 31 / 71
» Online Learning and Exploiting Relational Models in Reinforc...
Sort
View
ATAL
2005
Springer
15 years 3 months ago
Rapid on-line temporal sequence prediction by an adaptive agent
Robust sequence prediction is an essential component of an intelligent agent acting in a dynamic world. We consider the case of near-future event prediction by an online learning ...
Steven Jensen, Daniel Boley, Maria L. Gini, Paul R...
ROMAN
2007
IEEE
179views Robotics» more  ROMAN 2007»
15 years 4 months ago
Online Affect Detection and Adaptation in Robot Assisted Rehabilitation for Children with Autism
–This paper presents a novel affect-sensitive human-robot interaction framework for rehabilitation of children with autism spectrum disorder (ASD) where the robot can detect the ...
Changchun Liu, Karla Conn, Nilanjan Sarkar, Wendy ...
SIGIR
2010
ACM
15 years 1 months ago
How good is a span of terms?: exploiting proximity to improve web retrieval
Ranking search results is a fundamental problem in information retrieval. In this paper we explore whether the use of proximity and phrase information can improve web retrieval ac...
Krysta Marie Svore, Pallika H. Kanani, Nazan Khan
NIPS
2001
14 years 11 months ago
Model-Free Least-Squares Policy Iteration
We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...
Michail G. Lagoudakis, Ronald Parr
GAMEON
2007
14 years 11 months ago
Agent Based Virtual Tutorship and E-Learning Techniques Applied to a Business Game Built on System Dynamics
An advanced Business Game is presented in the paper, built on the methodology of System Dynamics. It can be used for cognitive learning and knowledge transmission in schools and U...
Marco Remondino