Sciweavers

2011 search results - page 211 / 403
» Universal Reinforcement Learning
Sort
View
201
Voted
SGAI
2010
Springer
15 years 3 months ago
Hierarchical Traces for Reduced NSM Memory Requirements
This paper presents work on using hierarchical long term memory to reduce the memory requirements of nearest sequence memory (NSM) learning, a previously published, instance-based ...
Torbjørn S. Dahl
INTERSPEECH
2010
14 years 12 months ago
Still talking to machines (cognitively speaking)
This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...
Steve Young
JMLR
2010
189views more  JMLR 2010»
14 years 12 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
SIGUCCS
2003
ACM
15 years 10 months ago
Leadership by design: collaborations and cornerstones
This paper chronicles the collaborative efforts of Valparaiso University’s IT department and main library over the past five years. Highlights of selected resources and services...
Trisha Mileham, Joyce E. Hicks
141
Voted
GECCO
2005
Springer
139views Optimization» more  GECCO 2005»
15 years 10 months ago
Event-driven learning classifier systems for online soccer games
This paper reports on the application of classifier systems to the acquisition of decision-making algorithms for agents in online soccer games. The objective of this research is t...
Yuji Sato, Ryutaro Kanno