Search Sciweavers | Sciweavers

3412 search results - page 195 / 683

» Efficient Reinforcement Learning

184

click to vote

SGAI
2010
Springer

226views Artificial Intelligence» more SGAI 2010»

Hierarchical Traces for Reduced NSM Memory Requirements

15 years 1 months ago

Download staff.newport.ac.uk

This paper presents work on using hierarchical long term memory to reduce the memory requirements of nearest sequence memory (NSM) learning, a previously published, instance-based ...

Torbjørn S. Dahl

claim paper

Read More »

148

click to vote

INTERSPEECH
2010

175views Signal Processing» more INTERSPEECH 2010»

Still talking to machines (cognitively speaking)

14 years 10 months ago

Download mi.eng.cam.ac.uk

This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...

Steve Young

claim paper

Read More »

169

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 10 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

137

Voted

FLAIRS
2004

138views Artificial Intelligence» more FLAIRS 2004»

A New Algorithm for Singleton Arc Consistency

15 years 5 months ago

Download www.aaai.org

Constraint satisfaction technology emerged from AI research. Its practical success is based on integration of sophisticated search with consistency techniques reducing the search ...

Roman Barták, Radek Erben

claim paper

Read More »

126

Voted

GECCO
2005
Springer

139views Optimization» more GECCO 2005»

Event-driven learning classifier systems for online soccer games

15 years 9 months ago

Download www.genetic-programming.org

This paper reports on the application of classifier systems to the acquisition of decision-making algorithms for agents in online soccer games. The objective of this research is t...

Yuji Sato, Ryutaro Kanno

claim paper

Read More »

« Prev « First page 195 / 683 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers