Search Sciweavers | Sciweavers

1235 search results - page 68 / 247

» Reinforcement learning in a nutshell

122

click to vote

CSE
2009
IEEE

85views Theoretical Computer Science» more CSE 2009»

Reinforcement Learning of Listener Response for Mood Classification of Audio

15 years 11 months ago

Download www.oddible.com

This paper describes a method of applying a reinforcement learning artificial intelligence to categorize audio files by mood based on listener response during a performance. The s...

Jack Stockholm, Philippe Pasquier

claim paper

Read More »

121

click to vote

AI
2006
Springer

103views Artificial Intelligence» more AI 2006»

Trace Equivalence Characterization Through Reinforcement Learning

15 years 8 months ago

Download www2.ift.ulaval.ca

In the context of probabilistic verification, we provide a new notion of trace-equivalence divergence between pairs of Labelled Markov processes. This divergence corresponds to the...

Josee Desharnais, François Laviolette, Kris...

claim paper

Read More »

110

click to vote

AAAI
2007

68views Intelligent Agents» more AAAI 2007»

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

15 years 7 months ago

Download www.aaai.org

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends an UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz

claim paper

Read More »

162

click to vote

ATAL
2008
Springer

176views Intelligent Agents» more ATAL 2008»

Analysis of an evolutionary reinforcement learning method in a multiagent domain

15 years 7 months ago

Download www.aamas-conference.org

Many multiagent problems comprise subtasks which can be considered as reinforcement learning (RL) problems. In addition to classical temporal difference methods, evolutionary algo...

Jan Hendrik Metzen, Mark Edgington, Yohannes Kassa...

claim paper

Read More »

110

click to vote

ICPR
2006
IEEE

260views computer vision» more ICPR 2006»

Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network

16 years 6 months ago

Download ee2.chit.edu.tw

To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy o...

Siwei Luo, Yu Zheng, Ziang Lv

claim paper

Read More »

« Prev « First page 68 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers