Search Sciweavers | Sciweavers

1233 search results - page 181 / 247

» Reinforcement Learning in MirrorBot

142

click to vote

MICAI
2010
Springer

361views Artificial Intelligence» more MICAI 2010»

Teaching a Robot to Perform Tasks with Voice Commands

15 years 2 months ago

Download ccc.inaoep.mx

The full deployment of service robots in daily activities will require the robot to adapt to the needs of non-expert users, particularly, to learn how to perform new tasks from “...

Ana C. Tenorio-Gonzalez, Eduardo F. Morales, Luis ...

claim paper

Read More »

130

click to vote

ITNG
2007
IEEE

118views Information Technology» more ITNG 2007»

Input Fuzzy Modeling for the Recognition of Handwritten Hindi Numerals

15 years 10 months ago

Download eprints.qut.edu.au

This paper presents the recognition of Handwritten Hindi Numerals based on the modified exponential membership function fitted to the fuzzy sets derived from normalized distance f...

Madasu Hanmandlu, J. Grover, Vamsi Krishna Madasu,...

claim paper

Read More »

138

click to vote

ATAL
2003
Springer

176views Intelligent Agents» more ATAL 2003»

A selection-mutation model for q-learning in multi-agent systems

15 years 9 months ago

Download www.personeel.unimaas.nl

Although well understood in the single-agent framework, the use of traditional reinforcement learning (RL) algorithms in multi-agent systems (MAS) is not always justiﬁed. The fe...

Karl Tuyls, Katja Verbeeck, Tom Lenaerts

claim paper

Read More »

134

click to vote

ICML
2005
IEEE

137views Machine Learning» more ICML 2005»

Learning to compete, compromise, and cooperate in repeated general-sum games

16 years 4 months ago

Download www.mit.edu

Learning algorithms often obtain relatively low average payoffs in repeated general-sum games between other learning agents due to a focus on myopic best-response and one-shot Nas...

Jacob W. Crandall, Michael A. Goodrich

claim paper

Read More »

147

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 4 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

« Prev « First page 181 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers