Search Sciweavers | Sciweavers

3412 search results - page 156 / 683

» Efficient Reinforcement Learning

140

Voted

EUSFLAT
2009

140views Fuzzy Logic» more EUSFLAT 2009»

Incremental Possibilistic Approach for Online Clustering and Classification

15 years 1 months ago

Download www.eusflat.org

In this paper, we propose to develop the supervised classification method Fuzzy Pattern Matching to be in addition a non supervised one. The goal is to monitor dynamic systems with...

Moamar Sayed Mouchaweh, Bernard Riera

claim paper

Read More »

122

click to vote

ICML
2005
IEEE

137views Machine Learning» more ICML 2005»

Learning to compete, compromise, and cooperate in repeated general-sum games

16 years 4 months ago

Download www.mit.edu

Learning algorithms often obtain relatively low average payoffs in repeated general-sum games between other learning agents due to a focus on myopic best-response and one-shot Nas...

Jacob W. Crandall, Michael A. Goodrich

claim paper

Read More »

135

Voted

AAAI
2007

104views Intelligent Agents» more AAAI 2007»

Active Imitation Learning

15 years 5 months ago

Download www.cs.washington.edu

Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...

Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao

claim paper

Read More »

102

Voted

DAC
1993
ACM

92views Computer Architecture» more DAC 1993»

A Negative Reinforcement Method for PGA Routing

15 years 7 months ago

Download www.cs.uky.edu

We present an efficient and effective method for the detailed routing of symmetrical or sea-of-gates FPGA architectures. Instead of breaking the problem into 2-terminal net collec...

Forbes D. Lewis, Wang Chia-Chi Pong

claim paper

Read More »

140

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 4 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

« Prev « First page 156 / 683 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers