Search Sciweavers | Sciweavers

449 search results - page 25 / 90

» Finding Structure in Reinforcement Learning

GECCO
2006
Springer

177 views, Optimization

Hyper-ellipsoidal conditions in XCS: rotation, linear approximation, and solution structure

15 years 11 months ago

Download www.eskimo.com

The learning classifier system XCS is an iterative rule-learning system that evolves rule structures based on gradient-based prediction and rule quality estimates. Besides classifi...

Martin V. Butz, Pier Luca Lanzi, Stewart W. Wilson

PKDD
2009
Springer

184 views, Data Mining

Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm

15 years 12 months ago

Download www.lri.fr

Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...

Philippe Rolet, Michèle Sebag, Olivier Teyt...

NIPS
2004

172 views, Information Technology

Similarity and Discrimination in Classical Conditioning: A Latent Variable Account

15 years 8 months ago

Download www.cns.nyu.edu

We propose a probabilistic, generative account of configural learning phenomena in classical conditioning. Configural learning experiments probe how animals discriminate and gener...

Aaron C. Courville, Nathaniel D. Daw, David S. Tou...

IROS
2008
IEEE

165 views, Robotics

Mutual development of behavior acquisition and recognition based on value system

16 years 1 month ago

Download www.er.ams.eng.osaka-u.ac.jp

Abstract. Both self-learning architecture (embedded structure) and explicit/implicit teaching from other agents (environmental design issue) are necessary not only for one behavior...

Yasutake Takahashi, Yoshihiro Tamura, Minoru Asada

NIPS
1996

134 views, Information Technology

Why did TD-Gammon Work?

15 years 8 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair
