Search Sciweavers | Sciweavers

704 search results - page 22 / 141

» Learning the Ideal Evaluation Function

143

click to vote

ESANN
2008

278views Neural Networks» more ESANN 2008»

Learning to play Tetris applying reinforcement learning methods

15 years 2 months ago

Download www.dice.ucl.ac.be

In this paper the application of reinforcement learning to Tetris is investigated, particulary the idea of temporal difference learning is applied to estimate the state value funct...

Alexander Groß, Jan Friedland, Friedhelm Sch...

claim paper

Read More »

105

Voted

KDD
1998
ACM

112views Data Mining» more KDD 1998»

Evaluating Usefulness for Dynamic Classification

15 years 4 months ago

Download www.aaai.org

This paper develops the concept of usefulness in the context of supervised learning. We argue that usefulness can be used to improve the performance of classification rules (as me...

Gholamreza Nakhaeizadeh, Charles Taylor, Carsten L...

claim paper

Read More »

103

click to vote

GECCO
2004
Springer

155views Optimization» more GECCO 2004»

Genetic Network Programming with Reinforcement Learning and Its Performance Evaluation

15 years 6 months ago

Download www.cs.york.ac.uk

A new graph-based evolutionary algorithm named “Genetic Network Programming, GNP” has been proposed. GNP represents its solutions as directed graph structures, which can improv...

Shingo Mabu, Kotaro Hirasawa, Jinglu Hu

claim paper

Read More »

click to vote

PRL
2008

82views more PRL 2008»

Optimistic pruning for multiple instance learning

15 years 19 days ago

Download idea.cs.ou.edu

This paper introduces a simple evaluation function for multiple instance learning that admits an optimistic pruning strategy. We demonstrate comparable results to state of the art...

Amy McGovern, David Jensen

claim paper

Read More »

105

click to vote

ML
2002
ACM

154views Machine Learning» more ML 2002»

Technical Update: Least-Squares Temporal Difference Learning

15 years 11 days ago

Download www.research.rutgers.edu

TD() is a popular family of algorithms for approximate policy evaluation in large MDPs. TD() works by incrementally updating the value function after each observed transition. It h...

Justin A. Boyan

claim paper

Read More »

« Prev « First page 22 / 141 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers