Search Sciweavers | Sciweavers

27 search results - page 5 / 6

» Application of reinforcement learning to the game of Othello

NIPS
1996

134 views, Information Technology

Why did TD-Gammon Work?

13 years 6 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair

EWCBR
2008
Springer

206 views, Automated Reasoning

Forgetting Reinforced Cases

13 years 7 months ago

Download agora.ulaval.ca

To meet time constraints, a CBR system must control the time spent searching in the case base for a solution. In this paper, we present the results of a case study comparing the p...

Houcine Romdhane, Luc Lamontagne

PKDD
2009
Springer

184 views, Data Mining

Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm

13 years 10 months ago

Download www.lri.fr

This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...

Philippe Rolet, Michèle Sebag, Olivier Teyt...

ROMAN
2007
IEEE

150 views, Robotics

Asymmetric Interpretations of Positive and Negative Human Feedback for a Social Learning Agent

13 years 11 months ago

Download robotic.media.mit.edu

The ability for people to interact with robots and teach them new skills will be crucial to the successful application of robots in everyday human environments. In order to des...

Andrea Lockerd Thomaz, Cynthia Breazeal

ECAI
2000
Springer

102 views, Artificial Intelligence

Learning to Use Operational Advice

13 years 9 months ago

Download home.in.tum.de

We address the problem of advice-taking in a given domain, in particular for building a game-playing program. Our approach to solving it strives for the application of machine lea...

Johannes Fürnkranz, Bernhard Pfahringer, Herm...
