Search Sciweavers | Sciweavers

40 search results - page 6 / 8

» Learning Partially Observable Action Schemas

ALT
2005
Springer

137 views, Machine Learning

Defensive Universal Learning with Experts

14 years 2 months ago

Download www.idsia.ch

This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedba...

Jan Poland, Marcus Hutter

ML
2006
ACM

113 views, Machine Learning

Learning to bid in bridge

13 years 5 months ago

Download www.cs.technion.ac.il

Bridge bidding is considered to be one of the most difficult problems for game-playing programs. It involves four agents rather than two, including a cooperative agent. In additio...

Asaf Amit, Shaul Markovitch

TSMC
2008

132 views

Ensemble Algorithms in Reinforcement Learning

13 years 5 months ago

Download people.cs.uu.nl

This paper describes several ensemble methods that combine multiple different reinforcement learning (RL) algorithms in a single agent. The aim is to enhance learning speed and fin...

Marco A. Wiering, Hado van Hasselt

AIIDE
2009

297 views, Artificial Intelligence

IMPLANT: An Integrated MDP and POMDP Learning AgeNT for Adaptive Games

13 years 3 months ago

Download www.comp.nus.edu.sg

This paper proposes an Integrated MDP and POMDP Learning AgeNT (IMPLANT) architecture for adaptation in modern games. The modern game world basically involves a human player actin...

Chek Tien Tan, Ho-Lun Cheng

ATAL
2010
Springer

171 views, Intelligent Agents

Closing the learning-planning loop with predictive state representations

13 years 6 months ago

Download www.cs.cmu.edu

A central problem in artificial intelligence is to choose actions to maximize reward in a partially observable, uncertain environment. To do so, we must learn an accurate model of ...

Byron Boots, Sajid M. Siddiqi, Geoffrey J. Gordon
