Search Sciweavers | Sciweavers

1233 search results - page 160 / 247

» Reinforcement learning

156

click to vote

ECAI
2006
Springer

194views Artificial Intelligence» more ECAI 2006»

Strategic Foresighted Learning in Competitive Multi-Agent Games

15 years 9 months ago

Download homepages.cwi.nl

We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-agent games. We make the observation that in a competitive setting with adaptive...

Pieter Jan't Hoen, Sander M. Bohte, Han La Poutr&e...

claim paper

Read More »

211

click to vote

JMLR
2012

200views Programming Languages» more JMLR 2012»

Contextual Bandit Learning with Predictable Rewards

13 years 8 months ago

Download www.cs.princeton.edu

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...

claim paper

Read More »

175

click to vote

ICADL
2007
Springer

147views Education» more ICADL 2007»

Feature Reinforcement Approach to Poly-lingual Text Categorization

16 years 8 days ago

Download www.ischool.drexel.edu

With the rapid emergence and proliferation of Internet and the trend of globalization, a tremendous amount of textual documents written in different languages are electronically ac...

Chih-Ping Wei, Huihua Shi, Christopher C. Yang

claim paper

Read More »

209

click to vote

GECCO
2008
Springer

144views Optimization» more GECCO 2008»

Self-adaptive constructivism in Neural XCS and XCSF

15 years 7 months ago

Download www.cems.uwe.ac.uk

For artificial entities to achieve high degrees of autonomy they will need to display appropriate adaptability. In this sense adaptability includes representational flexibility gu...

Gerard David Howard, Larry Bull, Pier Luca Lanzi

claim paper

Read More »

179

click to vote

CORR
2002
Springer

100views Education» more CORR 2002»

A neural model for multi-expert architectures

15 years 5 months ago

Download user.cs.tu-berlin.de

We present a generalization of conventional artificial neural networks that allows for a functional equivalence to multi-expert systems. The new model provides an architectural fr...

Marc Toussaint

claim paper

Read More »

« Prev « First page 160 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers