Search Sciweavers | Sciweavers

813 search results - page 103 / 163

» Ensemble Algorithms in Reinforcement Learning

SIGIR
2003
ACM

116 views, Information Technology

ReCoM: reinforcement clustering of multi-type interrelated data objects

15 years 9 months ago

Download research.microsoft.com

Most existing clustering algorithms cluster highly related data objects such as Web pages and Web users separately. The interrelation among different types of data objects is eith...

Jidong Wang, Hua-Jun Zeng, Zheng Chen, Hongjun Lu,...

JMLR
2012

200 views, Programming Languages

Contextual Bandit Learning with Predictable Rewards

13 years 6 months ago

Download www.cs.princeton.edu

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...

ATAL
2004
Springer

102 views, Intelligent Agents

A Pheromone-Based Utility Model for Collaborative Foraging

15 years 9 months ago

Download cs.gmu.edu

Multi-agent research often borrows from biology, where remarkable examples of collective intelligence may be found. One interesting example is ant colonies’ use of pheromones as...

Liviu Panait, Sean Luke

JAIR
2011

144 views

Non-Deterministic Policies in Markovian Decision Processes

14 years 11 months ago

Download www.jair.org

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau

AIPRF
2007

116 views, Artificial Intelligence

Evaluation of Different Approaches to Training a Genre Classifier

15 years 5 months ago

Download dis.ijs.si

This paper presents experiments on classifying web pages by genre. Firstly, a corpus of 1539 manually labeled web pages was prepared. Secondly, 502 genre features were selected ba...

Vedrana Vidulin, Mitja Lustrek, Matjaz Gams
