Search Sciweavers | Sciweavers

813 search results - page 136 / 163

» Ensemble Algorithms in Reinforcement Learning

154

Voted

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

15 years 5 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

133

click to vote

KDD
2009
ACM

150views Data Mining» more KDD 2009»

Information theoretic regularization for semi-supervised boosting

16 years 4 months ago

Download www.cs.wright.edu

We present novel semi-supervised boosting algorithms that incrementally build linear combinations of weak classifiers through generic functional gradient descent using both labele...

Lei Zheng, Shaojun Wang, Yan Liu, Chi-Hoon Lee

claim paper

Read More »

139

click to vote

IAT
2005
IEEE

138views Intelligent Agents» more IAT 2005»

Multiagent Reputation Management to Achieve Robust Software Using Redundancy

15 years 9 months ago

Download www.cse.sc.edu

This paper explains the building of robust software using multiagent reputation. One of the major goals of software engineering is to achieve robust software. Our hypothesis is th...

Rajesh Turlapati, Michael N. Huhns

claim paper

Read More »

143

click to vote

ATAL
2009
Springer

172views Intelligent Agents» more ATAL 2009»

Integrating organizational control into multi-agent learning

15 years 10 months ago

Download www.aamas-conference.org

Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in largescale systems. In this work, we develop an organization-b...

Chongjie Zhang, Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

136

click to vote

ICCS
1993
Springer

99views Applied Computing» more ICCS 1993»

Towards Domain-Independent Machine Intelligence

15 years 8 months ago

Download www.soe.ucsc.edu

Adaptive predictive search (APS), is a learning system framework, which given little initial domain knowledge, increases its decision-making abilities in complex problems domains....

Robert Levinson

claim paper

Read More »

« Prev « First page 136 / 163 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers